Skip to content
View kdplus's full-sized avatar
⛩️
⛩️

Highlights

  • Pro

Organizations

@dyweb

Block or report kdplus

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence

Python 1,433 186 Updated Sep 26, 2025

try add simple classification loss first

Python 5 Updated Mar 31, 2025

✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models

Python 164 7 Updated Dec 26, 2024

slime is an LLM post-training framework for RL Scaling.

Python 4,194 541 Updated Feb 14, 2026

Official implementation of project Honeybee (CVPR 2024)

Python 465 22 Updated May 10, 2024

💫 Toolkit to help you get started with Spec-Driven Development

Python 69,909 6,032 Updated Feb 12, 2026

Code for paper titled, "Learning to Predict Task Progress by Self-Supervised Video Alignment" by Gerard Donahue and Ehsan Elhamifar, published at CVPR 2024.

Python 16 2 Updated Jul 26, 2024

[ICLR 2026] RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning.

Python 5,611 659 Updated Feb 14, 2026

Anki is a smart spaced repetition flashcard program

Rust 26,444 2,809 Updated Feb 13, 2026

[ICCV 2025] Implementation of the paper "Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs"

Python 69 3 Updated Oct 25, 2025

This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video""

Python 95 12 Updated May 17, 2024

End-to-End, Single-Stream Temporal Action Detection in Untrimmed Videos (Official Repo for SS-TAD)

Python 108 23 Updated Oct 12, 2017

[ECCV 2024] "Elucidating the Hierarchical Nature of Behavior with Masked Autoencoders"

Python 28 3 Updated Nov 13, 2025
Python 10 1 Updated Jan 26, 2025

A unified inference and post-training framework for accelerated video generation.

Python 3,088 264 Updated Feb 15, 2026
Python 142 31 Updated Apr 28, 2022

The official PyTorch implementation of the IEEE/CVF Computer Vision and Pattern Recognition (CVPR) '24 paper PREGO: online mistake detection in PRocedural EGOcentric videos.

Python 31 4 Updated Jun 9, 2025
Python 37 2 Updated Mar 22, 2024

[ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-from user instructions.

Python 210 2 Updated May 5, 2025

[ECCV 2024 & NeurIPS 2024 & ICLR 2026] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3

271 15 Updated Feb 10, 2026

[CVPR 2024] Official implementation of "VRP-SAM: SAM with Visual Reference Prompt"

Python 176 20 Updated Sep 27, 2024

[ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"

Python 461 40 Updated Apr 27, 2024

Painter & SegGPT Series: Vision Foundation Models from BAAI

Python 2,592 180 Updated Dec 6, 2024

(2024CVPR) MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Python 345 30 Updated Jul 19, 2024

Code for Diffusion Action Segmentation (ICCV 2023)

Python 73 12 Updated Aug 16, 2023

The official implementation of Error Detection in Egocentric Procedural Task Videos

Python 21 5 Updated Sep 20, 2025

Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation

570 42 Updated Jan 30, 2026

[CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation

Python 64 5 Updated Dec 23, 2024
Next