Tarsier -- a family of large-scale video-language models, which is designed to generate high-quality video descriptions , together with good capability of general video understanding.

Python 549 31 Updated Aug 14, 2025

FoundationVision / Infinity

[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,574 93 Updated Apr 16, 2026

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 5,018 373 Updated Apr 6, 2026

NVlabs / QLIP

[arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation

Jupyter Notebook 97 3 Updated Mar 1, 2025

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 13,170 1,585 Updated Feb 27, 2026

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 26,327 2,444 Updated Apr 2, 2026

yunlong10 / Awesome-LLMs-for-Video-Understanding

🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.

3,210 144 Updated Jun 13, 2026

EvolvingLMMs-Lab / open-r1-multimodal

A fork to add multimodal model training to open-r1

Python 1,568 72 Updated Feb 8, 2025

deepseek-ai / DeepSeek-R1

91,987 11,719 Updated Jun 27, 2025

NVIDIA / cosmos

NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomous vehicles, smart infrastructure, and more.

Jupyter Notebook 10,306 679 Updated Jun 17, 2026

qiulu66 / EgoPlan-Bench2

Jupyter Notebook 31 1 Updated Apr 11, 2025

TencentARC / Divot

Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)

Python 87 3 Updated Feb 27, 2025

TencentARC / Moto

[ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos

Python 179 8 Updated Oct 1, 2025

TencentARC / FluxKits

Python 110 9 Updated Nov 27, 2024

ttengwang / Awesome_Long_Form_Video_Understanding

Awesome papers & datasets specifically focused on long-term videos.

380 14 Updated Oct 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yixiao Ge yxgeee

Achievements

Achievements

Highlights

Organizations

Block or report yxgeee

Stars

xpeng-robotics / UniT

xpeng-robotics / DIAL

TencentARC / ARC-Chapter

TencentARC / ARC-Hunyuan-Video-7B

Kwai-Keye / Keye

TencentARC / GRPO-CARE

TencentARC / MindOmni

AIDC-AI / Awesome-Unified-Multimodal-Models

TencentARC / TokLIP

Tencent / HaploVLM

TencentARC / Video-Holmes

TencentARC / AnimeGamer

TencentARC / SEED-Bench-R1

TrajectoryCrafter / TrajectoryCrafter

OpenGVLab / InternVideo

bytedance / tarsier