Stars
rCM & Causal-rCM: Leading and Unified Algorithms/Infrastructures for Bidirectional/Autoregressive Video Diffusion Distillation at Scale
Talk to research papers like talking to authors - Python package with AI agent for arXiv papers
Open-source, community-driven agent harness
Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex
Generative World Renderer: an AI-native Renderer for Games and Virtual Worlds. 面向游戏与虚拟世界的AI原生渲染引擎
EVA: Aligning Video World Models with Executable Robot Actions via Inverse Dynamics Rewards
DreamWorld: Unified World Modeling in Video Generation
[NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models
Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu — one CLI, zero API fees.
PyTorch code and models for VJEPA2 self-supervised learning from video.
This repository is the collection of World model Papers
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…
Gen-Searcher: Reinforcing Agentic Search for Image Generation
[ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"
Simplifying diffusion/flow policies by treating action trajectories as flow trajectories
🔥 A curated roadmap to the Efficient VLA landscape. We’re keeping this list live—contribute your latest work!
Flash Attention implementatio with attention score
The official implementation of VLA-Pruner: Temporal-Aware Dual-Level Visual Token Pruning for Efficient Vision-Language-Action Inference.
A curated list of visual reinforcement learning resources
[NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving"
DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving (ICLR 2026)
Paper list for Efficient Reasoning.
SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]