Stars
A curated list of papers on reinforcement learning for video generation
Generative World Renderer: an AI-native Renderer for Games and Virtual Worlds. 面向游戏与虚拟世界的AI原生渲染引擎
Segment Any Concept via Meta-Reinforcement Learning
Interactive World Model papers organized by core research challenges.
Official implementation of the paper: Sub-JEPA: Subspace Gaussian Regularization for Stable End-to-End World Models.
A curated, continuously updated reading list, paper blogs, and resources for World Action Models (WAMs) in embodied AI.
This is the official PyTorch codes for the paper: "Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution"
[CVPR26 Oral] MagicBokeh is the first unified method specifically designed for high-zoom bokeh rendering.
A Minimal and Elegant Framework & Tutorial for Real-Time Interactive World Models
Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds
An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.
official github code for "SmartPhotoCrafter: Unified Reasoning, Generation and Optimization for Automatic Photographic Image Editing"
[ICLR 26 Oral] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…
Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.
[ICML 2026] Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation" & Causal Forcing++
MOVA: Towards Scalable and Synchronized Video–Audio Generation
A Curated List of Awesome Video World Models with AR Diffusion: Covering Algorithms, Applications, and Infrastructure, Aimed at Serving as a Comprehensive Resource for Researchers, Practitioners, a…
Create beautiful slides on the web using a coding agent's frontend skills
[CVPR 2026] ZipMap: Linear-Time Stateful 3D Reconstruction via Test-Time Training
Anchor Forcing is a cache-centric framework for interactive streaming video generation that preserves visual quality and coherent motion across prompt switches
Real-Time Physical Action-Conditioned Video Generation
AI agents running research on single-GPU nanochat training automatically
Democratizing AI scientists with ToolUniverse
A paper list for spatial reasoning