- Ant Group
- Hangzhou, China
- https://scholar.google.com/citations?hl=en&user=VRsy9v8AAAAJ
Starred repositories
A feed-forward 3D foundation model for reconstructing scenes from streaming data
Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory
Information collection for the Happy Horse AI video generator model. Official demo and updates at happyhorses.io.
RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
Memorize-and-Generate: Towards Long-Term Consistency in Real-Time Video Generation
The repo is finally unlocked. Enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.
A plug-and-play compiler that delivers free-lunch optimizations for both inference and training.
A Curated List of Awesome Video World Models with AR Diffusion: Covering Algorithms, Applications, and Infrastructure, Aimed at Serving as a Comprehensive Resource for Researchers, Practitioners, a…
Seoul World Model: Grounding World Simulation Models in a Real-World Metropolis
Claude Code VS Code extension patched for Force Local mode — run CLI locally, proxy file ops to remote server via VS Code Remote SSH
Helios: Real Real-Time Long Video Generation Model
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
A community collection of OpenClaw use cases for making life easier.
MOVA: Towards Scalable and Synchronized Video–Audio Generation
Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory
Causal video-action world model for generalist robot control
Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals
Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation"
Masked Depth Modeling for Spatial Perception
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation.
Spirit-v1.5: A Robotic Foundation Model by Spirit AI
[ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos
[NeurIPS 2025 Oral] Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer