Lists (5)
Sort Name ascending (A-Z)
Stars
This project is the official implementation of "UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation"
🌟100+ 原创 LLM / RL 原理图📚,《大模型算法》作者巨献!💥(100+ LLM/RL Algorithm Maps )
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…
LongLive: Real-time Interactive Long Video Generation
Enjoy the magic of Diffusion models!
The official repository of "Astra : General Interactive World Model with Autoregressive Denoising"
[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"
An unofficial pytorch implementation of ReconViaGen
A pipeline parallel training script for diffusion models.
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition
MAGI-1: Autoregressive Video Generation at Scale
WoW (World-Omniscient World Model) is a generative world model trained on 2 million robotic interaction trajectories, designed to imagine, reason, and act in the physical world. Unlike passive vide…
[ICLR'24] GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time
[ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"
【三年面试五年模拟】AIGC算法工程师面试秘籍。涵盖AIGC、传统深度学习、自动驾驶、AI Agent、机器学习、计算机视觉、自然语言处理、强化学习、大数据挖掘、具身智能、元宇宙、AGI等AI行业面试笔试干货经验与核心知识。
The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…
A procedural Blender pipeline for photorealistic training image generation
[NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation
Stable Virtual Camera: Generative View Synthesis with Diffusion Models