Stars
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Wan: Open and Advanced Large-Scale Video Generative Models
Wan: Open and Advanced Large-Scale Video Generative Models
Enjoy the magic of Diffusion models!
Unified framework for robot learning built on NVIDIA Isaac Sim
Muzic: Music Understanding and Generation with Artificial Intelligence
[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds
Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"
[CVPR 2024] 4K4D: Real-Time 4D View Synthesis at 4K Resolution
(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
[ICLR 2026] LongLive: Real-time Interactive Long Video Generation
[SIGGRAPH 2022 Journal Track] AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars
HaMeR: Reconstructing Hands in 3D with Transformers
4DNeX: Feed-Forward 4D Generative Modeling Made Easy
HumanNeRF turns a monocular video of moving people into a 360 free-viewpoint video.
[CVPR25 Oral (Top 3.3%)] Official code for paper "Reconstructing Humans with a Biomechanically Accurate Skeleton".
[ICLR 2023 Spotlight] EVA3D: Compositional 3D Human Generation from 2D Image Collections
[ECCV 2024] MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo
[TPAMI 2024 & CVPR 2023] PyTorch code for DGM4: Detecting and Grounding Multi-Modal Media Manipulation and beyond
WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild
Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the corresponding mask within the image.
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
Code repository for Part Grouping Network, ECCV 2018
[ICLR 2024] Code for FreeNoise based on VideoCrafter