Stars
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation
[Arxiv 2025] Distilled-3DGS: Distilled 3D Gaussian Splatting
4DNeX: Feed-Forward 4D Generative Modeling Made Easy
Enjoy the magic of Diffusion models!
A curated list of awesome papers for reconstructing 4D spatial intelligence from video. (arXiv 2507.21045)
Foundation Models and Data for Human-Human and Human-AI interactions.
[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds
We provide a way to fuse MANO parameters into SMPLX.
(CVPR 2025) DoF-Gaussian: Controllable Depth-of-Field for 3D Gaussian Splatting
[TPAMI 2023] PyMAF-X: Towards Well-aligned Full-body Model Regression from Monocular Images
HaMeR: Reconstructing Hands in 3D with Transformers
WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild
[ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
[CVPR25 Oral (Top 3.3%)] Official code for paper "Reconstructing Humans with a Biomechanically Accurate Skeleton".
Wan: Open and Advanced Large-Scale Video Generative Models
[ICCV 2025] Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency
Unified framework for robot learning built on NVIDIA Isaac Sim
Code for SPAD : Spatially Aware Multiview Diffusers, CVPR 2024
Code for "HumanGif: Single-View Human Diffusion with Generative Prior"
[CVPR 2025] WildAvatar: Learning In-the-wild 3D Avatars from the Web
[ECCV 2024] MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo
[TPAMI' 2022'] MPS-NeRF: Generalizable 3D Human Rendering from Multiview Images
[3DV 2024] Official Repository for "TADA! Text to Animatable Digital Avatars".
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
Code for "SAM-guided Graph Cut for 3D Instance Segmentation" ECCV 2024
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models