Highlights
- Pro
Starred repositories
🔥(CVPR 2025 Highlight) Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera
Masked Depth Modeling for Spatial Perception
Official implementation of "VideoMaMa: Mask-Guided Video Matting via Generative Prior"
Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control
Code for "InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields"
HY-Motion model for 3D human motion or 3D character animation generation.
Lossy PNG compressor — pngquant command based on libimagequant library
Long-horizon, spatially consistent video generation enabled by persistent 3D scene point clouds and dynamic-static disentanglement.
Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional de…
Code for ICCV 2021 paper "HuMoR: 3D Human Motion Model for Robust Pose Estimation"
Official repository for the paper "CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models"
An unified model for 4D human-scene reconstruction
[CVPR 2025] Official code for Using Diffusion Priors for Video Amodal Segmentation
4DHumans: Reconstructing and Tracking Humans with Transformers
An interactive star field background effect with html canvas
A lightweight JavaScript library for creating particles
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
[CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
Sharp Monocular View Synthesis in Less Than a Second
Momentum Human Rig is an anatomically-inspired parametric full-body digital human model developed at Meta. It includes: A parametric body skeletal model; A realistic 3D mesh skinned to the skeleton…
A modern static site generator by the Material for MkDocs team
HunyuanVideo-1.5: A leading lightweight video generation model