Stars
[TVCG2024] PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction
Fast SAM 3D Body: Accelerating SAM 3D Body for Real-Time Full-Body Human Mesh Recovery
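Mastering OpenClaw from scratch: the most comprehensive Chinese-language tutorial, covering installation, configuration, hands-on case studies, and a guide to avoiding common pitfalls (GitHub edition)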
[CVPR 2026] FaceCam: Portrait Video Camera Control via Scale-Aware Conditioning
📹 A more flexible framework that can generate videos at any resolution and create videos from images.
[CVPR 2026] tttLRM: Test-Time Training for Long Context and Autoregressive 3D Reconstruction
🧙🏻‍♂️ A curated list of papers for diving into Awesome Radiance Field-based 3D Editing.
[ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos
Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.
Atom3d, atomising geometry, is a mesh processing toolbox specifically designed for 3D learning.
Sharp Monocular View Synthesis in Less Than a Second
Native and Compact Structured Latents for 3D Generation
[NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance
📷 [CVPR'26] Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control!
Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"
Official inference repo for FLUX.2 models
HunyuanVideo-1.5: A leading lightweight video generation model
The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
Krea Realtime 14B. An open-source realtime AI video model.
[ICLR 2026] pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
MPMAvatar: Learning 3D Gaussian Avatars with Accurate and Robust Physics-Based Dynamics (NeurIPS 2025)
HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning
【Accepted by TPAMI】Human Motion Video Generation: A Survey (https://ieeexplore.ieee.org/document/11106267)