Stars
Official code for "LagerNVS Latent Geometry for Fully Neural Real-time Novel View Synthesis" (CVPR 2026)
Our method reconstructs 3D worlds from video diffusion models using non-rigid alignment to resolve inherent 3D inconsistencies in the generated sequences.
[CVPR 2025] MaskGaussian: Adaptive 3D Gaussian Representation from Probabilistic Masks
File format for 3D Gaussian splats. About 10x smaller than the PLY equivalent with virtually no perceptible loss in visual quality. Offered as open source by Niantic Labs. More details at https://s…
Official code for paper: "RayRoPE: Projective Ray Positional Encoding for Multi-view Attention"
Pytorch implementation of Pano2Room (SIGGRAPH Asia 2024)
[CVPR 2026] InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields
[CVPR 2026] Gen3R: 3D Scene Generation Meets Feed-Forward Reconstruction
Official implementation of "Depth Anything in 360°: Towards Scale Invariance in the Wild".
[CVPR 2026 Highlight] NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
📷 [CVPR'26] Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control!
Sharp Monocular View Synthesis in Less Than a Second
HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency
"MoCA: Mixture-of-Components Attention for Scalable Compositional 3D Generation"
Muskie: Multi-view Masked Image Modeling for 3D Vision Pre-training
Industry-standard navigation-mesh toolset for games
HunyuanVideo-1.5: A leading lightweight video generation model
Official implementation of "Perception-as-Control: Fine-grained Controllable Image Animation with 3D-aware Motion Representation" (ICCV 2025)
[ICLR 2026] Generative View Stitching
Fast and Universal 3D reconstruction model for versatile tasks
[ICLR 2026] pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation
Code for "FlashWorld: High-quality 3D Scene Generation within Seconds" (ICLR 2026 Oral)