Skip to content
View Shuaizhang7's full-sized avatar

Block or report Shuaizhang7

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[SIGGRAPH2026] Official code for SIGGRAPH2026 paper: R-DMesh: Video-Guided 3D Animation via Rectified Dynamic Mesh Flow

Python 18 3 Updated May 14, 2026

[NeurIPS 2025] ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGS

Python 172 2 Updated Mar 11, 2026

[NeurIPS 2025 (Spotlight)] The implementation for the paper "4DGT Learning a 4D Gaussian Transformer Using Real-World Monocular Videos"

Python 441 11 Updated Sep 19, 2025

[ICLR 2026] StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams

Python 117 9 Updated Mar 11, 2026

Syn4D: A Multiview Synthetic 4D Dataset

72 1 Updated May 7, 2026

Official Implementation of CoInteract: Spatially-Structured Co-Generation for Interactive Human-Object Video Synthesis

Python 151 9 Updated May 7, 2026

[ICML 2026] World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Python 353 13 Updated May 1, 2026

[ICLR'25 Oral] No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images

Python 954 50 Updated Feb 25, 2026

[ECCV 2022] An End-to-End Transformer Model for Crowd Localization

Python 114 12 Updated Mar 20, 2023

Official code, models, and data for Vista4D: Video Reshooting with 4D Point Clouds (CVPR 2026 Highlight)

Python 476 36 Updated May 6, 2026

[NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models

Python 189 8 Updated Mar 6, 2026

[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Python 2,466 181 Updated Nov 2, 2025

[ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning

Python 1,922 148 Updated May 9, 2026

A Curated List of Awesome Video World Models with AR Diffusion: Covering Algorithms, Applications, and Infrastructure, Aimed at Serving as a Comprehensive Resource for Researchers, Practitioners, a…

TeX 530 18 Updated May 15, 2026

AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model

Python 335 17 Updated May 13, 2026

UnrealCV: Connecting Computer Vision to Unreal Engine

Python 2,178 461 Updated Apr 15, 2026

[ICCV 2025 Highlights] Large-scale photo-realistic virtual worlds for embodied AI

Python 319 23 Updated Apr 15, 2026

Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model [ICLR2026]

Python 227 12 Updated Mar 12, 2026

Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?

Python 712 65 Updated Apr 3, 2026

AI agents running research on single-GPU nanochat training automatically

Python 81,242 11,820 Updated Mar 26, 2026

[CVPR '26] SceneTok: A Compressed, Diffusable Token Space for 3D Scenes

Python 177 7 Updated Apr 20, 2026

UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving

Python 168 13 Updated Apr 14, 2026

Process ScanNet with Python 3

Python 10 Updated Sep 14, 2025

TartanAir dataset tools and samples

Jupyter Notebook 411 44 Updated Apr 30, 2026

A Python package for the TartanAir-V2 dataset.

Python 110 19 Updated Apr 16, 2026

Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding

Python 1,998 149 Updated Jan 9, 2026

[ICCV 25] Seeing and Seeing Through the Glass: Real and Synthetic Data for Multi-Layer Depth Estimation

C++ 39 1 Updated Dec 16, 2025

VGGSfM: Visual Geometry Grounded Deep Structure From Motion

Python 1,384 113 Updated Mar 11, 2025

open-sourced video dataset with dynamic scenes and camera movements annotation

Python 92 1 Updated Apr 24, 2025

A toolkit for computing Fréchet Inception Distance (FID) & Fréchet Video Distance (FVD) metrics.

Python 46 9 Updated May 29, 2025
Next