Skip to content
View hehao13's full-sized avatar

Highlights

  • Pro

Block or report hehao13

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Cosmos-Reason2 models understand the physical common sense and generate appropriate embodied decisions in natural language through long chain-of-thought reasoning processes.

Python 345 73 Updated Feb 12, 2026
Python 249 7 Updated Apr 3, 2026
Python 201 28 Updated Mar 13, 2023

A simulation platform for versatile Embodied AI research and developments.

Python 1,235 76 Updated Sep 4, 2025

Claude Code CLI integration for Unreal Engine 5.7 - Get AI coding assistance with built-in UE5.7 documentation context directly in the editor.

C++ 489 76 Updated Apr 7, 2026

Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation

Python 260 15 Updated Dec 9, 2025

[CVPR 2025 Highlight] Official implementation of the solvers and estimators proposed in the paper "Relative Pose Estimation through Affine Corrections of Monocular Depth Priors"

C++ 232 15 Updated Apr 8, 2025

VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.

Python 49 1 Updated Mar 16, 2026

ViPE: Video Pose Engine for Geometric 3D Perception

Python 1,848 145 Updated Jan 1, 2026

A Pragmatic VLA Foundation Model

Python 1,045 90 Updated Mar 12, 2026

Masked Depth Modeling for Spatial Perception

Python 1,034 79 Updated Apr 14, 2026

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 4,040 678 Updated Apr 10, 2026

[ICLR 2026] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence

Python 84 2 Updated Apr 14, 2026

[ECCV'20] Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling

Python 659 73 Updated Feb 24, 2025

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 779 49 Updated Apr 8, 2026

[CVPR 2026] G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

Python 310 9 Updated Mar 24, 2026

Python package for the evaluation of odometry and SLAM

Python 4,184 789 Updated Apr 8, 2026

[ICCV 2023, Official Code] for paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives". Official Weights and Demos provided.

Jupyter Notebook 498 50 Updated Aug 12, 2024

SAM 3D Objects

Python 6,439 765 Updated Mar 12, 2026

Depth Anything 3

Python 4,982 520 Updated Mar 21, 2026

MAGI-1: Autoregressive Video Generation at Scale

Python 3,676 236 Updated Jun 17, 2025

Visual Spatial Tuning

Jupyter Notebook 195 8 Updated Mar 25, 2026

[ICLR 2026] Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time

Python 369 18 Updated Oct 31, 2025

[ICLR 2026] LongLive: Real-time Interactive Long Video Generation

Python 1,157 106 Updated Feb 26, 2026

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

Python 652 51 Updated Feb 26, 2026

Public code for XFactor: Introduces the first geometry-free model to achieve true self-supervised / pose-free Novel View Synthesis (NVS) by learning transferable latent camera pose representations.

Python 145 2 Updated Mar 25, 2026

Native Multimodal Models are World Learners

Python 1,496 61 Updated Dec 30, 2025

ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)

Python 1,565 47 Updated Nov 22, 2023

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,299 70 Updated Mar 5, 2025

[ICLR 2026 Oral (top 1.2%)] Official implementation of DepthLM

Python 326 15 Updated Mar 2, 2026
Next