hehao13

hehao hehao13

Ph.D. student at MMLab, CUHK

63 followers · 45 following

CUHK
Hong Kong
https://hehao13.github.io
https://scholar.google.com/citations?user=kdbmt6QAAAAJ&hl=en

Achievements

Highlights

Stars

nvidia-cosmos / cosmos-reason2

Cosmos-Reason2 models understand the physical common sense and generate appropriate embodied decisions in natural language through long chain-of-thought reasoning processes.

Python 345 73 Updated Feb 12, 2026

amap-cvlab / ABot-PhysWorld

Python 249 7 Updated Apr 3, 2026

fabbrimatteo / JTA-Dataset

Python 201 28 Updated Mar 13, 2023

InternRobotics / InternUtopia

A simulation platform for versatile Embodied AI research and developments.

Python 1,235 76 Updated Sep 4, 2025

Natfii / UnrealClaude

Claude Code CLI integration for Unreal Engine 5.7 - Get AI coding assistance with built-in UE5.7 documentation context directly in the editor.

C++ 489 76 Updated Apr 7, 2026

haoyi-duan / WorldScore

Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation

Python 260 15 Updated Dec 9, 2025

MarkYu98 / madpose

[CVPR 2025 Highlight] Official implementation of the solvers and estimators proposed in the paper "Relative Pose Estimation through Affine Corrections of Monocular Depth Priors"

C++ 232 15 Updated Apr 8, 2025

Hongyang-Du / VideoGPA

VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.

Python 49 1 Updated Mar 16, 2026

nv-tlabs / vipe

ViPE: Video Pose Engine for Geometric 3D Perception

Python 1,848 145 Updated Jan 1, 2026

Robbyant / lingbot-vla

A Pragmatic VLA Foundation Model

Python 1,045 90 Updated Mar 12, 2026

Robbyant / lingbot-depth

Masked Depth Modeling for Spatial Perception

Python 1,034 79 Updated Apr 14, 2026

open-compass / VLMEvalKit

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 4,040 678 Updated Apr 10, 2026

InternRobotics / MMSI-Bench

[ICLR 2026] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence

Python 84 2 Updated Apr 14, 2026

bertjiazheng / Structured3D

[ECCV'20] Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling

Python 659 73 Updated Feb 24, 2025

SandAI-org / MagiAttention

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 779 49 Updated Apr 8, 2026

InternRobotics / G2VLM

[CVPR 2026] G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

Python 310 9 Updated Mar 24, 2026

MichaelGrupp / evo

Python package for the evaluation of odometry and SLAM

Python 4,184 789 Updated Apr 8, 2026

VQAssessment / DOVER

[ICCV 2023, Official Code] for paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives". Official Weights and Demos provided.

Jupyter Notebook 498 50 Updated Aug 12, 2024

facebookresearch / sam-3d-objects

SAM 3D Objects

Python 6,439 765 Updated Mar 12, 2026

ByteDance-Seed / Depth-Anything-3

Depth Anything 3

Python 4,982 520 Updated Mar 21, 2026

SandAI-org / MAGI-1

MAGI-1: Autoregressive Video Generation at Scale

Python 3,676 236 Updated Jun 17, 2025

Yangr116 / VST

Visual Spatial Tuning

Jupyter Notebook 195 8 Updated Mar 25, 2026

TencentARC / RollingForcing

[ICLR 2026] Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time

Python 369 18 Updated Oct 31, 2025

NVlabs / LongLive

[ICLR 2026] LongLive: Real-time Interactive Long Video Generation

Python 1,157 106 Updated Feb 26, 2026

NVlabs / OmniVinci

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

Python 652 51 Updated Feb 26, 2026

vsitzmann / xfactor-nvs

Public code for XFactor: Introduces the first geometry-free model to achieve true self-supervised / pose-free Novel View Synthesis (NVS) by learning transferable latent camera pose representations.

Python 145 2 Updated Mar 25, 2026

baaivision / Emu3.5

Native Multimodal Models are World Learners

Python 1,496 61 Updated Dec 30, 2025

thu-ml / prolificdreamer

ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)

Python 1,565 47 Updated Nov 22, 2023

tianweiy / DMD2

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,299 70 Updated Mar 5, 2025

facebookresearch / DepthLM_Official

[ICLR 2026 Oral (top 1.2%)] Official implementation of DepthLM

Python 326 15 Updated Mar 2, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hehao hehao13

Achievements

Achievements

Highlights

Block or report hehao13

Stars

nvidia-cosmos / cosmos-reason2

amap-cvlab / ABot-PhysWorld

fabbrimatteo / JTA-Dataset

InternRobotics / InternUtopia

Natfii / UnrealClaude

haoyi-duan / WorldScore

MarkYu98 / madpose

Hongyang-Du / VideoGPA

nv-tlabs / vipe

Robbyant / lingbot-vla

Robbyant / lingbot-depth

open-compass / VLMEvalKit

InternRobotics / MMSI-Bench

bertjiazheng / Structured3D

SandAI-org / MagiAttention

InternRobotics / G2VLM

MichaelGrupp / evo

VQAssessment / DOVER

facebookresearch / sam-3d-objects

ByteDance-Seed / Depth-Anything-3

SandAI-org / MAGI-1

Yangr116 / VST

TencentARC / RollingForcing

NVlabs / LongLive

NVlabs / OmniVinci

vsitzmann / xfactor-nvs

baaivision / Emu3.5

thu-ml / prolificdreamer

tianweiy / DMD2

facebookresearch / DepthLM_Official