GuHuangAI

🎯

Focusing

GuHuangAI

🎯

Focusing

PhD in NUDT now. Major: Generative models and 3D Computer Vision; World models and Embodied AI

44 followers · 12 following

Achievements

Stars

BestJunYu / Awesome-Physics-aware-Generation

Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and shaping the future!

248 5 Updated Dec 23, 2025

thu-ml / RDT2

Official code of RDT 2

Python 606 30 Updated Dec 3, 2025

ReinFlow / ReinFlow

[NeurIPS 2025] Flow x RL. "ReinFlow: Fine-tuning Flow Policy with Online Reinforcement Learning". Support VLAs e.g., pi0, pi0.5. Fully open-sourced.

Python 212 19 Updated Dec 23, 2025

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 20,435 3,350 Updated Dec 24, 2025

baaivision / Emu3.5

Native Multimodal Models are World Learners

Python 1,372 52 Updated Nov 28, 2025

MrIsland / F2M_Reg

Official PyTorch Implementation of "F2M-Reg: Unsupervised RGB-D registration with Frame-to-Model Optimization“

3 Updated Jul 8, 2025

nvidia-cosmos / cosmos-predict2.5

Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

Python 541 45 Updated Dec 20, 2025

thu-ml / SageAttention

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 2,937 293 Updated Dec 22, 2025

ZouShilong1024 / CycleDiff

Code for paper "CycleDiff: Cycle Diffusion Models for Unpaired Image-to-image Translation"

Python 63 Updated Nov 12, 2025

facebookresearch / vggt

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,066 1,278 Updated Oct 11, 2025

microsoft / CogACT

A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation

Python 393 38 Updated Oct 30, 2025

nv-tlabs / GEN3C

[CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control

Jupyter Notebook 1,217 65 Updated Sep 24, 2025

AgibotTech / Genie-Envisioner

Python 343 18 Updated Dec 24, 2025

NVIDIA / GR00T-Dreams

Nvidia GEAR Lab's initiative to solve the robotics data problem using world models

Jupyter Notebook 422 41 Updated Oct 24, 2025

test-time-training / ttt-video-dit

Official PyTorch implementation of One-Minute Video Generation with Test-Time Training

Python 2,323 192 Updated Jun 5, 2025

GuHuangAI / LaDiWM

code for CoRL2025 "LaDiWM: A Latent Diffusion-based World Model for Predictive Manipulation"

Python 39 5 Updated Nov 30, 2025

nvidia-cosmos / cosmos-predict2

Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.

Python 691 95 Updated Oct 29, 2025

leobarcellona / drema_code

Implementation of Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination

Python 35 2 Updated May 7, 2025

GuHuangAI / DIffPart

Python 1 Updated Apr 8, 2025

nvidia-cosmos / cosmos-predict1

Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.

Jupyter Notebook 390 76 Updated Aug 20, 2025

NVIDIA / Cosmos

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,057 522 Updated Jun 9, 2025

OpenDriveLab / AgiBot-World

[IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 2,682 189 Updated Dec 16, 2025

eloialonso / diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,936 140 Updated Dec 6, 2024

LiLittleCat / awesome-free-chatgpt

🆓免费的 ChatGPT 镜像网站列表，持续更新。List of free ChatGPT mirror sites, continuously updated.

Python 20,704 1,402 Updated Jun 23, 2025

aod321 / ManiSkill

Forked from haosulab/ManiSkill

SAPIEN Manipulation Skill Framework, a GPU parallelized robotics simulator and benchmark

Python 1 Updated Aug 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GuHuangAI

Achievements

Achievements

Block or report GuHuangAI

Stars

BestJunYu / Awesome-Physics-aware-Generation

thu-ml / RDT2

ReinFlow / ReinFlow

huggingface / lerobot

baaivision / Emu3.5

MrIsland / F2M_Reg

nvidia-cosmos / cosmos-predict2.5

thu-ml / SageAttention

ZouShilong1024 / CycleDiff

facebookresearch / vggt

microsoft / CogACT

nv-tlabs / GEN3C

AgibotTech / Genie-Envisioner

NVIDIA / GR00T-Dreams

test-time-training / ttt-video-dit

GuHuangAI / LaDiWM

nvidia-cosmos / cosmos-predict2

leobarcellona / drema_code

GuHuangAI / DIffPart

nvidia-cosmos / cosmos-predict1

NVIDIA / Cosmos

OpenDriveLab / AgiBot-World

eloialonso / diamond

LiLittleCat / awesome-free-chatgpt

aod321 / ManiSkill

sail-sg / edp

LiheYoung / Depth-Anything

IDEA-Research / Grounded-Segment-Anything

maple-research-lab / CaCo

GuHuangAI / MS2A