GuHuangAI
🎯
Focusing


Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and shaping the future!

248 5 Updated Dec 23, 2025

Official code of RDT 2

Python 606 30 Updated Dec 3, 2025

[NeurIPS 2025] Flow x RL. "ReinFlow: Fine-tuning Flow Policy with Online Reinforcement Learning". Support VLAs e.g., pi0, pi0.5. Fully open-sourced.

Python 212 19 Updated Dec 23, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 20,435 3,350 Updated Dec 24, 2025

Native Multimodal Models are World Learners

Python 1,372 52 Updated Nov 28, 2025

Official PyTorch Implementation of "F2M-Reg: Unsupervised RGB-D registration with Frame-to-Model Optimization"

3 Updated Jul 8, 2025

Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

Python 541 45 Updated Dec 20, 2025

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 2,937 293 Updated Dec 22, 2025

Code for paper "CycleDiff: Cycle Diffusion Models for Unpaired Image-to-image Translation"

Python 63 Updated Nov 12, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,066 1,278 Updated Oct 11, 2025

A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation

Python 393 38 Updated Oct 30, 2025

[CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control

Jupyter Notebook 1,217 65 Updated Sep 24, 2025

Nvidia GEAR Lab's initiative to solve the robotics data problem using world models

Jupyter Notebook 422 41 Updated Oct 24, 2025

Official PyTorch implementation of One-Minute Video Generation with Test-Time Training

Python 2,323 192 Updated Jun 5, 2025

Code for CoRL 2025 "LaDiWM: A Latent Diffusion-based World Model for Predictive Manipulation"

Python 39 5 Updated Nov 30, 2025

Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.

Python 691 95 Updated Oct 29, 2025

Implementation of Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination

Python 35 2 Updated May 7, 2025
Python 1 Updated Apr 8, 2025

Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.

Jupyter Notebook 390 76 Updated Aug 20, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,057 522 Updated Jun 9, 2025

[IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 2,682 189 Updated Dec 16, 2025

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,936 140 Updated Dec 6, 2024

🆓 List of free ChatGPT mirror sites, continuously updated.

Python 20,704 1,402 Updated Jun 23, 2025

SAPIEN Manipulation Skill Framework, a GPU parallelized robotics simulator and benchmark

Python 1 Updated Aug 15, 2024

[NeurIPS 2023] Efficient Diffusion Policy

Python 113 8 Updated Oct 31, 2023

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 7,927 600 Updated Jul 17, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 17,279 1,565 Updated Sep 5, 2024

CaCo: Both Positive and Negative Samples are Directly Learnable via Cooperative-adversarial Contrastive Learning

Python 23 4 Updated Mar 10, 2024

code for paper: MS2A: Memory Storage-to-Adaptation for Cross-domain Few-annotation Object Detection

Python 3 Updated Oct 3, 2024