zhonghai1995

hai zhonghai1995

16 followers · 208 following

Stars

dongzhuoyao / awesome-flow-matching

A summary of related works about flow matching, stochastic interpolants

616 18 Updated Mar 25, 2025

leofan90 / Awesome-World-Models

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

920 22 Updated Dec 17, 2025

metadriverse / PPL

Codebase of Predictive Preference Learning from Human Interventions

Python 3 Updated Dec 4, 2025

bennidict23 / GoRL

An Algorithm-Agnostic Framework for Online Reinforcement Learning with Generative Policies

Python 21 Updated Dec 3, 2025

qw3rtman / robust-world-model-planning

Code for "Closing the Train-Test Gap in World Models for Gradient-Based Planning"

Python 71 3 Updated Dec 13, 2025

google-deepmind / mujoco

Multi-Joint dynamics with Contact. A general purpose physics simulator.

C++ 11,451 1,245 Updated Dec 20, 2025

IliaLarchenko / behavior-1k-solution

1st place solution of 2025 BEHAVIOR Challenge

Python 132 10 Updated Dec 14, 2025

WujiangXu / EPO

The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"

Python 36 1 Updated Oct 1, 2025

eloialonso / diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,931 140 Updated Dec 6, 2024

knightnemo / Awesome-World-Models

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

1,504 61 Updated Dec 18, 2025

Stable-X / StableNormal

[SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal

Python 733 35 Updated Aug 2, 2025

naumix / BiggerRegularizedCategorical

Python 9 Updated Nov 18, 2025

leggedrobotics / robotic_world_model

Repository for our paper: Robotic World Model: A Neural Network Simulator for Robust Policy Optimization in Robotics

Python 374 17 Updated Dec 12, 2025

ESHyperscale / HyperscaleES

Jax Codebase for Evolutionary Strategies at the Hyperscale

Python 196 15 Updated Nov 20, 2025

THUDM / TDRM

Python 9 1 Updated Sep 25, 2025

Tsedao / Safe-FinRL

Code for paper 'Safe-FinRL: A Low Bias and Variance Deep Reinforcement Learning Implementation For High-Freq Stock Trading'

Python 6 Updated May 20, 2022

enactic / openarm

A fully open-source humanoid arm for physical AI research and deployment in contact-rich environments.

MDX 1,590 176 Updated Dec 19, 2025

helblazer811 / Diffusion-Explorer

Interactive visualizations of the geometric intuition behind diffusion models.

Svelte 918 44 Updated Dec 20, 2025

xbpeng / MimicKit

A lightweight suite of motion imitation methods for training controllers.

Python 1,199 128 Updated Dec 17, 2025

Physical-Intelligence / real-time-chunking-kinetix

Simulated experiments for "Real-Time Execution of Action Chunking Flow Policies".

Python 340 17 Updated Dec 8, 2025

PingchuanMa / NCLaw

[ICML 2023] Learning Neural Constitutive Laws from Motion Observations for Generalizable PDE Dynamics

Python 142 16 Updated Jun 28, 2023

eth-lre / PedagogicalRL

Multi-turn RL framework for aligning models to be tutors instead of answerers. EMNLP 2025 Oral

Python 27 7 Updated Dec 11, 2025

thu-ml / RoboticsDiffusionTransformer

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Python 1,561 146 Updated Sep 28, 2025

Psi-Robot / DexGraspVLA

[AAAI'26 Oral] DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping

Python 445 33 Updated Aug 10, 2025

vineetjain96 / Diffusion-Tree-Sampling

Code for the paper "Diffusion Tree Sampling: Scalable inference-time alignment of diffusion models"

Python 8 Updated Dec 5, 2025

TianxingChen / Embodied-AI-Guide

[Lumina Embodied AI] Embodied AI Technical Guide (Embodied-AI-Guide)

10,065 680 Updated Dec 3, 2025

aoberai / trl

Code for "Transitive RL: Value Learning via Divide and Conquer"

Python 44 3 Updated Oct 31, 2025

PneuC / DrAC

Official repository for NeurIPS 2025 publication "Learning Intractable Multimodal Policies with Reparameterization and Diversity Regularization"

Python 4 Updated Dec 6, 2025

CMU-AIRe / floq

Code Release for floq: Training Critics via Flow-Matching for Scaling Compute In Value-Based RL

Python 21 3 Updated Oct 23, 2025

sizhe-li / neural-jacobian-field

Controlling diverse robots by inferring Jacobian fields with deep networks! Let's make robots understand their bodies!

Jupyter Notebook 194 28 Updated Dec 9, 2025