A summary of related works about flow matching, stochastic interpolants

616 18 Updated Mar 25, 2025

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

920 22 Updated Dec 17, 2025

Codebase of Predictive Preference Learning from Human Interventions

Python 3 Updated Dec 4, 2025

An Algorithm-Agnostic Framework for Online Reinforcement Learning with Generative Policies

Python 21 Updated Dec 3, 2025

Code for "Closing the Train-Test Gap in World Models for Gradient-Based Planning"

Python 71 3 Updated Dec 13, 2025

Multi-Joint dynamics with Contact. A general purpose physics simulator.

C++ 11,451 1,245 Updated Dec 20, 2025

1st place solution of 2025 BEHAVIOR Challenge

Python 132 10 Updated Dec 14, 2025

The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"

Python 36 1 Updated Oct 1, 2025

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,931 140 Updated Dec 6, 2024

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

1,504 61 Updated Dec 18, 2025

[SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal

Python 733 35 Updated Aug 2, 2025

Repository for our paper: Robotic World Model: A Neural Network Simulator for Robust Policy Optimization in Robotics

Python 374 17 Updated Dec 12, 2025

Jax Codebase for Evolutionary Strategies at the Hyperscale

Python 196 15 Updated Nov 20, 2025
Python 9 1 Updated Sep 25, 2025

Code for paper 'Safe-FinRL: A Low Bias and Variance Deep Reinforcement Learning Implementation For High-Freq Stock Trading'

Python 6 Updated May 20, 2022

A fully open-source humanoid arm for physical AI research and deployment in contact-rich environments.

MDX 1,590 176 Updated Dec 19, 2025

Interactive visualizations of the geometric intuition behind diffusion models.

Svelte 918 44 Updated Dec 20, 2025

A lightweight suite of motion imitation methods for training controllers.

Python 1,199 128 Updated Dec 17, 2025

Simulated experiments for "Real-Time Execution of Action Chunking Flow Policies".

Python 340 17 Updated Dec 8, 2025

[ICML 2023] Learning Neural Constitutive Laws from Motion Observations for Generalizable PDE Dynamics

Python 142 16 Updated Jun 28, 2023

Multi-turn RL framework for aligning models to be tutors instead of answerers. EMNLP 2025 Oral

Python 27 7 Updated Dec 11, 2025

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Python 1,561 146 Updated Sep 28, 2025

[AAAI'26 Oral] DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping

Python 445 33 Updated Aug 10, 2025

Code for the paper "Diffusion Tree Sampling: Scalable inference-time alignment of diffusion models"

Python 8 Updated Dec 5, 2025

[Lumina Embodied AI] Embodied AI Technical Guide (Embodied-AI-Guide)

10,065 680 Updated Dec 3, 2025

Code for "Transitive RL: Value Learning via Divide and Conquer"

Python 44 3 Updated Oct 31, 2025

Official repository for NeurIPS 2025 publication "Learning Intractable Multimodal Policies with Reparameterization and Diversity Regularization"

Python 4 Updated Dec 6, 2025

Code Release for floq: Training Critics via Flow-Matching for Scaling Compute In Value-Based RL

Python 21 3 Updated Oct 23, 2025

Controlling diverse robots by inferring jacobian fields with deep networks! Let's make robots understand their bodies!

Jupyter Notebook 194 28 Updated Dec 9, 2025