RL
Model Predictive Control for Quadrotors with extension to Perception-Aware MPC
HoloMotion: A Foundation Model for Whole-Body Humanoid Control
This is the official implementation of "OpenREAD:Reinforced Open-Ended Reasoning for End-to-End Autonomous Driving with LLM-as-Critic"
End-to-end optimal quadcopter control through Reinforcement Learning
MAVRL: Learn to Fly in Cluttered Environmants with Varying Speed
The neuromorphic PID for altitude control of MAVs with Loihi
Adaptive Risk Tendency Implicit Quantile Network for Drone Navigation under Partial Observability.
Accompanying code for "Discovering State-of-the-art Reinforcement Algorithms" Nature publication
A general-purpose machine learning architecture designed for robot training
A high-performance physics simulation engine designed for multibody dynamics and robotics simulation.
Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning
RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI
From search engines, to science, to robotics, this reposity is meant to showcase the use of reinforcement learning in the world..
A standalone library to randomize various OpenAI Gym Environments
Code repository for Active Domain Randomization (CoRL 2019, https://arxiv.org/abs/1904.04762)
[CoRL 2024] Official repository for: "CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning"
Reinforcement learning based navigation for the Unified Autonomy Stack
End-to-end optimal quadcopter control through Supervised Learning
The code for the paper "Towards Monocular Vision based Obstacle Avoidance through Deep Reinforcement Learning"
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
[RA-L 26] Learning on the Fly: Rapid Policy Adaptation via Differentiable Simulation
[ArXiv 26] FRoM-W1: Towards General Humanoid Whole-Body Control with Language Instructions
Official implementation for DSRL, Steering Your Diffusion Policy with Latent Space Reinforcement Learning (CoRL 2025)
Official implementation for pi0 steering via DSRL, Steering Your Diffusion Policy with Latent Space Reinforcement Learning (CoRL 2025)