Highlights
- Pro
Stars
Code for AAMAS 2024 "Cost-aware Offline Safe Meta Reinforcement Learning with Robust In-Distribution Online Task Adaptation"
(T-IV) Dream to Drive with Predictive Individual World Model
Code for Data Collection & Training in Sim+Real Envs: [RSS 2024] Natural Language Can Help Bridge the Sim2Real Gap
An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
An open-source AI agent that brings the power of Grok directly into your terminal.
Minimal JAX implementation unifying Diffusion and Flow Matching algorithms as alternative strategies for transporting data distributions.
Code for our AAAI 26 paper: "Expressive Temporal Specifications for Reward Monitoring"
ICLR 2024: SafeDreamer: Safe Reinforcement Learning with World Models
Official PyTorch implementation of 'Reductive Lie Neurons'
LLM Council works together to answer your hardest questions
Learning Safety Constraints for Large Language Models (ICML2025)
Heterogeneous Hierarchical Multi Agent Reinforcement Learning for Air Combat
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
PKBoost: Adaptive GBDT for Concept Drift, Built from scratch in Rust, PKBoost manages changing data distributions in fraud detection with a fraud rate of 0.2%. It shows less than 2% degradation und…
Implementation of the Online Adaptive CBF for safety-critical navigation for input constrained systems.
Introduction to Machine Learning Systems
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
End-to-End Humanoid Robot Safe and Comfortable Locomotion Policy
[IEEE RA-L'25] NavRL: Learning Safe Flight in Dynamic Environments (NVIDIA Isaac/Python/ROS1/ROS2)