Stars
🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.
Resume builder for academics and engineers
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas
Paper list of multi-agent reinforcement learning (MARL)
Massively Parallel Deep Reinforcement Learning. 🔥
This is the official implementation of Multi-Agent PPO (MAPPO).
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
Reinforcement Learning environments for Traffic Signal Control with SUMO. Compatible with Gymnasium, PettingZoo, and popular RL libraries.
My curriculum vitae (CV) written using LaTeX.
Implementations of various VAE-based semi-supervised and generative models in PyTorch
An extension of the PyMARL codebase that includes additional algorithms and environment support
Code for reproducing results of NIPS 2014 paper "Semi-Supervised Learning with Deep Generative Models"
Enhance your résumé with Large Language Models
Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment
Play with the solutions to the multi-armed-bandit problem.
Python implementation for Mini Metro. Can be used for reinforcement learning.
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
A batteries-included LaTex resume for students
I2Q: A Fully Decentralized Q-Learning Algorithm
Repository for conducting RL experiments on multi-agent systems
Google Map Scraper using python, selenium and headless chromium.
Unity project for Human AI Teaming clone of MiniMetro game in VR
Code for the paper: ReSeeding Latent States for Sequential Language Understanding