Stars
🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.
Code for the paper: ReSeeding Latent States for Sequential Language Understanding
Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas
quantumiracle / marl_tf
Forked from hardmaru/slimevolleygymA simple OpenAI Gym environment for single and multi-agent reinforcement learning
Unity project for Human AI Teaming clone of MiniMetro game in VR
I2Q: A Fully Decentralized Q-Learning Algorithm
Python implementation for Mini Metro. Can be used for reinforcement learning.
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
Massively Parallel Deep Reinforcement Learning. 🔥
An extension of the PyMARL codebase that includes additional algorithms and environment support
This is the official implementation of Multi-Agent PPO (MAPPO).
A tool to find the optimal layout of lines in the game Mini Metro.
Repository for conducting RL experiments on multi-agent systems
Typst-based CV/resume generator for academics and engineers
My curriculum vitae (CV) written using LaTeX.
A batteries-included LaTex resume for students
Enhance your résumé with Large Language Models
Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
Reinforcement Learning environments for Traffic Signal Control with SUMO. Compatible with Gymnasium, PettingZoo, and popular RL libraries.
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Play with the solutions to the multi-armed-bandit problem.
Paper list of multi-agent reinforcement learning (MARL)
Google Map Scraper using python, selenium and headless chromium.