Jupyter Notebook 1 Updated Dec 8, 2025

🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.

TypeScript 23,846 1,067 Updated Mar 31, 2026

Code for the paper: ReSeeding Latent States for Sequential Language Understanding

Python 3 Updated Sep 29, 2025

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 5,503 775 Updated Aug 20, 2025

A simple OpenAI Gym environment for single and multi-agent reinforcement learning

Python 3 1 Updated Feb 14, 2021

Unity project for Human AI Teaming clone of MiniMetro game in VR

C# 6 Updated Jul 15, 2025

I2Q: A Fully Decentralized Q-Learning Algorithm

Python 19 2 Updated Nov 10, 2022
Python 4 2 Updated Jul 17, 2025

Python implementation for Mini Metro. Can be used for reinforcement learning.

Python 35 9 Updated Feb 18, 2026

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Jupyter Notebook 1,339 148 Updated Mar 13, 2025

Massively Parallel Deep Reinforcement Learning. 🔥

Python 4,312 970 Updated Feb 20, 2026

Multi-Agent Reinforcement Learning with JAX

Python 784 145 Updated Jan 14, 2026

An extension of the PyMARL codebase that includes additional algorithms and environment support

Python 707 190 Updated Sep 24, 2024

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,961 373 Updated Jul 18, 2024

A tool to find the optimal layout of lines in the game Mini Metro.

Java 3 1 Updated Jun 27, 2022

SIMD-Accelerated Sampling-based Motion Planning

C++ 406 73 Updated Apr 10, 2026

Repository for conducting RL experiments on multi-agent systems

Python 9 Updated Jul 28, 2024

Resume builder for academics and engineers

Python 16,310 1,184 Updated Apr 6, 2026

My curriculum vitae (CV) written using LaTeX.

TeX 916 273 Updated Sep 11, 2024

A batteries-included LaTeX resume for students

TeX 22 1 Updated Jul 14, 2023

Enhance your résumé with Large Language Models

Jinja 461 132 Updated Feb 16, 2026

Mini Metro SerpentAI agent

Jupyter Notebook 30 2 Updated Nov 22, 2022

Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment

Python 425 95 Updated Sep 15, 2024

Estimating Q(s,s') with Deep Deterministic Dynamics Gradients

Python 32 4 Updated Feb 21, 2020

Reinforcement Learning environments for Traffic Signal Control with SUMO. Compatible with Gymnasium, PettingZoo, and popular RL libraries.

Python 1,025 256 Updated Mar 8, 2026

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 9,568 1,049 Updated Jul 8, 2025

Play with the solutions to the multi-armed-bandit problem.

Python 417 98 Updated May 21, 2024

Paper list of multi-agent reinforcement learning (MARL)

4,790 775 Updated Feb 11, 2026

Google Maps scraper using Python, Selenium, and headless Chromium.

Python 9 3 Updated Feb 16, 2020