
🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.

TypeScript 23,574 1,052 Updated Mar 10, 2026

Resume builder for academics and engineers

Python 16,152 1,158 Updated Mar 30, 2026

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 9,435 1,033 Updated Jul 8, 2025

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 5,467 767 Updated Aug 20, 2025

Paper list of multi-agent reinforcement learning (MARL)

4,770 770 Updated Feb 11, 2026

Massively Parallel Deep Reinforcement Learning. 🔥

Python 4,309 971 Updated Feb 20, 2026

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,941 374 Updated Jul 18, 2024

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Jupyter Notebook 1,336 146 Updated Mar 13, 2025

Reinforcement Learning environments for Traffic Signal Control with SUMO. Compatible with Gymnasium, PettingZoo, and popular RL libraries.

Python 1,016 255 Updated Mar 8, 2026

My curriculum vitae (CV) written using LaTeX.

TeX 909 274 Updated Sep 11, 2024

Multi-Agent Reinforcement Learning with JAX

Python 779 142 Updated Jan 14, 2026

Implementations of various VAE-based semi-supervised and generative models in PyTorch

Python 710 125 Updated Mar 2, 2020

An extension of the PyMARL codebase that includes additional algorithms and environment support

Python 702 190 Updated Sep 24, 2024

A record of books on planning, decision-making, machine learning, and programming

518 198 Updated Nov 13, 2018

Code for reproducing results of NIPS 2014 paper "Semi-Supervised Learning with Deep Generative Models"

Python 517 147 Updated Jan 25, 2015

Enhance your résumé with Large Language Models

Jinja 460 131 Updated Feb 16, 2026

Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment

Python 418 94 Updated Sep 15, 2024

Play with the solutions to the multi-armed-bandit problem.

Python 417 98 Updated May 21, 2024

SIMD-Accelerated Sampling-based Motion Planning

C++ 372 64 Updated Mar 24, 2026

Python implementation for Mini Metro. Can be used for reinforcement learning.

Python 35 9 Updated Feb 18, 2026

Estimating Q(s,s') with Deep Deterministic Dynamics Gradients

Python 32 4 Updated Feb 21, 2020

Mini Metro SerpentAI agent

Jupyter Notebook 30 2 Updated Nov 22, 2022

A batteries-included LaTeX resume for students

TeX 22 1 Updated Jul 14, 2023

I2Q: A Fully Decentralized Q-Learning Algorithm

Python 19 2 Updated Nov 10, 2022

Repository for conducting RL experiments on multi-agent systems

Python 9 Updated Jul 28, 2024

Google Maps scraper using Python, Selenium, and headless Chromium.

Python 9 3 Updated Feb 16, 2020

Unity project for Human AI Teaming clone of MiniMetro game in VR

C# 6 Updated Jul 15, 2025
Python 4 1 Updated Jul 17, 2025

Code for the paper: ReSeeding Latent States for Sequential Language Understanding

Python 3 Updated Sep 29, 2025