Ttopiac

Follow

Chi-Hui Lin Ttopiac

Follow

6 followers · 6 following

Achievements

Achievements

Stars

toon-format / toon

🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.

TypeScript 23,574 1,052 Updated Mar 10, 2026

rendercv / rendercv

Resume builder for academics and engineers

Python 16,152 1,158 Updated Mar 30, 2026

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 9,435 1,033 Updated Jul 8, 2025

SamuelSchmidgall / AgentLaboratory

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 5,467 767 Updated Aug 20, 2025

LantaoYu / MARL-Papers

Paper list of multi-agent reinforcement learning (MARL)

4,770 770 Updated Feb 11, 2026

AI4Finance-Foundation / ElegantRL

Massively Parallel Deep Reinforcement Learning. 🔥

Python 4,309 971 Updated Feb 20, 2026

marlbenchmark / on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,941 374 Updated Jul 18, 2024

quantumiracle / Popular-RL-Algorithms

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Jupyter Notebook 1,336 146 Updated Mar 13, 2025

LucasAlegre / sumo-rl

Reinforcement Learning environments for Traffic Signal Control with SUMO. Compatible with Gymnasium, PettingZoo, and popular RL libraries.

Python 1,016 255 Updated Mar 8, 2026

arasgungore / arasgungore-CV

My curriculum vitae (CV) written using LaTeX.

TeX 909 274 Updated Sep 11, 2024

FLAIROx / JaxMARL

Multi-Agent Reinforcement Learning with JAX

Python 779 142 Updated Jan 14, 2026

wohlert / semi-supervised-pytorch

Implementations of various VAE-based semi-supervised and generative models in PyTorch

Python 710 125 Updated Mar 2, 2020

uoe-agents / epymarl

An extension of the PyMARL codebase that includes additional algorithms and environment support

Python 702 190 Updated Sep 24, 2024

yangmingustb / planning_books_1

记录：规划，决策，机器学习，编程的书籍

518 198 Updated Nov 13, 2018

dpkingma / nips14-ssl

Code for reproducing results of NIPS 2014 paper "Semi-Supervised Learning with Deep Generative Models"

Python 517 147 Updated Jan 25, 2015

IvanIsCoding / ResuLLMe

Enhance your résumé with Large Language Models

Jinja 460 131 Updated Feb 16, 2026

semitable / robotic-warehouse

Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment

Python 418 94 Updated Sep 15, 2024

lilianweng / multi-armed-bandit

Play with the solutions to the multi-armed-bandit problem.

Python 417 98 Updated May 21, 2024

KavrakiLab / vamp

SIMD-Accelerated Sampling-based Motion Planning

C++ 372 64 Updated Mar 24, 2026

yanfengliu / python_mini_metro

Python implementation for Mini Metro. Can be used for reinforcement learning.

Python 35 9 Updated Feb 18, 2026

uber-research / D3G

Estimating Q(s,s') with Deep Deterministic Dynamics Gradients

Python 32 4 Updated Feb 21, 2020

david-macleod / mini-metro

Mini Metro SerpentAI agent

Jupyter Notebook 30 2 Updated Nov 22, 2022

aretrosen / kyvernetes-resume

A batteries-included LaTex resume for students

TeX 22 1 Updated Jul 14, 2023

jiechuanjiang / I2Q

I2Q: A Fully Decentralized Q-Learning Algorithm

Python 19 2 Updated Nov 10, 2022

HIRO-group / marl-experiments

Repository for conducting RL experiments on multi-agent systems

Python 9 Updated Jul 28, 2024

naimazizi / google-map-scraper

Google Map Scraper using python, selenium and headless chromium.

Python 9 3 Updated Feb 16, 2020

EllenGYY / Mini-Metro-Programming-Game

JavaScript 8 2 Updated Dec 30, 2022

xrdesign / Metro

Unity project for Human AI Teaming clone of MiniMetro game in VR

C# 6 Updated Jul 15, 2025

HIRO-group / multiHRI

Forked from StephAO/HAHA

Python 4 1 Updated Jul 17, 2025

HIRO-group / ReSEED

Code for the paper: ReSeeding Latent States for Sequential Language Understanding

Python 3 Updated Sep 29, 2025