Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Python 1,731 298 Updated Sep 8, 2022

oxwhirl / pymarl

Python Multi-Agent Reinforcement Learning framework

Python 2,168 411 Updated Dec 8, 2022

1995chen / dnf

Shell 1,934 537 Updated Jan 19, 2026

opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,550 188 Updated Mar 22, 2026

toshikwa / fqf-iqn-qrdqn.pytorch

PyTorch implementation of FQF, IQN and QR-DQN.

Python 189 32 Updated Jul 25, 2024

alsyundawy / Microsoft-Office-For-MacOS

Installer Microsoft Office For MacOS

6,179 830 Updated Mar 23, 2026

Kaixhin / Rainbow

Rainbow: Combining Improvements in Deep Reinforcement Learning

Python 1,663 293 Updated Jan 13, 2022

danijar / dreamerv3

Mastering Diverse Domains through World Models

Python 2,970 490 Updated Sep 23, 2025

jingyaogong / minimind-v

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM！🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 6,947 748 Updated Feb 4, 2026

friendmine / llm-course-chn

chinese translation of llm-course

Jupyter Notebook 320 35 Updated Apr 3, 2024

JayceNing / AI_study_notes

我的AI学习笔记。包括b站up主deep_thoughts的PyTorch课程笔记和相关代码；北邮深度学习与数字视频PPT代码。

Jupyter Notebook 43 8 Updated Jun 18, 2024

jingyaogong / minimind

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT！🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 42,897 5,157 Updated Mar 24, 2026

HJYao00 / Mulberry

[NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS

Python 1,242 113 Updated Jan 16, 2026

RLHFlow / Self-rewarding-reasoning-LLM

Recipes to train the self-rewarding reasoning LLMs.

Python 231 14 Updated Mar 2, 2025

JSBSim-Team / jsbsim

An open source flight dynamics & control software library

C++ 1,954 553 Updated Mar 19, 2026

liuqh16 / LAG

An environment based on JSBSIM aimed at one-to-one close air combat.

Python 459 139 Updated May 19, 2025

openai / spinningup

An educational resource to help anyone learn deep reinforcement learning.

Python 11,669 2,443 Updated Aug 5, 2024

opendilab / PPOxFamily

PPO x Family DRL Tutorial Course（决策智能入门级公开课：8节课帮你盘清算法理论，理顺代码逻辑，玩转决策AI应用实践）

Python 2,545 212 Updated Mar 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ZemingGao goodgzm

Block or report goodgzm

Stars

test-time-training / discover

schroederdewitt / multiagent_mujoco

karpathy / nanoGPT

hylarucoder / ai-flavor-remover

openclaw / openclaw

TTomilin / MEAL

hiyouga / LlamaFactory

EleutherAI / lm-evaluation-harness

llm-brain-rot / llm-brain-rot

hijkzzz / pymarl2

shibhansh / loss-of-plasticity

awjuliani / deep-rl-plasticity

starry-sky6688 / MARL-Algorithms