AutoThink is a reinforcement learning framework designed to equip R1-style language models with adaptive reasoning capabilities. Instead of always thinking or never thinking, the model learns when …

Python 52 4 Updated Oct 14, 2025

Simplified-Reasoning / LUFFY

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 452 63 Updated Mar 20, 2026

PhialsBasement / AlphaEvolve-MatrixMul-Verification

Verification of Google DeepMind's AlphaEvolve 48-multiplication matrix algorithm, a breakthrough in matrix multiplication after 56 years.

Python 137 10 Updated Jun 14, 2025

AILabDsUnipi / pymarlzooplus

An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative Tasks

Python 55 8 Updated Jun 12, 2026

andreaskontogiannis / smpe

[ICML 2025] Official Code of SMPE: "Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration"

Python 35 3 Updated Feb 9, 2026

lpanjwani / MARL-drones

Drone Automation using Multi-Agent Reinforcement Learning

Jupyter Notebook 2 Updated Aug 27, 2023

Lauqz / Drone-Swarm-RL-airsim-sb3

Training of Drone Swarms using StableBaselines3, PettingZoo, AirSim and UE4

Python 89 12 Updated Jun 29, 2025

adinlab / PAC-Bayes-ActorCritic

Python 1 1 Updated Aug 19, 2025

jindongwang / transferlearning

Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习

Python 14,337 3,841 Updated Feb 18, 2025

BY571 / FQF-and-Extensions

PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF) and Extensions: N-step Bootstrapping, PER, Noisy Layer, Du…

Jupyter Notebook 34 11 Updated Oct 10, 2020

Silvicek / distributional-dqn

Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regression' based on OpenAi DQN baselines.

Python 133 27 Updated May 5, 2019

Bluedotdot2021 / PRML-book_review

PRML Page-by-page配套资料，对PRML全书及各章节的review

17 3 Updated Apr 16, 2024

kaixin96 / rl-generalization-paper

A list of papers regarding generalization in (deep) reinforcement learning

155 19 Updated Aug 12, 2023

ArnaudFickinger / gym-multigrid

Lightweight multi-agent gridworld Gym environment

Python 212 42 Updated Sep 21, 2023

peract / peract

Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation

Python 494 71 Updated May 9, 2024

dennybritz / reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 22,035 6,135 Updated Jul 13, 2023

google-research / robotics_transformer

Python 1,729 198 Updated Jan 31, 2024

OrigamiDream / gato

Unofficial Gato: A Generalist Agent

Python 220 33 Updated Jan 14, 2024

YushuoLi / Gato-A-Generalist-Agent

Minimal code for A Generalist Agent

Python 44 6 Updated Nov 4, 2022

runcat-dev / RunCat365

A cute running cat animation on your windows taskbar.

C# 10,143 835 Updated Jun 12, 2026

RITCHIEHuang / DeepRL_Algorithms

DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Python 354 42 Updated Mar 25, 2023

chrodan / tdlearn

some common TD Learning algorithms

Python 66 30 Updated Mar 6, 2020

mcmachado / protovalue

Forked from roshanshariff/protovalue

A visualization of proto-value functions

Python 1 Updated Nov 22, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LLT1

Achievements

Achievements

Block or report LLT1

Stars

SHAILAB-IPEC / OpenFly-Platform

btx0424 / OmniDrones

algorithmicsuperintelligence / openevolve

ScienceOne-AI / AutoThink