Skip to content
View LLT1's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Institute of Automation,Chinese Academy of Sciences
  • Beijing

Block or report LLT1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 542 78 Updated Jan 20, 2026

Open-source implementation of AlphaEvolve

Python 6,540 1,045 Updated Mar 18, 2026

AutoThink is a reinforcement learning framework designed to equip R1-style language models with adaptive reasoning capabilities. Instead of always thinking or never thinking, the model learns when …

Python 52 4 Updated Oct 14, 2025

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 452 63 Updated Mar 20, 2026

Verification of Google DeepMind's AlphaEvolve 48-multiplication matrix algorithm, a breakthrough in matrix multiplication after 56 years.

Python 137 10 Updated Jun 14, 2025

An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative Tasks

Python 55 8 Updated Jun 12, 2026

[ICML 2025] Official Code of SMPE: "Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration"

Python 35 3 Updated Feb 9, 2026

Drone Automation using Multi-Agent Reinforcement Learning

Jupyter Notebook 2 Updated Aug 27, 2023

Training of Drone Swarms using StableBaselines3, PettingZoo, AirSim and UE4

Python 89 12 Updated Jun 29, 2025
Python 1 1 Updated Aug 19, 2025

Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习

Python 14,337 3,841 Updated Feb 18, 2025

PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF) and Extensions: N-step Bootstrapping, PER, Noisy Layer, Du…

Jupyter Notebook 34 11 Updated Oct 10, 2020

Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regression' based on OpenAi DQN baselines.

Python 133 27 Updated May 5, 2019

PRML Page-by-page配套资料,对PRML全书及各章节的review

17 3 Updated Apr 16, 2024

A list of papers regarding generalization in (deep) reinforcement learning

155 19 Updated Aug 12, 2023

Lightweight multi-agent gridworld Gym environment

Python 212 42 Updated Sep 21, 2023

Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation

Python 494 71 Updated May 9, 2024

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 22,035 6,135 Updated Jul 13, 2023

Unofficial Gato: A Generalist Agent

Python 220 33 Updated Jan 14, 2024

Minimal code for A Generalist Agent

Python 44 6 Updated Nov 4, 2022

A cute running cat animation on your windows taskbar.

C# 10,143 835 Updated Jun 12, 2026

DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Python 354 42 Updated Mar 25, 2023

some common TD Learning algorithms

Python 66 30 Updated Mar 6, 2020

A visualization of proto-value functions

Python 1 Updated Nov 22, 2016
Python 43 14 Updated Feb 9, 2017
Python 1 Updated Dec 28, 2020

Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"

Python 46 8 Updated Sep 20, 2023
Next