Skip to content
View thunderlrr's full-sized avatar

Highlights

  • Pro

Block or report thunderlrr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The official paper for EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL.

Python 65 1 Updated Jun 5, 2026

A minimalist MVP demonstrating a simple yet profound insight: aligning AI memory with human episodic memory granularity. Shows how this single principle enables simple methods to rival complex memo…

Python 203 18 Updated Apr 16, 2026

[ICML'26] Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory

Python 16 4 Updated Jun 10, 2026

HKUST(GZ) MPhil Thesis LaTeX Template. Based on @luckyfan-cs's project, reviewed and updated to the latest 2026 version.

TeX 5 Updated Apr 6, 2026

A Collection of Papers about Memory for Language Agents

566 44 Updated Jun 12, 2026

[Survey] A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

2,231 170 Updated May 16, 2026

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,992 351 Updated Jun 13, 2026

A agent framework based on the tutorial hello-agents

Python 2,008 497 Updated Jun 8, 2026

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 3,774 526 Updated Jun 13, 2026

📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程

Python 58,816 7,214 Updated Jun 11, 2026

A simple Python code to extract road network (in Shapefile) from OpenStreetMap (OSM)

Python 35 16 Updated Jul 3, 2020

A Trajectory Preprocessing Toolkit in Python

Python 91 27 Updated Sep 3, 2025

The code of paper Learning Cut Selection for Mixed-Integer Linear Programming via Hierarchical Sequence Model. Zhihai Wang, Xijun Li, Jie Wang*, Yufei Kuang, Mingxuan Yuan, Jia Zeng, Yongdong Zhan…

Python 65 9 Updated May 12, 2023

Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization

Python 205 54 Updated Jul 29, 2020

Deep Reinforcement Learning for UAV Routing in The Presence of Multiple Charging Stations

Python 34 5 Updated May 5, 2026

Code for tasks on Cainiao-LaDe (Last-mile Delivery dataset).

Python 97 29 Updated Dec 23, 2025

An elegant PyTorch deep reinforcement learning library.

Python 10,795 1,319 Updated Apr 3, 2026
Jupyter Notebook 4 Updated Aug 17, 2024

The companion code for KDD'23 FairCod

Python 4 1 Updated Jun 20, 2023

[KDD 2021] Energy-Efficient 3D Vehicular Crowdsourcing for Disaster Response by Distributed Deep Reinforcement Learning

Python 19 4 Updated May 18, 2022

A curated list of reinforcement learning with human feedback resources (continually updated)

4,386 255 Updated May 20, 2026

Paper List of Inference/Test Time Scaling/Computing

Python 388 19 Updated May 31, 2026
Next