- The Hong Kong University of Science and Technology
- Hong Kong SAR, China
- https://hkust.edu.hk/
- https://thunderlrr.github.io/songxinlei.github.io/
- https://scholar.google.com/citations?hl=zh-CN&user=-9qhgDgAAAAJ
Stars
The official paper for EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL.
A minimalist MVP demonstrating a simple yet profound insight: aligning AI memory with human episodic memory granularity. Shows how this single principle enables simple methods to rival complex memo…
[ICML'26] Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory
HKUST(GZ) MPhil Thesis LaTeX Template. Based on @luckyfan-cs's project, reviewed and updated to the latest 2026 version.
A Collection of Papers about Memory for Language Agents
[Survey] A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
SkyRL: A Modular Full-stack RL Library for LLMs
An agent framework based on the hello-agents tutorial
RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI
📚 "Building Agents from Scratch": a from-scratch tutorial on agent principles and practice
A simple Python script to extract a road network (as a Shapefile) from OpenStreetMap (OSM)
Code for the paper "Learning Cut Selection for Mixed-Integer Linear Programming via Hierarchical Sequence Model". Zhihai Wang, Xijun Li, Jie Wang*, Yufei Kuang, Mingxuan Yuan, Jia Zeng, Yongdong Zhan…
Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization
Deep Reinforcement Learning for UAV Routing in The Presence of Multiple Charging Stations
Code for tasks on Cainiao-LaDe (Last-mile Delivery dataset).
An elegant PyTorch deep reinforcement learning library.
[KDD 2021] Energy-Efficient 3D Vehicular Crowdsourcing for Disaster Response by Distributed Deep Reinforcement Learning
A curated list of reinforcement learning with human feedback resources (continually updated)
Paper List of Inference/Test Time Scaling/Computing