Starred repositories
Salesforce Enterprise Deep Research
AgentEvolver: Towards Efficient Self-Evolving Agent System
This is an AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiv:2511.03773).
🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets
Learning on the Job: An Experience-Driven, Self-Evolving Agent for Long-Horizon Tasks
A live stream development of RL tuning for LLM agents
Self-Reflection in LLM Agents: Effects on Problem-Solving Performance
ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.
get things from one computer to another, safely
Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.
Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training"
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
The code for the paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"
KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality
Toolkit for evaluating the trustworthiness of generative foundation models.
Official repository for Beyond Binary Rewards: Training LMs to Reason about Their Uncertainty
The official repository of SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization
Demystifying Reinforcement Learning in Agentic Reasoning
Post-training with Tinker
[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models