-
AgentEvolver Public
Forked from modelscope/AgentEvolver
AgentEvolver: Towards Efficient Self-Evolving Agent System
Python Apache License 2.0 Updated Nov 21, 2025 -
enterprise-deep-research Public
Forked from SalesforceAIResearch/enterprise-deep-research
Salesforce Enterprise Deep Research
-
DreamGym Public
Forked from Pi3AI/DreamGym
This is an AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiv:2511.03773).
Python Updated Nov 9, 2025 -
DeepAgent Public
Forked from RUC-NLPIR/DeepAgent
🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets
Python MIT License Updated Nov 2, 2025 -
torchforge Public
Forked from meta-pytorch/torchforge
PyTorch-native post-training at scale
Python BSD 3-Clause "New" or "Revised" License Updated Oct 28, 2025 -
magic-wormhole Public
Forked from magic-wormhole/magic-wormhole
get things from one computer to another, safely
Python MIT License Updated Oct 23, 2025 -
verl-agent Public
Forked from langfengQ/verl-agent
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
Python Apache License 2.0 Updated Oct 20, 2025 -
MUSE Public
Forked from KnowledgeXLab/MUSE
Learning on the Job: An Experience-Driven, Self-Evolving Agent for Long-Horizon Tasks
Python MIT License Updated Oct 16, 2025 -
AgentBench Public
Forked from THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Python Apache License 2.0 Updated Oct 14, 2025 -
SEED-GRPO Public
Forked from Dreamer312/SEED-GRPO
The official repository of SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization
Python Apache License 2.0 Updated Oct 14, 2025 -
Open-AgentRL Public
Forked from Gen-Verse/Open-AgentRL
Demystifying Reinforcement Learning in Agentic Reasoning
Python Apache License 2.0 Updated Oct 14, 2025 -
KnowRL Public
Forked from zjunlp/KnowRL
KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality
Python MIT License Updated Oct 10, 2025 -
OpenManus-RL Public
Forked from OpenManus/OpenManus-RL
A live stream development of RL tuning for LLM agents
Python Apache License 2.0 Updated Oct 8, 2025 -
tinker-cookbook Public
Forked from thinking-machines-lab/tinker-cookbook
Post-training with Tinker
Python Apache License 2.0 Updated Oct 5, 2025 -
EPO Public
Forked from WujiangXu/EPO
The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"
Python Apache License 2.0 Updated Oct 1, 2025 -
AgentGym-RL Public
Forked from WooooDyy/AgentGym-RL
Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.
Python MIT License Updated Sep 11, 2025 -
TrustEval-toolkit Public
Forked from TrustGen/TrustEval-toolkit
Toolkit for evaluating the trustworthiness of generative foundation models.
Python Other Updated Aug 22, 2025 -
RLCR Public
Forked from damanimehul/RLCR
Official repository for Beyond Binary Rewards: Training LMs to Reason about Their Uncertainty
Python MIT License Updated Aug 20, 2025 -
Awesome-Efficient-Reasoning-LLMs Public
Forked from Eclipsess/Awesome-Efficient-Reasoning-LLMs
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
Updated Aug 11, 2025 -
MiroFlow Public
Forked from MiroMindAI/MiroFlow
Miroflow is an agent framework that simplifies the development of complex, multi-agent systems. Build, manage, and scale your AI agents with ease.
Python Apache License 2.0 Updated Aug 8, 2025 -
AWorld Public
Forked from inclusionAI/AWorld
Build, evaluate and train General Multi-Agent Assistance with ease
Python MIT License Updated Aug 6, 2025 -
deep_research_bench Public
Forked from Ayanami0730/deep_research_bench
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
Python Apache License 2.0 Updated Aug 3, 2025 -
Influences-on-LLM-Calibration Public
Forked from Yuuxii/Influences-on-LLM-Calibration
Python MIT License Updated Jul 29, 2025 -