Stars
The absolute trainer to light up AI agents.
Science-Star: A Platform for Building, Extending, and Experimenting with Scientific Agents.
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
AndroidWorld is an environment and benchmark for autonomous agents
A Universal Platform for Training and Evaluation of Mobile Interaction
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
Code for InfoDeepSeek: Benchmarking Agentic Information Seeking for Retrieval-Augmented Generation
Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …
SGLang is a fast serving framework for large language models and vision language models.
A project to improve skills of large language models
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
Training VLM agents with multi-turn reinforcement learning
This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!
verl: Volcano Engine Reinforcement Learning for LLMs
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant refe…
Comprehensive benchmark for RAG
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.