Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,048 643 Updated Dec 19, 2025

microsoft / agent-lightning

The absolute trainer to light up AI agents.

Python 9,740 790 Updated Dec 19, 2025

sgl-project / mini-sglang

Python 1,327 95 Updated Dec 18, 2025

Tongyi-Zhiwen / Qwen-Doc

Python 385 18 Updated Dec 16, 2025

NiklasFreymuth / troll

Forked from volcengine/verl

TROLL: Trust Region Optimization for Large Language models

Python 7 Updated Nov 27, 2025

deepreinforce-ai / CUDA-L1

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning

Python 277 60 Updated Nov 3, 2025

radixark / miles

Python 610 54 Updated Dec 16, 2025

ISEEKYAN / verl_megatron_practice

(best/better) practices of megatron on veRL and tuning guide

Shell 111 8 Updated Sep 26, 2025

agentica-project / verl

Python 17 13 Updated Sep 22, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,285 327 Updated Dec 15, 2025

IANNXANG / RuscaRL

Python 19 1 Updated Dec 17, 2025

AI-Infra-Team / awesome-papers

Paper reading and discussion notes, covering AI frameworks, distributed systems, cluster management, etc.

48 1 Updated Nov 11, 2025

verl-project / verl-recipe

A set of examples based on verl for end-to-end RL training recipes.

Python 68 7 Updated Dec 1, 2025

mit-han-lab / fastrl

[ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter

Python 113 8 Updated Dec 5, 2025

alibaba / ROCK

A construction kit for reinforcement learning environment management.

Python 250 26 Updated Dec 19, 2025

NVIDIA / DALI

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 5,578 655 Updated Dec 19, 2025

lucas zhanjiqing

Stars

zhaochenyang20 / Awesome-ML-SYS-Tutorial

TIGER-AI-Lab / verl-tool

0russwest0 / Agent-R1

zilliztech / deep-searcher

Visual-Agent / DeepEyesV2

Visual-Agent / DeepEyes

MineDojo / MineDojo

MineDojo / Voyager

ReTool-RL / ReTool

RUCAIBox / R1-Searcher

PeterGriffinJin / Search-R1

Agent-RL / ReCall

inclusionAI / AReaL

Simple-Efficient / RL-Factory

OpenPipe / ART