Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python · 7,792 stars · 600 forks · Updated Nov 6, 2025

TransformerLensOrg / TransformerLens

A library for mechanistic interpretability of GPT-style language models

Python · 2,717 stars · 465 forks · Updated Nov 8, 2025

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python · 2,403 stars · 245 forks · Updated Nov 7, 2025

maitrix-org / llm-reasoners

A library for advanced large language model reasoning

Python · 2,300 stars · 202 forks · Updated Jun 10, 2025

sierra-research / tau2-bench

τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Python · 394 stars · 74 forks · Updated Nov 7, 2025

ltjed / freephdlabor

freephdlabor: customizing personalized multiagent systems that research 24/7 on your own scientific problem

Python · 286 stars · 43 forks · Updated Oct 22, 2025

FFishy-git / MS-Attn-Simulation

Simulation code for paper "Training Dynamics of Multi-Head Softmax Attention for In-Context Learning: Emergence, Convergence, and Optimality"

Python · 4 stars · 1 fork · Updated Oct 27, 2024

Xintian Pan XintianPan

Lists (1)

✨ Inspiration

Stars

langchain-ai / langchain

hiyouga / LLaMA-Factory

Alibaba-NLP / DeepResearch

langchain-ai / open_deep_research