Stars
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.6, GPT-OSS, Llama, and more!
Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
A high-throughput and memory-efficient inference and serving engine for LLMs
Evaluate and improve models and agents using environments
Scalable toolkit for efficient model reinforcement
R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
Tongyi Deep Research, the Leading Open-source Deep Research Agent
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
A Collection of Papers about Memory for Language Agents
An openclaw plugin for autonomous multi-agent job searching.
Realistic Multi-Agent Fire Evacuation Simulator with LLM-Powered Human Behavior (Mesa Framework). 3rd Place, Agentic Hackathon Zurich (DeepMind x Vercel x ASL).
A multi-hop multimodal RAG system to chat with your PDFs locally, using iterative retrieval and grounded answers from page-level evidence.