Stars
ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…
OpenClaw-RL: Train any agent simply by talking
The official implementation of "ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning"
MLEvolve is an open-source autonomous system for end-to-end machine learning algorithm design and optimization powered by progressive search and experience-driven memory.
qqr is an RL training framework for open-ended agents.
Reinforcement Learning Finetunes Small Subnetworks in Large Language Models
Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
SGLang is a high-performance serving framework for large language models and multimodal models.
An interface library for RL post training with environments.
Build, evaluate and train General Multi-Agent Assistance with ease
(best/better) practices of megatron on veRL and tuning guide
RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI
Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications.
A library for advanced large language model reasoning
SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange information across reasoning paths via Revision, Recombination, and Refinement, expandi…
[ICLR 2026] A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.
WentseChen / Verlog
Forked from verl-project/verlVerlog: A Multi-turn RL framework for LLM agents
OpenCUA: Open Foundations for Computer-Use Agents
Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments
An Open-Source Large-Scale Reinforcement Learning Project for Search Agents
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.