Skip to content
View yyht's full-sized avatar

Block or report yyht

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…

Python 4,457 356 Updated Mar 28, 2026

OpenClaw-RL: Train any agent simply by talking

Python 4,336 428 Updated Mar 27, 2026

The agent that grows with you

Python 14,851 1,795 Updated Mar 28, 2026

The official implementation of "ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning"

Python 379 47 Updated Mar 24, 2026

MLEvolve is an open-source autonomous system for end-to-end machine learning algorithm design and optimization powered by progressive search and experience-driven memory.

Python 244 27 Updated Mar 27, 2026

qqr is an RL training framework for open-ended agents.

Python 228 20 Updated Mar 25, 2026

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

Python 13 4 Updated Oct 20, 2025

Spectral Sphere Optimizer

Python 111 2 Updated Mar 23, 2026

A Really Scalable RL Framework to 10k+ CPUs

Python 39 3 Updated Feb 29, 2024

PyTorch-native post-training at scale

Python 661 96 Updated Mar 27, 2026

Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Python 257 19 Updated Jan 17, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,041 5,035 Updated Mar 28, 2026
Shell 11 2 Updated Oct 22, 2025

An interface library for RL post training with environments.

Python 1,406 231 Updated Mar 26, 2026

Build, evaluate and train General Multi-Agent Assistance with ease

Python 1,166 120 Updated Mar 27, 2026

(best/better) practices of megatron on veRL and tuning guide

Shell 132 10 Updated Sep 26, 2025

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 2,907 370 Updated Mar 28, 2026
Python 875 48 Updated Sep 15, 2025

Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications.

Python 379 55 Updated Mar 25, 2026

A library for advanced large language model reasoning

Python 2,340 203 Updated Jun 10, 2025

SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange information across reasoning paths via Revision, Recombination, and Refinement, expandi…

Python 244 30 Updated Sep 23, 2025

[ICLR 2026] A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.

Python 181 18 Updated Jul 6, 2025

Verlog: A Multi-turn RL framework for LLM agents

Python 72 7 Updated Mar 27, 2026

Official repo for IRL-VLA

77 4 Updated Aug 13, 2025

OpenCUA: Open Foundations for Computer-Use Agents

Python 724 95 Updated Feb 4, 2026

Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments

Python 49 6 Updated Jan 8, 2026
Python 341 25 Updated Aug 29, 2025

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 569 37 Updated Nov 26, 2025

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.

Python 554 45 Updated Sep 8, 2025
Next