yyht

yyht

Achievements

Stars

wanshuiyin / Auto-claude-code-research-in-sleep

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…

Python 4,931 403 Updated Mar 30, 2026

Gen-Verse / OpenClaw-RL

OpenClaw-RL: Train any agent simply by talking

Python 4,444 443 Updated Mar 30, 2026

NousResearch / hermes-agent

The agent that grows with you

Python 19,238 2,327 Updated Mar 31, 2026

sjtu-sai-agents / ML-Master

The official implementation of "ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning"

Python 380 47 Updated Mar 29, 2026

InternScience / MLEvolve

MLEvolve is an open-source autonomous system for end-to-end machine learning algorithm design and optimization powered by progressive search and experience-driven memory.

Python 246 27 Updated Mar 27, 2026

Alibaba-NLP / qqr

qqr is an RL training framework for open-ended agents.

Python 228 20 Updated Mar 25, 2026

SagnikMukherjee / sparsity_in_rl

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

Python 13 4 Updated Oct 20, 2025

Unakar / Spectral-Sphere-Optimizer

Spectral Sphere Optimizer

Python 111 2 Updated Mar 23, 2026

ByteDance-Seed / Seed-Prover

Lean 414 27 Updated Feb 13, 2026

openpsi-project / srl

A Really Scalable RL Framework to 10k+ CPUs

Python 39 3 Updated Feb 29, 2024

meta-pytorch / torchforge

PyTorch-native post-training at scale

Python 663 97 Updated Mar 30, 2026

THUDM / AgentRL

Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Python 259 19 Updated Jan 17, 2026

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,266 5,075 Updated Mar 31, 2026

InfiXAI / InfiR2

Shell 11 2 Updated Oct 22, 2025

meta-pytorch / OpenEnv

An interface library for RL post training with environments.

Python 1,443 250 Updated Mar 30, 2026

inclusionAI / AWorld

Build, evaluate and train General Multi-Agent Assistance with ease

Python 1,168 120 Updated Mar 27, 2026

ISEEKYAN / verl_megatron_practice

(best/better) practices of megatron on veRL and tuning guide

Shell 132 10 Updated Sep 26, 2025

RLinf / RLinf

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 2,951 374 Updated Mar 31, 2026

ByteDance-Seed / seed-oss

Python 875 48 Updated Sep 15, 2025

nvidia-cosmos / cosmos-rl

Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications.

Python 379 55 Updated Mar 31, 2026

maitrix-org / llm-reasoners

A library for advanced large language model reasoning

Python 2,339 203 Updated Jun 10, 2025

JARVIS-Xs / SE-Agent

SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange information across reasoning paths via Revision, Recombination, and Refinement, expandi…

Python 245 30 Updated Sep 23, 2025

OPPO-PersonalAI / TaskCraft

[ICLR 2026] A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.

Python 181 18 Updated Jul 6, 2025

WentseChen / Verlog

Forked from verl-project/verl

Verlog: A Multi-turn RL framework for LLM agents

Python 72 7 Updated Mar 27, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

yyht

Achievements

Achievements

Block or report yyht

Stars

wanshuiyin / Auto-claude-code-research-in-sleep

Gen-Verse / OpenClaw-RL

NousResearch / hermes-agent

sjtu-sai-agents / ML-Master

InternScience / MLEvolve

Alibaba-NLP / qqr

SagnikMukherjee / sparsity_in_rl

Unakar / Spectral-Sphere-Optimizer

ByteDance-Seed / Seed-Prover

openpsi-project / srl

meta-pytorch / torchforge

THUDM / AgentRL

sgl-project / sglang

InfiXAI / InfiR2

meta-pytorch / OpenEnv

inclusionAI / AWorld

ISEEKYAN / verl_megatron_practice

RLinf / RLinf

ByteDance-Seed / seed-oss

nvidia-cosmos / cosmos-rl

maitrix-org / llm-reasoners

JARVIS-Xs / SE-Agent

OPPO-PersonalAI / TaskCraft

WentseChen / Verlog

IRL-VLA / IRL-VLA

xlang-ai / OpenCUA

bytedance / FTRL

InternLM / InternBootcamp

inclusionAI / ASearcher

OPPO-PersonalAI / Agent_Foundation_Models