Skip to content
View nuoyanLyu's full-sized avatar

Block or report nuoyanLyu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.6, GPT-OSS, Llama, and more!

Python 9,969 892 Updated Jun 12, 2026

The official paper for EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL.

Python 65 1 Updated Jun 5, 2026

面向 Claude Code / Codex / OpenCode / Gemini 的多通道AI CLI 任务完成提醒,支持耗时阈值、桌面端与命令行、通用 Webhook(飞书/钉钉/企微)、Telegram、邮件、桌面/声音提示,配备自动监听日志,AI摘要等功能

JavaScript 333 21 Updated Jun 4, 2026

An interface library for RL post training with environments.

Python 2,198 393 Updated Jun 13, 2026

The official implementation of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis".

Python 160 13 Updated Feb 12, 2026

Agentic RL on Any Harness at Scale

Python 554 57 Updated Jun 13, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,228 289 Updated Jun 14, 2026

Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…

TeX 9,677 728 Updated Apr 28, 2026

Scalable and extensible reinforcement learning for LM agents.

Python 119 13 Updated May 6, 2026

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Python 375 44 Updated May 28, 2026

A clean IFEval implementation

Python 15 2 Updated Oct 9, 2025

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,444 120 Updated Apr 17, 2026

Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)

Python 311 46 Updated Jun 5, 2026

Build and run agents you can see, understand and trust.

Python 26,797 3,005 Updated Jun 12, 2026

A repo for open research on building large reasoning models

Python 148 19 Updated Mar 3, 2026

This code can be used to generate simulated NIRCam, NIRISS, or FGS data

Python 47 42 Updated May 27, 2026

We introduce BabyVision, a benchmark revealing the infancy of AI vision.

Python 220 9 Updated Jan 13, 2026

Evaluating LLMs with CommonGen-Lite

Python 95 3 Updated Mar 21, 2024

Google Research

Jupyter Notebook 38,128 8,429 Updated Jun 12, 2026

SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models

Python 5 Updated May 14, 2025
Python 64 4 Updated Oct 25, 2025

Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning

Python 154 2 Updated Jun 1, 2026

Code and Data

Python 2 Updated Jan 1, 2026
Python 48 18 Updated Jul 22, 2024

A collective list of free APIs

Python 441,491 48,385 Updated Jun 13, 2026

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,464 130 Updated Nov 9, 2025

Simple RL training for reasoning

Python 3,865 287 Updated Dec 23, 2025

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…

Python 4,003 208 Updated Jun 12, 2026

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,447 486 Updated Jun 9, 2026

Awesome List for Agentic RL

HTML 1,563 61 Updated May 26, 2026
Next