GRPO training code that scales to 32xH100s for long-horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's TerminalBench leaderboard.

Python 345 21 Updated Aug 24, 2025

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 5,073 497 Updated Feb 4, 2026

nex-agi / Nex-N1

103 3 Updated Dec 5, 2025

nex-agi / NexRL

NexRL is an ultra-loosely-coupled LLM post-training framework.

Python 97 5 Updated Feb 4, 2026

openai / codex

Lightweight coding agent that runs in your terminal

Rust 58,952 7,686 Updated Feb 5, 2026

shareAI-lab / learn-claude-code

Bash is all you need: write a nano Claude Code from 0 to 1

Python 16,383 3,546 Updated Feb 1, 2026

OpenHands / OpenHands

🙌 OpenHands: AI-Driven Development

Python 67,498 8,402 Updated Feb 5, 2026

R2E-Gym / R2E-Gym

[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents

Python 231 49 Updated Jul 13, 2025

MLSysOps / Code-Agent-Survey

A survey of Code Agents / Foundation Models for improving development productivity. Become 10x SWE, MLE, etc.

21 Updated Aug 20, 2024

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 18,160 1,399 Updated Jan 21, 2026

bytedance / UI-TARS

Pioneering Automated GUI Interaction with Native Agents

Python 9,242 655 Updated Jan 27, 2026

rucliujn / PPlug

LLMs + Persona-Plug = Personalized LLMs

Python 13 4 Updated Oct 16, 2024

RUCAIBox / CAFE

A novel two-stage coarse-to-fine information-seeking method to enhance the multi-document question-answering capabilities of LLMs.

3 Updated Sep 5, 2025

asgeirtj / system_prompts_leaks

Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini

JavaScript 30,123 4,839 Updated Feb 4, 2026

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,730 2,031 Updated Jan 13, 2026

openai / openai-cookbook

Examples and guides for using the OpenAI API

Jupyter Notebook 71,315 11,939 Updated Feb 4, 2026

browser-use / browser-use

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 77,790 9,205 Updated Feb 5, 2026

xbench-ai / xbench-evals

Evergreen, contamination-free, real-world, domain-specific AI evaluation framework

Python 122 7 Updated Jan 11, 2026

Ayanami0730 / deep_research_bench

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Python 568 64 Updated Nov 22, 2025

letta-ai / letta

Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.

Python 20,983 2,195 Updated Jan 29, 2026

RUCAIBox / ManuSearch

Python 30 2 Updated May 27, 2025

RUCAIBox / CIR

Python 14 1 Updated Nov 11, 2025

huggingface / smolagents

🤗 smolagents: a barebones library for agents that think in code.

Python 25,267 2,280 Updated Jan 23, 2026

CharlesQ9 / Alita

873 48 Updated Aug 30, 2025

RUCAIBox / R1-Searcher-plus

R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning

Python 71 2 Updated May 25, 2025

Song Huatong SsmallSong