Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,068 644 Updated Dec 24, 2025

SWE-agent / mini-swe-agent

The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!

Python 2,368 303 Updated Dec 23, 2025

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—language models

Python 31,014 2,502 Updated Dec 23, 2025

langfengQ / verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,314 117 Updated Dec 11, 2025

a2aproject / A2A

An open protocol enabling communication and interoperability between opaque agentic applications.

Shell 21,185 2,168 Updated Dec 24, 2025

supermemoryai / supermemory

Memory engine and app that is extremely fast, scalable. The Memory API for the AI era.

TypeScript 13,817 1,466 Updated Dec 24, 2025

modelcontextprotocol / python-sdk

The official Python SDK for Model Context Protocol servers and clients

Python 20,807 2,928 Updated Dec 19, 2025

allenai / super-benchmark

Jupyter Notebook 49 4 Updated Apr 4, 2025

codefuse-ai / Awesome-Code-LLM

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

3,150 213 Updated Dec 23, 2025

Alex-Mathai-98 / kGym-Kernel-Gym

KGym - A platform to run hundreds to thousands of ML4Linux kernel experiments at scale

Python 14 Updated Nov 8, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bryan Lu luyuzhe111

Achievements

Achievements

Block or report luyuzhe111

Stars

amazon-science / MigrationBench

QwenLM / qwen-code

OpenHands / benchmarks

AnjieCheng / NaVILA

fla-org / flash-linear-attention

amazon-far / residual-offpolicy-rl

NovaSky-AI / SkyRL

snap-stanford / Biomni

OpenHands / OpenHands

StonyBrookNLP / appworld

OpenHands / software-agent-sdk

meta-pytorch / tritonbench

Physical-Intelligence / openpi

rasbt / LLMs-from-scratch

THUDM / slime

skyzh / tiny-llm

luyuzhe111 / MemeryAgent

sierra-research / tau2-bench

modelscope / MCPBench

BoundaryML / baml

OpenPipe / ART