gongel

🎯

Focusing

77 followers · 54 following

Beijing

Achievements

x2 x3

Lists (1)

🔮 Future ideas

Stars

modelscope / AgentJet

Cutting-edge platform for LLM agent tuning. Deliver RL tuning with flexibility, reliability, speed, multi-agent optimization and realtime community benchmarking.

Python 219 24 Updated Jun 4, 2026

Accio-Lab / Dressage

Python 64 4 Updated Jun 20, 2026

vllm-project / vime

An LLM post-training framework with vLLM for RL Scaling

Python 290 30 Updated Jun 22, 2026

NVIDIA-NeMo / ProRL-Agent-Server

Agentic RL on Any Harness at Scale

Python 580 61 Updated Jun 17, 2026

redai-infra / Relax

An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale

Python 432 51 Updated Jun 18, 2026

NousResearch / hermes-agent

The agent that grows with you

Python 199,299 35,398 Updated Jun 22, 2026

NanmiCoder / cc-haha

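Leaked Claude Code source: a locally runnable version, with a new cross-platform desktop app that fills in Computer Use (includes an analysis of the core modules)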

TypeScript 12,797 8,295 Updated Jun 20, 2026

Gen-Verse / OpenClaw-RL

OpenClaw-RL: Train any agent simply by talking

Python 5,515 598 Updated May 23, 2026

anomalyco / opencode

The open source coding agent.

TypeScript 177,111 21,622 Updated Jun 22, 2026

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,437 707 Updated May 17, 2026

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python 2,015 358 Updated Jun 22, 2026

AMAP-ML / Tree-GRPO

[ICLR 2026] Tree Search for LLM Agent Reinforcement Learning

Python 376 37 Updated Jan 26, 2026

WooooDyy / BAPO

Code for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping" by Zhiheng Xi et al.

Python 93 6 Updated Jan 29, 2026