-
Renmin University of China
- Beijing
- https://ssmallsong.github.io/
-
-
verl Public
Forked from verl-project/verlverl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedMar 13, 2026 -
OpenClaw-RL Public
Forked from Gen-Verse/OpenClaw-RLOpenClaw-RL: Train any agent simply by talking
TypeScript MIT License UpdatedMar 12, 2026 -
slime Public
Forked from THUDM/slimeslime is an LLM post-training framework for RL Scaling.
Python Apache License 2.0 UpdatedMar 12, 2026 -
OpenRLHF Public
Forked from OpenRLHF/OpenRLHFAn Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
Python Apache License 2.0 UpdatedMar 10, 2026 -
CoPaw Public
Forked from agentscope-ai/QwenPawYour Personal AI Assistant; easy to install, deploy on your own machine or on the cloud; supports multiple chat apps with easily extensible capabilities.
Python Apache License 2.0 UpdatedMar 3, 2026 -
awesome-openclaw-skills Public
Forked from VoltAgent/awesome-openclaw-skillsThe awesome collection of OpenClaw skills. 5,400+ skills filtered and categorized from the official OpenClaw Skills Registry.🦞
MIT License UpdatedFeb 28, 2026 -
harbor Public
Forked from harbor-framework/harborHarbor is a framework for running agent evaluations and creating and using RL environments.
Python Apache License 2.0 UpdatedFeb 28, 2026 -
nanobot Public
Forked from HKUDS/nanobot"🐈 nanobot: The Ultra-Lightweight OpenClaw"
Python MIT License UpdatedFeb 27, 2026 -
terminal-bench-2 Public
Forked from harbor-framework/terminal-bench-2Shell Apache License 2.0 UpdatedFeb 27, 2026 -
llm-in-sandbox Public
Forked from llm-in-sandbox/llm-in-sandboxPython Apache License 2.0 UpdatedJan 23, 2026 -
terminal-bench Public
Forked from harbor-framework/terminal-benchA benchmark for LLMs on complicated tasks in the terminal
Python Apache License 2.0 UpdatedJan 22, 2026 -
rllm Public
Forked from rllm-org/rllmDemocratizing Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedJan 2, 2026 -
OpenHands Public
Forked from OpenHands/OpenHands🙌 OpenHands: Code Less, Make More
Python Other UpdatedOct 1, 2025 -
-
terminal-bench-rl Public
Forked from Danau5tin/terminal-bench-rlGRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's TerminalBench leaderboard.
Python UpdatedAug 24, 2025 -
R2E-Gym Public
Forked from R2E-Gym/R2E-Gym[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents
Python Apache License 2.0 UpdatedJul 13, 2025 -
deep_research_bench Public
Forked from Ayanami0730/deep_research_benchPython Apache License 2.0 UpdatedJun 13, 2025 -
smolagents Public
Forked from huggingface/smolagents🤗 smolagents: a barebones library for agents that think in code.
Python Apache License 2.0 UpdatedMay 30, 2025 -
DeepResearchAgent Public
Forked from SkyworkAI/DeepResearchAgentFluent MIT License UpdatedMay 27, 2025 -
-
-
Smart-Searcher Public
Smart-Searcher: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
MIT License UpdatedMay 21, 2025 -
deer-flow Public
Forked from bytedance/deer-flowDeerFlow is a community-driven framework for deep research, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
TypeScript MIT License UpdatedMay 8, 2025 -
-
-
DeepResearcher Public
Forked from GAIR-NLP/DeepResearcherPython Apache License 2.0 UpdatedApr 3, 2025 -
ii-researcher Public
Forked from Intelligent-Internet/ii-researcherII-Researcher: a new open-source framework designed to aid building search / research agents
Python Apache License 2.0 UpdatedMar 31, 2025 -
Slow_Thinking_with_LLMs Public
Forked from RUCAIBox/Slow_Thinking_with_LLMsA series of technical report on Slow Thinking with LLM
Python UpdatedMar 16, 2025 -
OpenManus Public
Forked from FoundationAgents/OpenManusNo fortress, purely open ground. OpenManus is Coming.
Python MIT License UpdatedMar 10, 2025