SsmallSong

Song Huatong SsmallSong

Homepage: https://ssmallsong.github.io/

50 followers · 39 following

Renmin University of China
Beijing
https://ssmallsong.github.io/

Achievements

SsmallSong.github.io Public

SCSS MIT License Updated Mar 29, 2026
verl Public
Forked from verl-project/verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python Apache License 2.0 Updated Mar 13, 2026
OpenClaw-RL Public
Forked from Gen-Verse/OpenClaw-RL

OpenClaw-RL: Train any agent simply by talking

TypeScript MIT License Updated Mar 12, 2026
slime Public
Forked from THUDM/slime

slime is an LLM post-training framework for RL Scaling.

Python Apache License 2.0 Updated Mar 12, 2026
OpenRLHF Public
Forked from OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python Apache License 2.0 Updated Mar 10, 2026
CoPaw Public
Forked from agentscope-ai/QwenPaw

Your Personal AI Assistant; easy to install, deploy on your own machine or on the cloud; supports multiple chat apps with easily extensible capabilities.

Python Apache License 2.0 Updated Mar 3, 2026
awesome-openclaw-skills Public
Forked from VoltAgent/awesome-openclaw-skills

The awesome collection of OpenClaw skills. 5,400+ skills filtered and categorized from the official OpenClaw Skills Registry.🦞

MIT License Updated Feb 28, 2026
harbor Public
Forked from harbor-framework/harbor

Harbor is a framework for running agent evaluations and creating and using RL environments.

Python Apache License 2.0 Updated Feb 28, 2026
nanobot Public
Forked from HKUDS/nanobot

"🐈 nanobot: The Ultra-Lightweight OpenClaw"

Python MIT License Updated Feb 27, 2026
terminal-bench-2 Public
Forked from harbor-framework/terminal-bench-2

Shell Apache License 2.0 Updated Feb 27, 2026
llm-in-sandbox Public
Forked from llm-in-sandbox/llm-in-sandbox

Python Apache License 2.0 Updated Jan 23, 2026
terminal-bench Public
Forked from harbor-framework/terminal-bench

A benchmark for LLMs on complicated tasks in the terminal

Python Apache License 2.0 Updated Jan 22, 2026
rllm Public
Forked from rllm-org/rllm

Democratizing Reinforcement Learning for LLMs

Python Apache License 2.0 Updated Jan 2, 2026
OpenHands Public
Forked from OpenHands/OpenHands

🙌 OpenHands: Code Less, Make More

Python Other Updated Oct 1, 2025
UI-TARS Public
Forked from bytedance/UI-TARS

Python Apache License 2.0 Updated Sep 5, 2025
terminal-bench-rl Public
Forked from Danau5tin/terminal-bench-rl

GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's TerminalBench leaderboard.

Python Updated Aug 24, 2025
R2E-Gym Public
Forked from R2E-Gym/R2E-Gym

[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents

Python Apache License 2.0 Updated Jul 13, 2025
deep_research_bench Public
Forked from Ayanami0730/deep_research_bench

Python Apache License 2.0 Updated Jun 13, 2025
smolagents Public
Forked from huggingface/smolagents

🤗 smolagents: a barebones library for agents that think in code.

Python Apache License 2.0 Updated May 30, 2025
DeepResearchAgent Public
Forked from SkyworkAI/DeepResearchAgent

Fluent MIT License Updated May 27, 2025
Alita Public
Forked from CharlesQ9/Alita

Updated May 27, 2025
R1-Searcher-plus Public
Forked from RUCAIBox/R1-Searcher-plus

Updated May 22, 2025
Smart-Searcher Public

Smart-Searcher: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning

MIT License Updated May 21, 2025
deer-flow Public
Forked from bytedance/deer-flow

DeerFlow is a community-driven framework for deep research, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

TypeScript MIT License Updated May 8, 2025
SimpleDeepSearcher Public
Forked from RUCAIBox/SimpleDeepSearcher

MIT License Updated Apr 11, 2025
EASYEP Public
Forked from RUCAIBox/EASYEP

Python Updated Apr 9, 2025
DeepResearcher Public
Forked from GAIR-NLP/DeepResearcher

Python Apache License 2.0 Updated Apr 3, 2025
ii-researcher Public
Forked from Intelligent-Internet/ii-researcher

II-Researcher: a new open-source framework designed to aid building search / research agents

Python Apache License 2.0 Updated Mar 31, 2025
Slow_Thinking_with_LLMs Public
Forked from RUCAIBox/Slow_Thinking_with_LLMs

A series of technical report on Slow Thinking with LLM

Python Updated Mar 16, 2025
OpenManus Public
Forked from FoundationAgents/OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python MIT License Updated Mar 10, 2025

Song Huatong SsmallSong

Achievements

Achievements

SsmallSong.github.io Public

Uh oh!

verl Public

Uh oh!

OpenClaw-RL Public

Uh oh!

slime Public

Uh oh!

OpenRLHF Public

Uh oh!

CoPaw Public

Uh oh!

awesome-openclaw-skills Public

Uh oh!

harbor Public

Uh oh!

nanobot Public

Uh oh!

terminal-bench-2 Public

Uh oh!

llm-in-sandbox Public

Uh oh!

terminal-bench Public

Uh oh!

rllm Public

Uh oh!

OpenHands Public

Uh oh!

UI-TARS Public

Uh oh!

terminal-bench-rl Public

Uh oh!

R2E-Gym Public

Uh oh!

deep_research_bench Public

Uh oh!

smolagents Public

Uh oh!

DeepResearchAgent Public

Uh oh!

Alita Public

Uh oh!

R1-Searcher-plus Public

Uh oh!

Smart-Searcher Public

Uh oh!

deer-flow Public

Uh oh!

SimpleDeepSearcher Public

Uh oh!

EASYEP Public

Uh oh!

DeepResearcher Public

Uh oh!

ii-researcher Public

Uh oh!

Slow_Thinking_with_LLMs Public

Uh oh!

OpenManus Public

Uh oh!