SsmallSong

Song Huatong SsmallSong

Student of RUC and undergraduate majored in AI and Fintech. Developed a 2.4B parameter LLM that was pre-trained from scratch.

38 followers · 31 following

Renmin University of China
Beijing

Achievements

Stars

laude-institute / terminal-bench

A benchmark for LLMs on complicated tasks in the terminal

Python 1,469 465 Updated Jan 22, 2026

GAIR-NLP / daVinci-Agency

daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently

Python 10 Updated Feb 2, 2026

llm-in-sandbox / llm-in-sandbox

LLM-in-Sandbox Elicits General Agentic Intelligence

Python 162 8 Updated Jan 27, 2026

Danau5tin / terminal-bench-rl

GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's TerminalBench leaderboard.

Python 344 21 Updated Aug 24, 2025

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 5,070 496 Updated Feb 3, 2026

nex-agi / Nex-N1

102 3 Updated Dec 5, 2025

nex-agi / NexRL

NexRL is an ultra-loosely-coupled LLM post-training framework.

Python 95 5 Updated Feb 3, 2026

openai / codex

Lightweight coding agent that runs in your terminal

Rust 58,782 7,657 Updated Feb 4, 2026

shareAI-lab / learn-claude-code

Bash is all You need - Write a nano Claude Code 0 - 1

Python 16,275 3,531 Updated Feb 1, 2026

OpenHands / OpenHands

🙌 OpenHands: AI-Driven Development

Python 67,452 8,395 Updated Feb 3, 2026

R2E-Gym / R2E-Gym

[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents

Python 230 49 Updated Jul 13, 2025

MLSysOps / Code-Agent-Survey

A survey of Code Agents / Foundation Models for improving development productivity. Become 10x SWE, MLE, etc.

21 Updated Aug 20, 2024

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 18,141 1,396 Updated Jan 21, 2026

bytedance / UI-TARS

Pioneering Automated GUI Interaction with Native Agents

Python 9,162 648 Updated Jan 27, 2026

rucliujn / PPlug

LLMs + Persona-Plug = Personalized LLMs

Python 13 4 Updated Oct 16, 2024

RUCAIBox / CAFE

A novel two-stage coarse-to-fine information-seeking method to enhance the multi-document question-answering capabilities of LLMs.

3 Updated Sep 5, 2025

asgeirtj / system_prompts_leaks

Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini

JavaScript 29,914 4,813 Updated Jan 30, 2026

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,718 2,030 Updated Jan 13, 2026

openai / openai-cookbook

Examples and guides for using the OpenAI API

Jupyter Notebook 71,291 11,938 Updated Feb 3, 2026

browser-use / browser-use

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 77,711 9,201 Updated Feb 4, 2026

xbench-ai / xbench-evals

Evergreen, contamination-free, real-world, domain-specific AI evaluation framework

Python 122 7 Updated Jan 11, 2026

Ayanami0730 / deep_research_bench

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Python 566 64 Updated Nov 22, 2025

letta-ai / letta

Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.

Python 20,965 2,194 Updated Jan 29, 2026

RUCAIBox / ManuSearch

Python 30 2 Updated May 27, 2025

RUCAIBox / CIR

Python 14 1 Updated Nov 11, 2025

huggingface / smolagents

🤗 smolagents: a barebones library for agents that think in code.

Python 25,244 2,278 Updated Jan 23, 2026

CharlesQ9 / Alita

873 48 Updated Aug 30, 2025

RUCAIBox / R1-Searcher-plus

R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning

Python 71 2 Updated May 25, 2025

qhjqhj00 / awesome-agentic-search

🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to answer complex questions. Explore the latest research, bench…

53 5 Updated Aug 28, 2025

SkyworkAI / DeepResearchAgent

DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solving. The framework leverages a top-level planning agent to coo…

JavaScript 3,098 408 Updated Sep 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Song Huatong SsmallSong

Achievements

Achievements

Block or report SsmallSong

Stars

laude-institute / terminal-bench

GAIR-NLP / daVinci-Agency

llm-in-sandbox / llm-in-sandbox

Danau5tin / terminal-bench-rl

rllm-org / rllm

nex-agi / Nex-N1

nex-agi / NexRL

openai / codex

shareAI-lab / learn-claude-code

OpenHands / OpenHands

R2E-Gym / R2E-Gym

MLSysOps / Code-Agent-Survey

Alibaba-NLP / DeepResearch

bytedance / UI-TARS

rucliujn / PPlug

RUCAIBox / CAFE

asgeirtj / system_prompts_leaks

openai / gpt-oss

openai / openai-cookbook

browser-use / browser-use

xbench-ai / xbench-evals

Ayanami0730 / deep_research_bench

letta-ai / letta

RUCAIBox / ManuSearch

RUCAIBox / CIR

huggingface / smolagents

CharlesQ9 / Alita

RUCAIBox / R1-Searcher-plus

qhjqhj00 / awesome-agentic-search

SkyworkAI / DeepResearchAgent