  • Renmin University of China
  • Beijing


A benchmark for LLMs on complicated tasks in the terminal

Python 1,469 465 Updated Jan 22, 2026

daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently

Python 10 Updated Feb 2, 2026

LLM-in-Sandbox Elicits General Agentic Intelligence

Python 162 8 Updated Jan 27, 2026

GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's TerminalBench leaderboard.

Python 344 21 Updated Aug 24, 2025

Democratizing Reinforcement Learning for LLMs

Python 5,070 496 Updated Feb 3, 2026
102 3 Updated Dec 5, 2025

NexRL is an ultra-loosely-coupled LLM post-training framework.

Python 95 5 Updated Feb 3, 2026

Lightweight coding agent that runs in your terminal

Rust 58,782 7,657 Updated Feb 4, 2026

Bash is all You need - Write a nano Claude Code 0 - 1

Python 16,275 3,531 Updated Feb 1, 2026

🙌 OpenHands: AI-Driven Development

Python 67,452 8,395 Updated Feb 3, 2026

[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents

Python 230 49 Updated Jul 13, 2025

A survey of Code Agents / Foundation Models for improving development productivity. Become 10x SWE, MLE, etc.

21 Updated Aug 20, 2024

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 18,141 1,396 Updated Jan 21, 2026

Pioneering Automated GUI Interaction with Native Agents

Python 9,162 648 Updated Jan 27, 2026

LLMs + Persona-Plug = Personalized LLMs

Python 13 4 Updated Oct 16, 2024

A novel two-stage coarse-to-fine information-seeking method to enhance the multi-document question-answering capabilities of LLMs.

3 Updated Sep 5, 2025

Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini

JavaScript 29,914 4,813 Updated Jan 30, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,718 2,030 Updated Jan 13, 2026

Examples and guides for using the OpenAI API

Jupyter Notebook 71,291 11,938 Updated Feb 3, 2026

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 77,711 9,201 Updated Feb 4, 2026

Evergreen, contamination-free, real-world, domain-specific AI evaluation framework

Python 122 7 Updated Jan 11, 2026

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Python 566 64 Updated Nov 22, 2025

Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.

Python 20,965 2,194 Updated Jan 29, 2026
Python 30 2 Updated May 27, 2025
Python 14 1 Updated Nov 11, 2025

🤗 smolagents: a barebones library for agents that think in code.

Python 25,244 2,278 Updated Jan 23, 2026
873 48 Updated Aug 30, 2025

R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning

Python 71 2 Updated May 25, 2025

🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to answer complex questions. Explore the latest research, bench…

53 5 Updated Aug 28, 2025

DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solving. The framework leverages a top-level planning agent to coo…

JavaScript 3,098 408 Updated Sep 29, 2025