View Zeng-WH's full-sized avatar
🔭
Researching

Code search MCP for Claude Code. Make entire codebase the context for any coding agent.

TypeScript 4,823 442 Updated Sep 16, 2025

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 29,859 3,027 Updated Dec 23, 2025

A Model Context Protocol server for Excel file manipulation

Python 3,016 342 Updated Dec 22, 2025

The official code of ARPO & AEPO

Python 831 38 Updated Dec 20, 2025

This MCP server integrates with your Google Drive and Google Sheets, to enable creating and modifying spreadsheets.

Python 565 153 Updated Dec 6, 2025

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Python 191 15 Updated Dec 23, 2025

Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification

Python 20 1 Updated Oct 8, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 75,499 2,375 Updated Dec 23, 2025

A Gym for Agentic LLMs

Python 409 27 Updated Dec 23, 2025

A Model Context Protocol (MCP) server for Gmail integration in Claude Desktop with auto authentication support. This server enables AI assistants to manage Gmail through natural language interactions.

JavaScript 870 239 Updated Aug 6, 2025

Muon is an optimizer for hidden layers in neural networks

Python 2,119 99 Updated Nov 23, 2025

Muon is Scalable for LLM Training

1,387 78 Updated Aug 3, 2025

End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Python 341 20 Updated Sep 22, 2025

From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.

Python 23 1 Updated Oct 7, 2025

Train your Agent model via our easy and efficient framework

Python 1,668 156 Updated Dec 5, 2025

[ICML 2025] Official Implementation of GLIDER

Python 72 6 Updated Oct 9, 2025

Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

Python 61 4 Updated May 22, 2025
Python 132 6 Updated May 14, 2025

🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning

Python 297 19 Updated Oct 24, 2025

Scaling Deep Research via Reinforcement Learning in Real-world Environments.

Python 676 46 Updated Oct 15, 2025

Our library for RL environments + evals

Python 3,655 454 Updated Dec 23, 2025

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

Python 778 182 Updated Dec 22, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 12,756 1,177 Updated Sep 26, 2025

Lightweight coding agent that runs in your terminal

Rust 54,554 6,930 Updated Dec 23, 2025

Async pipelined version of Verl

Python 125 13 Updated Apr 8, 2025

Democratizing Reinforcement Learning for LLMs

Python 4,896 468 Updated Dec 21, 2025

[NeurIPS 2025] 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability

Python 1,376 138 Updated Dec 8, 2025

Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"

Python 148 6 Updated Oct 23, 2025