-
The Hong Kong University of Science and Technology (HKUST)
- Hong Kong
-
01:15
(UTC -12:00) - https://github.com/Zeng-WH
- @AndrewZeng17
- https://scholar.google.com.hk/citations?user=EXSJgXIAAAAJ&hl=zh-CN
Stars
Code search MCP for Claude Code. Make entire codebase the context for any coding agent.
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
A Model Context Protocol server for Excel file manipulation
This MCP server integrates with your Google Drive and Google Sheets, to enable creating and modifying spreadsheets.
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification
An extremely fast Python package and project manager, written in Rust.
A Model Context Protocol (MCP) server for Gmail integration in Claude Desktop with auto authentication support. This server enables AI assistants to manage Gmail through natural language interactions.
Muon is an optimizer for hidden layers in neural networks
End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.
Train your Agent model via our easy and efficient framework
Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping
🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
Our library for RL environments + evals
Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
Lightweight coding agent that runs in your terminal
[NeurIPS 2025] 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability
Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"