Klavis AI
- San Francisco
- https://www.klavis.ai/
- in/zihao-lin-14137716b
- @ZihaoLin685013
Stars
🏛️ 三省六部制 (Three Departments and Six Ministries) · OpenClaw Multi-Agent Orchestration System: 9 specialized AI agents with real-time dashboard, model config, and full audit trails
AI agents running research on single-GPU nanochat training automatically
Postgres MCP Pro provides configurable read/write access and performance analysis for you and your AI agents.
verl: Volcano Engine Reinforcement Learning for LLMs
All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.
Our library for RL environments + evals
Scalable toolkit for efficient model reinforcement
Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to agent intelligence.
MCP-Universe is a comprehensive framework designed for RL training, benchmarking, and developing AI agents for general tool-use.
A version of verl to support diverse tool use
Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments
[ICLR 2026] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
An open source implementation of code execution with MCP (Programmatic Tool Calling)
A Model Context Protocol server for searching and analyzing arXiv papers
Build Real-Time Knowledge Graphs for AI Agents
The official Python SDK for Eval Protocol
Eval Protocol (EP) is an open solution for doing reinforcement learning fine-tuning on existing agents — across any language, container, or framework.
Open Source AI Platform - AI Chat with advanced features that works with every LLM
Open-source MCP gateway and control plane for teams to govern which tools agents can use, what they can do, and how it’s audited—across agentic IDEs like Cursor, or other agents and AI tools.
AG2 (formerly AutoGen): The Open-Source AgentOS. Join us at: https://discord.gg/sNGSwQME3x
MCPMark is a comprehensive, stress-testing MCP benchmark designed to evaluate model and agent capabilities in real-world MCP use.
A community driven registry service for Model Context Protocol (MCP) servers.
Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors
The official ElevenLabs MCP server
Visual testing tool for MCP servers
OpenAPI Generator allows generation of API client libraries (SDK generation), server stubs, documentation and configuration automatically given an OpenAPI Spec (v2, v3)