mcpchecker

Test your MCP servers by having AI agents complete real tasks.

Why mcpchecker?

You've built an MCP server with tools. It works. But can an AI agent actually discover and use your tools correctly? Are your descriptions clear enough? Does your server handle edge cases?

mcpchecker answers these questions automatically. It runs real AI agents against your MCP server, records every tool call, and verifies that tasks complete successfully. Think of it as integration testing for AI tool use.

Install

brew tap mcpchecker/mcpchecker
brew install mcpchecker

For other platforms (Linux, manual download), see Getting Started.

Quick Start

mcpchecker check eval.yaml

This runs an evaluation that:

Starts your MCP server and sets up an MCP proxy to record tool calls
Gives an AI agent a task prompt
Verifies the task succeeded (via scripts or LLM judge)
Checks assertions against the recorded behavior

Results are saved to mcpchecker-<name>-out.json with a pass/fail summary printed to the terminal.

For hands-on tutorials, see Quickstarts.

How It Works

mcpchecker places a recording proxy between the agent and your MCP server:

AI Agent --> MCP Proxy (recording) --> Your MCP Server

If agents can discover and use your tools to complete tasks, your server is well-designed. If they can't, the recorded call history helps you figure out why.

Documentation

Getting started:

Installation and first run

How-to guides:

Configure agents -- Claude Code, LLM agents, custom agents, ACP mode
Write tasks -- task structure, labels, filtering, extensions
Use assertions -- validate tool usage, call order, resource access
LLM judge verification -- semantic evaluation of agent responses
Parallel execution and multi-run -- speed up evals and test consistency

Reference:

Understanding:

How it works -- architecture and evaluation flow

Building from Source

go build -o mcpchecker ./cmd/mcpchecker

License

See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 158 Commits
.claude/skills/create-eval		.claude/skills/create-eval
.github		.github
cmd		cmd
docs		docs
examples		examples
functional		functional
internal/gendocs		internal/gendocs
pkg		pkg
.gitignore		.gitignore
.goreleaser.yaml		.goreleaser.yaml
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
RELEASING.md		RELEASING.md
create-nginx-pod-error.txt		create-nginx-pod-error.txt
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mcpchecker

Why mcpchecker?

Install

Quick Start

How It Works

Documentation

Building from Source

License

About

Uh oh!

Releases 30

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

mcpchecker

Why mcpchecker?

Install

Quick Start

How It Works

Documentation

Building from Source

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 30

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages