# singleshot
A CLI tool for testing AI models with a single prompt. Supports multiple providers including OpenAI, Anthropic, Ollama, and OpenRouter.
## Features
- **Multiple Providers**: OpenAI, Anthropic, Ollama, OpenRouter
- **Vision Support**: Send images with your prompts
- **MCP Tools**: Connect to MCP (Model Context Protocol) servers for tool-augmented AI
- **Config Files**: Save and reuse prompt configurations
- **Reports**: Generate markdown reports of your tests
- **Flexible Input**: Prompt via argument, file, config, or stdin
## Installation
```sh
cargo install singleshot
```
Or build from source:
```sh
git clone https://github.com/vincentzhangz/singleshot
cd singleshot
cargo install --path .
```
## Quick Start
```sh
# Simple prompt with Ollama (default)
singleshot chat -p "What is Rust?"

# Use OpenAI
singleshot chat -p "Explain async/await" -P openai

# Use a specific model
singleshot chat -p "Hello" -P anthropic -m claude-sonnet-4-20250514

# Send an image
singleshot chat -p "What's in this image?" -P openai -i photo.jpg
```
## Usage
### Commands
| Command | Description |
|---|---|
| `chat` | Send a prompt to an AI model |
| `models` | List available models for a provider |
| `ping` | Test connection to a provider |
| `generate` | Generate an example config file |
### Chat Options
```
-p, --prompt <PROMPT>        The prompt to send
-f, --file <FILE>            File containing the prompt
-l, --load <LOAD>            Load config from markdown file
-i, --image <IMAGE>          Image file to include
-v, --video <VIDEO>          Video file to include
-a, --audio <AUDIO>          Audio file to include
-P, --provider <PROVIDER>    AI provider [default: ollama]
-m, --model <MODEL>          Model name
-b, --base-url <BASE_URL>    Custom API base URL
-s, --system <SYSTEM>        System prompt
-t, --temperature <TEMP>     Temperature (0.0-2.0) [default: 0.7]
    --max-tokens <TOKENS>    Maximum tokens in response
    --max-turns <TURNS>      Maximum tool call turns [default: 10]
    --mcp <URL>              MCP server URL(s); can be specified multiple times
-d, --detail                 Show detailed information
-r, --report [<FILE>]        Generate report file
```
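These options compose; a sketch combining several of them (the model name, system prompt, and output filename below are just examples):

```sh
# Explicit model, low temperature, capped output length,
# and a markdown report of the run.
singleshot chat -p "Summarize the borrow checker" \
  -P openai -m gpt-4o \
  -s "You are a concise technical writer" \
  -t 0.2 --max-tokens 512 -r summary.md
```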
### Providers
| Provider | Default Model | Environment Variable |
|---|---|---|
| `ollama` | llama3.2 | - |
| `openai` | gpt-4o | `OPENAI_API_KEY` |
| `anthropic` | claude-sonnet-4-20250514 | `ANTHROPIC_API_KEY` |
| `openrouter` | openai/gpt-4o | `OPENROUTER_API_KEY` |
Note: The `openai` provider is compatible with any service that implements the OpenAI Chat Completions API (`/v1/chat/completions`), including:
- Cloud services: OpenAI, Azure OpenAI, Groq, Together AI, Fireworks AI
- Local inference: LM Studio, LocalAI, vLLM, Ollama (with OpenAI compatibility), text-generation-webui
Set a custom base URL with `-b` or the `OPENAI_BASE_URL` environment variable.
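For example, to point the `openai` provider at a local OpenAI-compatible server (the URL below is a typical LM Studio default, not a value singleshot ships with):

```sh
# Same provider code path, different endpoint.
singleshot chat -p "What is Rust?" -P openai -b http://localhost:1234/v1
```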
## Examples
### Read prompt from file
```sh
singleshot chat -f prompt.txt -P openai
```
### Pipe from stdin
echo "Summarize this" | singleshot chat -P openai
### Use a config file
Create a config file with `singleshot generate -o config.md`:
```md
---provider---
openai
---model---
gpt-4o
---base_url---
https://api.openai.com/v1
---temperature---
0.7
---max_tokens---
1024
---system---
You are a helpful coding assistant.
---prompt---
Explain the difference between async and threads.
---file---
path/to/context.txt
---image---
path/to/image.jpg
---video---
path/to/video.mp4
---audio---
path/to/audio.mp3
---max_turns---
10
---mcp---
http://localhost:8080
http://localhost:8081
```
All sections are optional. Only include what you need.
Then run:

```sh
singleshot chat -l config.md
```
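Since every section is optional, a minimal config can be just a provider and a prompt (a sketch using the section names shown above):

```md
---provider---
ollama
---prompt---
Explain the Rust ownership model in three sentences.
```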
You can find more example configurations in the `examples/config` directory, such as `examples/config/rust-wiki.md`, which demonstrates using MCP servers.
### Generate a report
singleshot chat -p "What is Rust?" -P openai -r report.md
### Vision with detailed output
singleshot chat -p "Describe this image" -i photo.jpg -P openai -d -r
### List available models
```sh
singleshot models -P openai
singleshot models -P ollama
```
### Test provider connection
```sh
singleshot ping -P openai
singleshot ping -P ollama
```
### Use MCP Tools
Connect to MCP (Model Context Protocol) servers to give the AI access to external tools:
```sh
# Connect to a single MCP server
singleshot chat -p "Search for Astro components" -P openai --mcp https://mcp.docs.astro.build/mcp

# Connect to multiple MCP servers
singleshot chat -p "How to use Astro?" -P openai \
  --mcp https://mcp.docs.astro.build/mcp \
  --mcp https://mcp.deepwiki.com/mcp

# With detailed output to see configuration
singleshot chat -p "Your prompt" -P openai --mcp http://localhost:8080 -d
```
When using MCP, you'll see tool activity in real time:

```
[+] Connected to MCP server (1 tools: search_astro_docs)
[+] Connected to MCP server (3 tools: read_wiki_structure, read_wiki_contents, ask_question)
[+] MCP tools loaded (4 tools)
[>] Calling tool: search_astro_docs
[+] Tools called: search_astro_docs → ask_question
[+] Received response
```
MCP tools enable the AI to perform actions like:
- Search documentation
- Query databases
- Execute calculations
- Access external APIs
- And more, depending on the MCP server's capabilities
The `--mcp` flag accepts any MCP server URL that implements the Streamable HTTP transport. You can specify `--mcp` multiple times to connect to multiple servers. You can also list MCP servers in your config file under the `---mcp---` section.

Singleshot automatically hints to the model about the available tools to ensure they are used when relevant. The maximum number of tool-call turns defaults to 10 but can be configured via `--max-turns` or the `---max_turns---` config section.
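For instance, raising the turn limit for a tool-heavy prompt (the server URL is a placeholder):

```sh
# Allow up to 20 tool-call turns instead of the default 10.
singleshot chat -p "Gather and summarize the docs" -P openai \
  --mcp http://localhost:8080 --max-turns 20
```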
## Environment Variables
Set API keys for cloud providers:
```sh
export OPENAI_API_KEY="sk-..."
export ANTHROPIC_API_KEY="sk-ant-..."
export OPENROUTER_API_KEY="sk-or-..."
```
For custom endpoints:
```sh
export OPENAI_BASE_URL="https://your-proxy.com/v1"
export OLLAMA_API_BASE_URL="http://remote-server:11434"
```
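Variables can also be set per invocation in the usual shell way, e.g. to target a remote Ollama host for a single run (the hostname is a placeholder):

```sh
# One-off run without exporting the variable globally.
OLLAMA_API_BASE_URL="http://remote-server:11434" singleshot chat -p "Hello"
```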
## License
Code released under the MIT License.