Temper Labs

Red team your AI agents and prompts before attackers do.

TemperLLM is an open-source security testing tool for LLM systems. It runs adversarial attacks against your system prompts and AI agents, giving you a detailed security report with scores, verdicts, and recommendations.

Features

Prompt Testing

25 adversarial attacks across 7 categories (Prompt Leaking, Context Manipulation, Roleplay, Encoding, Crescendo, Evaluation Exploit, Emotional)
Test system prompts for jailbreaks, prompt injection, and information leakage

Agent Testing

36 capability-based attacks across 5 categories (Data Exfiltration, Unauthorized Actions, Code Execution, Persistence, Reconnaissance)
Test AI agents with tools (email, files, terminal, web, database, payment, etc.)
Automated judge evaluates if agents follow malicious instructions

General

Multi-provider support — OpenAI, Anthropic, Mistral, Groq (free tier available)
Real-time streaming — results appear as each attack completes
Privacy-first — your API key is never stored or logged
Stats tracking — Supabase integration for anonymous aggregate statistics
Clean UI — Editorial premium design with forest green accents

Quick Start

git clone https://github.com/marti-farre/temperlabs.git
cd temperlabs
npm install
cp .env.example .env.local  # Add your API keys
npm run dev

Open http://localhost:3000 in your browser.

How It Works

Prompt Testing Mode

Configure — Choose your provider (OpenAI, Anthropic, Mistral, or Groq), select a model, and enter your API key
Test — Paste your system prompt and click "Run Security Test"
Review — Get a score out of 25, see which attacks passed/failed, and read recommendations

Agent Testing Mode

Configure — Select which capabilities your agent has (email, files, payment, etc.)
Define — Write your agent's system prompt with security rules
Test — TemperLLM runs targeted attacks based on selected capabilities
Review — See which attacks the agent blocked, warned about, or failed

Supported Providers & Models

Provider	Models	Notes
OpenAI	GPT-4o, GPT-4o Mini, GPT-4 Turbo	Full support
Anthropic	Claude 3.5 Sonnet, Claude 3.5 Haiku, Claude 3 Opus	Full support
Mistral	Mistral Large, Mistral Small	Full support
Groq	Llama 3.1 8B, Llama 3.3 70B, Mixtral 8x7B	Free tier available!

Environment Variables

Create a .env.local file:

# Optional: For stats tracking
NEXT_PUBLIC_SUPABASE_URL=https://your-project.supabase.co
SUPABASE_SERVICE_ROLE_KEY=your_service_role_key

# Optional: For free model testing
GROQ_API_KEY=gsk_your_groq_api_key

Or test with your own API keys via the UI (no environment variables needed).

Deployment

Deploy to Cloudflare Pages, Vercel, or any Next.js-compatible platform.

See DEPLOY_CLOUDFLARE.md for Cloudflare Pages setup.

Tech Stack

Next.js 15 (App Router)
React 19
TypeScript
Tailwind CSS
Framer Motion
Supabase (optional, for stats)

API Key Security

Your API key never leaves your browser except to call the provider's API through our server (to avoid CORS). We never store, log, or persist your key. The entire codebase is open source for verification.

Contributing

Contributions welcome! Feel free to:

Add new attack patterns
Support additional providers
Improve the judge accuracy
Enhance the UI/UX

License

MIT — see LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
app		app
components		components
hooks		hooks
lib		lib
public		public
scripts		scripts
.env.example		.env.example
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
.npmrc		.npmrc
DEPLOY_CLOUDFLARE.md		DEPLOY_CLOUDFLARE.md
LICENSE		LICENSE
README.md		README.md
next.config.mjs		next.config.mjs
open-next.config.ts		open-next.config.ts
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tailwind.config.ts		tailwind.config.ts
tsconfig.json		tsconfig.json
wrangler.toml		wrangler.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Temper Labs

Features

Prompt Testing

Agent Testing

General

Quick Start

How It Works

Prompt Testing Mode

Agent Testing Mode

Supported Providers & Models

Environment Variables

Deployment

Tech Stack

API Key Security

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Temper Labs

Features

Prompt Testing

Agent Testing

General

Quick Start

How It Works

Prompt Testing Mode

Agent Testing Mode

Supported Providers & Models

Environment Variables

Deployment

Tech Stack

API Key Security

Contributing

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages