A high-performance hermetic integration testing framework that executes tests in isolated Docker containers with OpenTelemetry validation. Version 1.4.1 introduces container pooling for 80% faster test startup and 10x throughput improvements. Define tests declaratively using TOML configuration files and validate runtime behavior with Weaver schema validation.
Performance Revolution: Container Pooling
- 80% faster test startup: 2-5s → 0.1-0.5s through container pre-warming
- 10x higher throughput: 500-1000 concurrent tests (vs 50-100 in v1.3.0)
- Configurable pooling: Tune pool size, idle timeout, and health checks
- Seamless integration: Enable with the CLNRM_ENABLE_POOLING=1 environment variable
- Production-grade: Lock-free hot paths, background health checks, comprehensive metrics
See the Migration Guide for upgrade instructions from v1.4.0, or the v1.3 to v1.4 Migration guide for older versions.
brew tap seanchatmangpt/clnrm
brew install clnrm
cargo install clnrm
- Rust 1.70 or later (for building from source)
- Docker or Podman (for container execution)
- 4GB+ RAM recommended (8GB+ for container pooling)
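Once installed, the health and self-test commands listed under CLI Commands below give a quick sanity check of the local environment:
# Check system health (Docker/Podman availability, resources)
clnrm health
# Run the framework's own self-validation
clnrm self-test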
# Run tests (auto-discovers all *.clnrm.toml files)
clnrm run
# Run specific test with container pooling (80% faster)
CLNRM_ENABLE_POOLING=1 clnrm run tests/api_with_database.clnrm.toml
# Run with maximum concurrency
CLNRM_ENABLE_POOLING=1 clnrm run --parallel --jobs 16
# Run with Weaver live-check validation
clnrm run --live-check --registry registry/
Test an API service with database integration. With container pooling enabled, startup time drops from 2-5s to 0.1-0.5s per test.
[meta]
name = "api_with_database"
version = "1.0.0"
description = "API service with database - Weaver validates telemetry structure"
# Enable Weaver schema validation
[weaver]
enabled = true
registry_path = "registry"
otlp_port = 0 # Auto-discover available port
admin_port = 0 # Auto-discover available port
# Configure OpenTelemetry export to Weaver
[otel]
exporter = "otlp-http"
resources = { "service.name" = "api_service", "deployment.environment" = "test" }
# Multiple services working together
[service.api]
plugin = "generic_container"
image = "my-api:latest"
[service.database]
plugin = "generic_container"
image = "postgres:15-alpine"
# Scenario that emits rich telemetry
[[scenario]]
name = "api_handles_user_request"
service = "api"
run = "my-api --endpoint /api/v1/users"
artifacts.collect = ["spans:default"]
# Validate HTTP server span
[[expect.span]]
name = "http.server.request"
kind = "server"
attrs.all = { "http.method" = "GET", "http.route" = "/api/v1/users" }
# Validate database query span (must be child of HTTP span)
[[expect.span]]
name = "db.query"
kind = "client"
parent = "http.server.request"
attrs.all = { "db.system" = "postgresql", "db.operation" = "SELECT" }
# Validate trace graph structure - proves services actually communicated
[expect.graph]
must_include = [
["http.server.request", "db.query"] # HTTP span must have DB child
]
acyclic = true # No cycles allowed (proves correct trace structure)
# Validate temporal ordering - proves operations happened in correct sequence
[expect.order]
must_precede = [
["http.server.request", "db.query"], # Request must come before query
["db.query", "http.server.response"] # Query must come before response
]
# Ensure no external service leaks - catch accidental production calls
[expect.hermeticity]
no_external_services = true
span_attrs.forbid_keys = ["net.peer.name"] # Forbid external hostnames
This test validates behavior, not just exit codes. Consider what traditional testing misses:
Traditional Testing Problem:
#!/bin/bash
# Fake-green test - passes but does nothing
echo "✅ Test passed"
exit 0
# ❌ Database never queried
# ❌ API never handled request
# ❌ Services never interacted
# ✅ Traditional testing: PASS (exit code 0)
OTEL-First Validation Solution: Your test fails if:
- ❌ No HTTP server span exists → API never actually ran (just returned exit code 0)
- ❌ No database query span → Database was never accessed (test is fake-green)
- ❌ Graph structure wrong → Services didn't actually communicate (no parent-child edge)
- ❌ Temporal ordering violated → Operations happened in wrong sequence (bug in execution)
- ❌ External service calls detected → Test leaked to production (hermeticity violation)
- ❌ Semantic conventions violated → Instrumentation incorrect (Weaver catches this)
OTEL-first validation requires proof of execution through telemetry:
- ✅ Graph structure proves service interaction happened (HTTP → DB edge exists)
- ✅ Temporal ordering proves operations occurred in correct sequence (request before query)
- ✅ Hermeticity catches accidental external service calls (forbidden attributes)
- ✅ Semantic conventions validated automatically by Weaver (correct attribute names)
Weaver automatically validates all of this against OpenTelemetry schemas—no manual trace inspection needed. The test fails if your code doesn't actually execute correctly, even if it returns exit code 0.
Core Testing
- TOML-based test definitions
- Docker container isolation per test step
- Automatic test discovery
- Template variable support with Tera
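As a sketch of the Tera templating support listed above: a test file can interpolate values using Tera's {{ ... }} syntax. The variable name api_tag and how its value is supplied are illustrative assumptions, not documented API:
[meta]
name = "templated_api_test"
version = "1.0.0"
[service.api]
plugin = "generic_container"
# Hypothetical Tera variable; actual variable sources depend on clnrm's template context
image = "my-api:{{ api_tag }}"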
OpenTelemetry Integration
- Weaver live-checking - Automatic schema validation during test execution
- OTLP export for telemetry collection (HTTP/gRPC)
- Resource attribute configuration
- Custom headers and propagators (tracecontext, baggage)
- Sample ratio control
Behavior Validation (Not Just Exit Codes)
Unlike traditional testing that only checks return codes, clnrm validates actual execution through telemetry:
- Span expectations - Validate name, kind, attributes, events, duration
- Graph structure - Ensure correct parent-child relationships and acyclic traces
- Temporal ordering - Prove operations occur in the correct sequence
- Count/cardinality - Validate span, event, and error counts match expectations
- Temporal windows - Ensure spans occur within expected time boundaries
- Status codes - Validate span status (OK, ERROR, UNSET) across the trace
- Hermeticity - Catch accidental external service calls or forbidden attributes
CLI Commands
- clnrm init - Initialize a new test project
- clnrm run - Execute test files with optional pooling and Weaver validation
  - --parallel - Enable parallel test execution
  - --jobs N - Set concurrency limit (default: 4)
  - CLNRM_ENABLE_POOLING=1 - Enable container pooling (80% faster, env var)
  - --live-check - Enable Weaver live-check validation
- clnrm validate - Validate TOML configuration
- clnrm plugins - List available service plugins
- clnrm self-test - Run framework self-validation
- clnrm health - Check system health (Docker, resources)
See CLI Guide for complete reference.
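A typical local workflow chains these commands; this is a sketch based on the usage shown above, and the validate path argument is an assumption (see the CLI Guide for exact syntax):
# Scaffold a new test project
clnrm init
# Validate TOML configuration (path argument assumed for illustration)
clnrm validate tests/api_with_database.clnrm.toml
# Run all tests with pooling and parallelism
CLNRM_ENABLE_POOLING=1 clnrm run --parallel --jobs 8
# Run with Weaver live-check validation against a local registry
clnrm run --live-check --registry registry/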
Cleanroom supports comprehensive OpenTelemetry configuration directly in TOML test files:
Enable automatic schema validation:
[weaver]
enabled = true # Enable Weaver validation
registry_path = "registry" # Path to schema registry
otlp_port = 0 # Auto-discover (0) or fixed port
admin_port = 0 # Auto-discover (0) or fixed port
output_dir = "./validation_output" # Validation report directory
stream = false # Streaming output (real-time)
fail_fast = false # Stop on first violation
[otel]
exporter = "otlp-http" # Export format: stdout, otlp-http, otlp-grpc
endpoint = "http://localhost:4318" # OTLP endpoint URL
protocol = "http/protobuf" # Protocol: http/protobuf, grpc, http/json
sample_ratio = 1.0 # Sampling rate (0.0-1.0)
# Resource attributes
resources = { "service.name" = "my_service", "service.version" = "1.0.0", "deployment.environment" = "test" }
# Custom headers
headers = { "Authorization" = "Bearer token" }
# Context propagators
propagators.use = ["tracecontext", "baggage"]
Validate span structure and attributes:
[[expect.span]]
name = "http.request" # Span name (supports globs)
kind = "server" # Span kind: internal, client, server, producer, consumer
parent = "http.server.request" # Parent span name
# Attribute validation
# All attributes must match
attrs.all = { "http.method" = "GET", "http.route" = "/api/users" }
# Any attribute must match
attrs.any = { "http.status_code" = "200" }
# Event validation
events.all = ["http.request.received", "http.response.sent"]
events.any = ["exception"]
# Duration bounds
duration_ms = { min = 10.0, max = 1000.0 }
Validate trace topology:
[expect.graph]
# Required edges
must_include = [
["http.server.request", "db.query"],
["db.query", "cache.get"]
]
# Forbidden edges
must_not_cross = [
["external.service", "internal.service"]
]
acyclic = true # Ensure no cycles
[expect.counts]
spans_total = { gte = 1, lte = 100 } # Total span count bounds
events_total = { gte = 5 } # Total event count
errors_total = { eq = 0 } # Must have zero errors
# Per-span-name counts
by_name = { "http.request" = { eq = 10 }, "db.query" = { gte = 1 } } # Exactly 10 http.request spans, at least 1 db.query span
[expect.order]
# First must precede second
must_precede = [
["auth.check", "db.query"],
["db.query", "cache.set"]
]
# First must follow second
must_follow = [
["response.sent", "request.received"]
]
[[expect.window]]
outer = "http.server.request" # Outer span defining time window
contains = [ # Spans that must be within window
"db.query",
"cache.get",
"auth.check"
]
[expect.status]
all = "OK" # All spans must have OK status
# Or per-span-name
by_name = { "http.request" = "OK", "error.*" = "ERROR" } # Supports glob patterns
Ensure tests don't leak to external services:
[expect.hermeticity]
no_external_services = true # Forbid external service calls
# Resource attributes must match exactly
resource_attrs.must_match = { "service.name" = "my_service", "deployment.environment" = "test" }
# Forbid certain span attributes (e.g., external network calls)
span_attrs.forbid_keys = [
"net.peer.name", # No external hosts
"http.url" # No external URLs
]
Known security advisory: clnrm depends on tokio-tar, which has RUSTSEC-2025-0111, a file smuggling vulnerability.
Risk Assessment: LOW for normal clnrm usage because:
- Container images from trusted registries only (Docker Hub official images)
- Extraction happens in isolated testcontainer environments
- Filesystems are ephemeral (destroyed after tests)
- No user-provided tar archives are processed
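Users building clnrm from source who want to track the advisory themselves can use cargo-audit (a separate tool, not part of clnrm) to scan the lockfile:
# Install cargo-audit once, then check Cargo.lock for known RUSTSEC advisories
cargo install cargo-audit
cargo audit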
User Guidance: See SECURITY.md for complete details, mitigation strategies, and best practices.
Resolution Plan:
- v1.4.1: Risk documented, monitoring for upstream fix
- v1.4.2: Upgrade when tokio-tar releases security patch
- v1.5.0: Consider migration to alternative tar implementation
For security concerns, see our Security Policy.
Getting Started
- Quick Start Guide - Get started in 5 minutes
- CLI Guide - Complete command-line reference
- Migrating to v1.4.1 - Upgrade guide from v1.3.0
Performance & Optimization
- Container Pooling - Enable 80% faster test startup
- Performance Tuning - Optimize for your workload
- Concurrency Architecture - Technical deep-dive
Configuration & Validation
- TOML Reference - Configuration format
- Weaver TOML Configuration - Weaver live-checking setup
- Advanced Users Guide - Comprehensive documentation (mdbook)
Reference
- Documentation Index - Complete navigation hub
Contributions are welcome. See CONTRIBUTING.md for guidelines.
Licensed under the MIT License. See LICENSE for details.
Repository: github.com/seanchatmangpt/clnrm