NoPilot

An AI Native personal development workflow framework for Greenfield projects.

What It Is

NoPilot is a three-stage workflow that takes you from requirement exploration to shipping code with minimal human involvement downstream. Each stage builds on the previous, with AI handling possibility generation and execution while humans make decisions.

Stages:

/discover — Progressive idea collection → direction selection → MVP definition → design philosophy → requirement lock
/spec — Expand locked requirements into module-level design specifications
/build — Autonomous TDD implementation with tracer bullet validation and independent acceptance review
/visualize — Generate interactive HTML dashboards from JSON artifacts

What you get: Structured JSON artifacts at each stage that serve as machine-readable contracts, with HTML visualization for human review. Perfect traceability from requirements through to delivered code.

Long-term Vision

NoPilot's North Star is an enterprise-grade AI delivery governance platform — a control layer that lets organizations adopt AI coding agents (Claude Code, Codex, Cursor, Copilot, OpenCode, …) without losing auditability, contract integrity, or release safety.

We get there in three stages, each building on the previous:

Personal layer (current — V1.x): Single-developer Greenfield workflow framework. Spec-as-contract, generation-review separation, multi-agent orchestration via Lash.
Team layer (V2 → V3): Shared specs and decision ledger, PR / issue / CI-failure entry points, web dashboard for traceability, multi-model verification, MCP enforcement.
Enterprise layer (V4 → North Star): SSO/RBAC, audit logs, private deployment, compliance reports, AI Delivery Governance — risk-tiered release contracts that integrate with (never replace) the team's existing CI/CD, feature flags, APM, and incident management tools.

The framing is consistent across all three stages: NoPilot governs the AI-generated change; existing infrastructure executes it.

Installation

Prerequisites

Claude Code installed and configured
Codex CLI and OpenCode CLI are also supported for shared skill installation and Lash workers
Node.js >= 20.0.0

Install

npm install -g nopilot

This installs two CLIs:

nopilot — Framework tools (project initialization)
lash — Build runtime (multi-agent orchestration)

Initialize a project

cd your-project
nopilot init

This renders the package skills from commands/ into ~/.claude/skills/ for Claude Code and ~/.agents/skills/ for Codex/OpenCode (shared), creates the specs/ directory, and appends Lash auto-trigger context to any existing CLAUDE.md, AGENTS.md, or opencode.md. Schemas and workflow.json stay in the npm package — run nopilot paths to locate them.

Start using

Open your AI coding tool and start from the installed discover skill.

claude   # Claude Code: then run /discover

Codex and OpenCode share the installed skills under ~/.agents/skills/.

Why This Approach

Humans are decision-makers, not executors. You define intent and choose from possibilities. AI generates options and runs them. You never say "how" — only "which one."
Less human involvement downstream. Deep participation in /discover (where direction is uncertain) means you can go AFK during /build (where direction is locked).
All dimensions appear simultaneously. Requirements, feasibility, competitive risks, and effort emerge together so you decide with full context.
Spec is contract, not document. Every output is structured JSON consumed by downstream stages.
AI autonomy with full audit trail. Low-risk technical details are decided by AI without interrupting flow. Every decision gets recorded.
Failures route to decisions, not execution. When something breaks, it means an upstream decision needs revision, not that code needs debugging.

Non-Goals

To keep NoPilot focused, we explicitly do not plan to:

Replace CI/CD pipelines. Jenkins, GitHub Actions, ArgoCD, Spinnaker remain the executors. NoPilot produces release contracts they consume.
Replace feature flag platforms. LaunchDarkly, Unleash, Statsig keep their role. NoPilot recommends flag configuration based on spec risk tier.
Replace APM / observability tools. Sentry, Datadog, New Relic continue to monitor production. NoPilot consumes their signals as post-deploy verification evidence.
Replace incident management. PagerDuty, Opsgenie keep handling on-call. NoPilot supplies change provenance and rollback recommendations when an incident is linked back.
Write to production databases or invoke paid third-party APIs without human approval.
Promise L5 fully-autonomous delivery. Risk-tier L3 (security, payments, permissions, data migration, compliance) always requires human approval, by design.

Risk-tier Autonomy (at a glance)

Tier	Scope	Autonomy	Approval
L1	Docs, configs, internal tools, low-risk UI tweaks	Auto-PR + auto-merge to non-prod	None required (audited)
L2	Standard business features	Auto-PR + auto-verification	Human approval before prod
L3	Security, auth/permissions, payments, data migration, compliance	NoPilot only produces evidence + rollback plan	Human approval always required

Orthogonal to the L0–L3 exception tiers. Exception tier describes "how a failure is routed", risk tier describes "how autonomously a task may execute". Both coexist.

Workflow

/discover    # Collect idea → explore directions → lock requirements
→ /spec      # Design to module level
→ /build     # Implement with TDD
→ /visualize # Generate HTML dashboards for human review

Each command reads upstream artifacts from specs/ and writes its own. All artifacts are JSON contracts consumed by downstream stages.

Architecture

Supervisor + Critic Agents

Two independent agents provide cross-cutting quality assurance. Both are core guardrails (cannot be disabled):

Supervisor — Intent Guardian (telescope)

Monitors whether the overall output still matches your original intent and constraints
Systematic drift detection: scope creep, gold plating, tech-driven drift, requirement dilution, constraint erosion
Quantitative drift scoring (0-100) with recommended actions, not just binary pass/fail
Checks design philosophy compliance and decision chain analysis

Critic — Independent Challenger (magnifying glass)

The sole quality evaluator — generating agents must never self-approve their own output
AI bias detection catalog: over-engineering, optimistic assessment, missing negative paths, concept conflation, self-approval bias, anchoring, symmetric completion
Floating iteration caps (3/5/7-10 by complexity) with trend evaluation at limits
Activated at checkpoints: requirement lock, spec backward verification, build test review, build acceptance review

Framework Principles (V4.0)

Generation-review separation: Generating agents must never evaluate their own output. All review is performed by independent Critic instances in isolated sessions.
Iterative verification: Review cycles use fresh agent instances each round to avoid anchoring bias. Iteration limits float by complexity with trend evaluation (converging → extend, diverging → escalate model, oscillating → escalate human).
Agent consensus: Before escalating to the human, the executing agent spawns a consulting agent anchored on design philosophy and first principles to attempt resolution. (declared, incrementally adopting)

Key Concepts

Artifacts (generated at runtime in specs/):

discover.json or discover/index.json — Locked requirements with acceptance criteria and invariants
discover_history.json or discover/history.json — Exploration log of directions considered and decisions made
spec.json or spec/index.json — Module decomposition, interfaces, data models, dependency graph
spec_review.json — Backward verification and global coherence check results
tests.json or tests/index.json — Test cases derived from requirements and invariants
tests_review.json — Independent review of generated tests before implementation
build_report.json or build/index.json — Execution plan, TDD results, auto-acceptance verification
build_review.json — Independent acceptance review of the implemented product

For larger projects, NoPilot can split artifacts into index.json + child files so downstream agents load only the sections they need.

Exception Handling (Tiered):

L0/L1: Environmental or low-impact → AI self-fixes
L2: Contract-impacting → Pause for product decision (accept degradation, cut feature, modify spec, retry, backtrack)
L3: Fundamental issue → Diagnostic report + choice to backtrack

Backtrack Safety:

Max 3 backtracks total across all stages
Cycle detection: if A→B→A→B repeats, terminate and report
Cost awareness: users informed of re-run time before confirming backtrack

File Structure

After nopilot init, your project gets:

your-project/
├── specs/                   # Runtime artifacts (generated by commands)
│   ├── discover.json        # or discover/index.json + child files
│   ├── spec.json            # or spec/index.json + child files
│   ├── build_report.json    # or build/index.json + child files
│   └── ...
└── CLAUDE.md               # Project context with Lash auto-trigger

Global files installed by nopilot init:

~/.claude/skills/            # Claude Code skills (global, shared across projects)
├── discover/
├── spec/
├── build/
├── visualize/
├── supervisor/
├── critic/
├── lash-tracer/
├── lash-verify/
├── lash-build/
└── ...

~/.agents/skills/            # Shared by Codex and OpenCode
├── discover/
├── spec/
├── build/
├── visualize/
├── supervisor/
├── critic/
├── lash-tracer/
├── lash-verify/
├── lash-build/
└── ...

Package source skills live under commands/ and are rendered into platform-specific skill directories by nopilot init.

Schemas (25 JSON Schema files) and workflow.json stay in the npm package. Run nopilot paths to locate them.

Current Scope (V1.2, Schema 4.0)

Included: Three-stage workflow with unified skill distribution for Claude Code, Codex, and OpenCode, Greenfield projects, pure prompt engineering, full core guardrails (Supervisor with drift detection, Critic with AI bias catalog), generation-review separation, progressive idea collection, design philosophy extraction, completeness tracking, domain model and NFR outputs, artifact visualization, directory-split support for large projects, integrated Lash multi-agent build engine (TypeScript), dual CLI (nopilot + lash), npm distribution.

Not included: Brownfield/incremental iteration, agent consensus (declared, not yet wired), iOS remote agent, multi-model routing.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 186 Commits
.github/workflows		.github/workflows
benchmark		benchmark
commands		commands
docs		docs
schemas		schemas
specs		specs
src		src
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
CLAUDE.dev.md		CLAUDE.dev.md
CLAUDE.md		CLAUDE.md
README.md		README.md
README_AGENT.md		README_AGENT.md
ROADMAP.md		ROADMAP.md
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
tsconfig.json		tsconfig.json
vitest.config.ts		vitest.config.ts
workflow.json		workflow.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NoPilot

What It Is

Long-term Vision

Installation

Prerequisites

Install

Initialize a project

Start using

Why This Approach

Non-Goals

Risk-tier Autonomy (at a glance)

Workflow

Architecture

Supervisor + Critic Agents

Framework Principles (V4.0)

Key Concepts

File Structure

Current Scope (V1.2, Schema 4.0)

License

About

Uh oh!

Releases 7

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

NoPilot

What It Is

Long-term Vision

Installation

Prerequisites

Install

Initialize a project

Start using

Why This Approach

Non-Goals

Risk-tier Autonomy (at a glance)

Workflow

Architecture

Supervisor + Critic Agents

Framework Principles (V4.0)

Key Concepts

File Structure

Current Scope (V1.2, Schema 4.0)

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 7

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages