Repeatable agentic engineering

Flow-Next

Raise the quality bar for AI-assisted software work.

Flow-Next is the workflow layer for agentic engineering: durable specs, context-fit planning, re-anchored worker agents, adversarial review loops, docs drift prevention, and receipts for every serious handoff. When you're ready, hand the whole pipeline to a loop — pilot ticks in your session, Ralph runs overnight — with the same quality bar across any platform.

▣ Spec-driven Intent survives the chat.◎ Context-fit plan Right-sized task slices.♙ Re-anchored work Fresh context per task.✕ Adversarial gates Fix until SHIP.◉ Multi-harness One workflow everywhere.↻ Self-improving Compounds as you work.∞ Autonomous loops Drain the backlog hands-free.◫ Render lenses HTML review aids, opt-in.

Install Flow-Next → Read the Docs

$ /flow-next:plan fn-52 && /flow-next:work fn-52

specs/fn-52-quality-gates.md Ready ✓ Review gated Tasks 3 / 4 R-IDs 6 Receipts 2

Task Graph

fn-52-quality-gates ✓Spec · blessed

write-spec ✓Task

context-fit-plan ✓Task

worker-reanchor ✓Task

review-loop ●In review

make-pr-cognitive-aid …Handover · queued

Fit − 100% ＋ ⛶

Command Palette

⊙ Plan Spec... /flow-next:plan fn-52 ⌘ Work Ready Task /flow-next:work fn-52 ▣ Impl Review /flow-next:impl-review ◎ Live-app QA /flow-next:qa fn-52 ⇄ Sync to Tracker /flow-next:tracker-sync ∞ Pilot Tick /loop 10m /flow-next:pilot

Recent Runs

SHIP fn-52-quality-gates

ADVANCED fn-61 · pilot

Terminal / Receipts

$ /flow-next:plan fn-52
✓ sized 4 tasks for focused context
✓ dependencies recorded

$ /flow-next:work fn-52
> Re-anchoring spec + task
+ Running worker subagent
+ Updating flowctl state
+ Evidence recorded

$ /flow-next:impl-review fn-52
✓ verdict: SHIP · codex:gpt-5.5:high
Receipt: .flow/review-receipts/fn-52.json

$ /flow-next:qa fn-52
✓ verdict: SHIP · live-app · 6/6 R-IDs

$ /flow-next:make-pr fn-52
✓ PR opened — R-ID coverage + critical changes
$ ▌

Receipts

Receipt: .flow/review-receipts/fn-52.json
Verdict: SHIP
Backend: codex:gpt-5.5:high
Scope: introduced findings only

Open Receipt (JSON) →

Claude CodeOpenAI CodexFactory DroidGrok BuildCursorRepoPromptGitHub CopilotOpenCode

The flow-next task dependency graph, laid out by dependency depth with the critical path highlighted in amber. — the pipeline in motion intent → spec → plan → work → review → ship — read the full pipeline

Spec-driven control

Turn vague requests into durable specs that keep product intent, technical constraints, and review criteria stable.

Capture rough intent
Clarify business + technical gaps
Prevent scope drift through delivery

spec: fn-52-quality-gates
source: .flow/specs/fn-52-*.md

R1: Every task maps to acceptance criteria.
R2: Review receipts gate handoff.
R3: PR body tells reviewers where to look.

Context-fit planning

Automatically split specs into dependency-ordered task slices sized for focused agent context windows.

Expose blockers and parallelism
Keep each task bounded
Avoid one giant prompt becoming the plan

/flow-next:plan fn-52

ready:
  fn-52.1 wire config schema
  fn-52.2 add review receipt gate
blocked:
  fn-52.3 docs sync after API lands

Re-anchored workers

Run each task in fresh execution context, rereading the spec, task, git state, and relevant repo code before editing.

Fresh worker per ready slice
Subagents for focused investigation
Evidence recorded as work moves

/flow-next:work fn-52

> read .flow/specs/fn-52.md
> read ready task + git status
> inspect relevant code
> edit, test, record evidence

Adversarial quality gates

Raise the quality bar with plan review, implementation review, completion review, live-app QA, docs drift checks, and receipts.

Cross-model challenge loop
Fix until introduced issues are gone
Live-app QA — drive the app, not just the diff
Reviewer-ready proof, not promises

/flow-next:plan-review fn-52
/flow-next:impl-review fn-52
/flow-next:spec-completion-review fn-52
/flow-next:qa fn-52   # live-app

verdict: NEEDS_WORK -> fix -> review -> SHIP

Multi-harness by design

Standardize agent workflow across Claude Code, OpenAI Codex, Factory Droid, xAI Grok Build, Cursor, and review backends.

One repo-local state model
Native skills per harness
CLI only as safe plumbing

Claude Code   /flow-next:work fn-52
OpenAI Codex  /flow-next:work fn-52
Factory Droid /flow-next:work fn-52
Grok Build    /flow-next:work fn-52
Cursor        /flow-next:work fn-52

RepoPrompt reviews when available

Tracker sync (opt-in)

Project specs to Linear or GitHub Issues, reconcile body, status, and comments two-way, and review PRs as Linear Diffs. Projection, not coordination.

Spec stays the source of truth
Two-way body / status / comments
PRs render as Linear Diffs

/flow-next:tracker-sync

> push fn-52 -> WOR-17 (linked)
> reconcile body / status / comments
> make-pr links PR -> Linear Diff

Visual review aids (opt-in)

Render specs and PRs as self-contained HTML lenses — a spec visualizer with task DAG and R-ID coverage, and a read-only PR review instrument. Markdown stays the record.

Spec lens for business + plan review
Diff-derived, R-ID-verified PR lens
Annotate in the browser via optional Lavish

flowctl config set artifacts.html.enabled true

.flow/artifacts/fn-52/
  spec.html   # thesis, task DAG, R-ID matrix
  pr.html     # churn map, evidence, checklist

01 / Loops

Bless it. Loop it. Ship it.

Your judgment lives in the spec. The pipeline runs on a loop.

/loop 10m /flow-next:pilot --review=codex

tick 1  fn-61 · stage=plan
        ✓ 4 tasks, deps recorded
        PILOT_VERDICT=ADVANCED spec=fn-61 stage=plan
tick 2  fn-61 · stage=plan-review
        ✓ verdict SHIP (round 1)
        PILOT_VERDICT=ADVANCED spec=fn-61 stage=plan-review
tick 3  fn-61 · stage=work
        ✓ 4/4 tasks done · impl-review SHIP
        PILOT_VERDICT=ADVANCED spec=fn-61 stage=work
tick 4  fn-61 · stage=make-pr
        ✓ draft PR #172 opened (gh-confirmed)
        PILOT_VERDICT=ADVANCED spec=fn-61 stage=make-pr
tick 5  PILOT_VERDICT=NO_WORK spec=- stage=-
        backlog drained — bless more on the board

Claude Code /goal /goal keep running /flow-next:pilot until PILOT_VERDICT=NO_WORK Claude Code /loop /loop 10m /flow-next:pilot Codex /goal [features] goals = true · plain-text objective + verdict grammar

02 / Signal

What people say.

Shared by practitioners, not marketing.

“Flow-next is simply the best coding flow, not even close, and still a side project.”

Tiago Freitas · @tiagoefreitas

@Lat3ntG3nius

“Cross-model review is genius. Different models make different mistakes, so using them as mutual reviewers creates a safety net single-model workflows can’t match.”

@clairernovotny

“I’ve found it generating production-quality code. Far, far better than any of the other tools I’ve tried.”

@BaranGuneysel

“As a designer, flow-next finally lets me ship features with confidence. The review loop catches what I miss.”

@dailyreader

“The re-anchoring is the quiet superpower. After a long session the agent still knows exactly what it’s building.”

@mfeighery

“Ralph mode at night, PRs in the morning. Zero drama. The receipts mean I trust what landed.”

@ben

“A force multiplier. Plan once, then watch a team of scouts and reviewers do their jobs.”

☆ New to Flow-Next? Get your first spec planned, worked, reviewed, and handed off.