Flow-Next

Repeatable agentic engineering

Flow-Next

Raise the quality bar for AI-assisted software work.

Flow-Next is the workflow layer for agentic engineering: durable specs, context-fit planning, re-anchored worker agents, adversarial review loops, docs drift prevention, and receipts for every serious handoff. When you're ready, hand the whole pipeline to a loop — pilot ticks in your session, Ralph runs overnight — with the same quality bar across any platform.

Spec-driven Intent survives the chat. Context-fit plan Right-sized task slices. Re-anchored work Fresh context per task. Adversarial gates Fix until SHIP. Multi-harness One workflow everywhere. Self-improving Compounds as you work. Autonomous loops Drain the backlog hands-free. Render lenses HTML review aids, opt-in.
$ /flow-next:plan fn-52 && /flow-next:work fn-52
specs/fn-52-quality-gates.md Ready ✓ Review gated Tasks 3 / 4 R-IDs 6 Receipts 2

Task Graph

fn-52-quality-gates Spec · blessed
write-spec Task
context-fit-plan Task
worker-reanchor Task
review-loop In review
make-pr-cognitive-aid Handover · queued
Fit 100%

Command Palette

⊙ Plan Spec... /flow-next:plan fn-52 ⌘ Work Ready Task /flow-next:work fn-52 ▣ Impl Review /flow-next:impl-review ◎ Live-app QA /flow-next:qa fn-52 ⇄ Sync to Tracker /flow-next:tracker-sync ∞ Pilot Tick /loop 10m /flow-next:pilot

Recent Runs

SHIP fn-52-quality-gates

ADVANCED fn-61 · pilot

Terminal / Receipts

$ /flow-next:plan fn-52
 sized 4 tasks for focused context
 dependencies recorded

$ /flow-next:work fn-52
> Re-anchoring spec + task
+ Running worker subagent
+ Updating flowctl state
+ Evidence recorded

$ /flow-next:impl-review fn-52
 verdict: SHIP · codex:gpt-5.5:high
Receipt: .flow/review-receipts/fn-52.json

$ /flow-next:qa fn-52
 verdict: SHIP · live-app · 6/6 R-IDs

$ /flow-next:make-pr fn-52
 PR opened — R-ID coverage + critical changes
$ 

Receipts

Receipt
.flow/review-receipts/fn-52.json
Verdict
SHIP
Backend
codex:gpt-5.5:high
Scope
introduced findings only
Open Receipt (JSON) →
Claude CodeOpenAI CodexFactory DroidGrok BuildCursorRepoPromptGitHub CopilotOpenCode
The flow-next task dependency graph, laid out by dependency depth with the critical path highlighted in amber.
the pipeline in motion intent → spec → plan → work → review → ship — read the full pipeline

Spec-driven control

Turn vague requests into durable specs that keep product intent, technical constraints, and review criteria stable.

  • Capture rough intent
  • Clarify business + technical gaps
  • Prevent scope drift through delivery
spec: fn-52-quality-gates
source: .flow/specs/fn-52-*.md

R1: Every task maps to acceptance criteria.
R2: Review receipts gate handoff.
R3: PR body tells reviewers where to look.

Context-fit planning

Automatically split specs into dependency-ordered task slices sized for focused agent context windows.

  • Expose blockers and parallelism
  • Keep each task bounded
  • Avoid one giant prompt becoming the plan
/flow-next:plan fn-52

ready:
  fn-52.1 wire config schema
  fn-52.2 add review receipt gate
blocked:
  fn-52.3 docs sync after API lands

Re-anchored workers

Run each task in fresh execution context, rereading the spec, task, git state, and relevant repo code before editing.

  • Fresh worker per ready slice
  • Subagents for focused investigation
  • Evidence recorded as work moves
/flow-next:work fn-52

> read .flow/specs/fn-52.md
> read ready task + git status
> inspect relevant code
> edit, test, record evidence

Adversarial quality gates

Raise the quality bar with plan review, implementation review, completion review, live-app QA, docs drift checks, and receipts.

  • Cross-model challenge loop
  • Fix until introduced issues are gone
  • Live-app QA — drive the app, not just the diff
  • Reviewer-ready proof, not promises
/flow-next:plan-review fn-52
/flow-next:impl-review fn-52
/flow-next:spec-completion-review fn-52
/flow-next:qa fn-52   # live-app

verdict: NEEDS_WORK -> fix -> review -> SHIP

Multi-harness by design

Standardize agent workflow across Claude Code, OpenAI Codex, Factory Droid, xAI Grok Build, Cursor, and review backends.

  • One repo-local state model
  • Native skills per harness
  • CLI only as safe plumbing
Claude Code   /flow-next:work fn-52
OpenAI Codex  /flow-next:work fn-52
Factory Droid /flow-next:work fn-52
Grok Build    /flow-next:work fn-52
Cursor        /flow-next:work fn-52

RepoPrompt reviews when available

Tracker sync (opt-in)

Project specs to Linear or GitHub Issues, reconcile body, status, and comments two-way, and review PRs as Linear Diffs. Projection, not coordination.

  • Spec stays the source of truth
  • Two-way body / status / comments
  • PRs render as Linear Diffs
/flow-next:tracker-sync

> push fn-52 -> WOR-17 (linked)
> reconcile body / status / comments
> make-pr links PR -> Linear Diff

Visual review aids (opt-in)

Render specs and PRs as self-contained HTML lenses — a spec visualizer with task DAG and R-ID coverage, and a read-only PR review instrument. Markdown stays the record.

  • Spec lens for business + plan review
  • Diff-derived, R-ID-verified PR lens
  • Annotate in the browser via optional Lavish
flowctl config set artifacts.html.enabled true

.flow/artifacts/fn-52/
  spec.html   # thesis, task DAG, R-ID matrix
  pr.html     # churn map, evidence, checklist

01 / Loops

Bless it. Loop it. Ship it.

Your judgment lives in the spec. The pipeline runs on a loop.
/loop 10m /flow-next:pilot --review=codex
tick 1  fn-61 · stage=plan
        ✓ 4 tasks, deps recorded
        PILOT_VERDICT=ADVANCED spec=fn-61 stage=plan
tick 2  fn-61 · stage=plan-review
        ✓ verdict SHIP (round 1)
        PILOT_VERDICT=ADVANCED spec=fn-61 stage=plan-review
tick 3  fn-61 · stage=work
        ✓ 4/4 tasks done · impl-review SHIP
        PILOT_VERDICT=ADVANCED spec=fn-61 stage=work
tick 4  fn-61 · stage=make-pr
        ✓ draft PR #172 opened (gh-confirmed)
        PILOT_VERDICT=ADVANCED spec=fn-61 stage=make-pr
tick 5  PILOT_VERDICT=NO_WORK spec=- stage=-
        backlog drained — bless more on the board

02 / Signal

What people say.

Shared by practitioners, not marketing.
“Flow-next is simply the best coding flow, not even close, and still a side project.”
Tiago Freitas · @tiagoefreitas
@Lat3ntG3nius
“Cross-model review is genius. Different models make different mistakes, so using them as mutual reviewers creates a safety net single-model workflows can’t match.”
@clairernovotny
“I’ve found it generating production-quality code. Far, far better than any of the other tools I’ve tried.”
@BaranGuneysel
“As a designer, flow-next finally lets me ship features with confidence. The review loop catches what I miss.”
@dailyreader
“The re-anchoring is the quiet superpower. After a long session the agent still knows exactly what it’s building.”
@mfeighery
“Ralph mode at night, PRs in the morning. Zero drama. The receipts mean I trust what landed.”
@ben
“A force multiplier. Plan once, then watch a team of scouts and reviewers do their jobs.”
☆ New to Flow-Next? Get your first spec planned, worked, reviewed, and handed off.