Agentic Proof Engineering

A methodology for formal verification using AI agents.

Workflow

flowchart TD
    A[Gap Discovery<br>Research via search or prior knowledge<br>Human or AI] --> B{Existing work?}

    B -->|YES| C{Comprehensive?}
    B -->|NO| D[Identification of components]

    C -->|YES| E[Discard]
    C -->|NO| F[Define boundaries]

    F --> D
    D --> G[Atomization of components]

    G --> H{Existing conventions?}
    H -->|YES| I[Adopt conventions]
    H -->|NO| J[Generate conventions]

    I --> K[Identify proof libraries]
    J --> K

    K --> L[Landmark Planning]

    L --> M[Constrained Prompt<br>Sequential<br>No escape hatches<br>No time pressure<br>No simplifications]

    M --> N{Workflow?}

    N -->|Historical| O[Browser Paste<br>Forced review]
    N -->|Modern| P[Agentic<br>High velocity]

    O --> Q[Paste output to proof assistant]
    Q --> R{Compiles?}
    R -->|NO| S[Paste error back to AI]
    S --> Q
    R -->|YES| T[Cross-reference highest-tier AI<br>GPT-5 / Opus / Gemini]

    P --> U[Agent runs autonomously]
    U --> V{Deviation?}
    V -->|YES| W[Intervene]
    W --> X{Complete?}
    V -->|NO| X
    X -->|NO| U
    X -->|YES| T

    T --> Y{Issues found?}
    Y -->|NO| Z[Section Done]
    Y -->|YES| AA[Present to origin agent for review]

    AA --> AB{Resolution}
    AB -->|Critique accepted| AC[Fix & re-loop]
    AB -->|Defense holds| Z

    AC --> U

    Z --> ZA[Self-Assessment]
    ZA --> AD{More sections?}
    AD -->|YES| M
    AD -->|NO| AE[Done]

Example: khalil-verified

Repository: github.com/CharlesCNorton/khalil-verified

Formalizing 8th century Arabic prosody by Khalil ibn Ahmad al-Farahidi.

Workflow Traversal

Gap Discovery (Research via search or prior knowledge — Human or AI):

Researched domain: 8th century Arabic prosody
Target: Khalil ibn Ahmad al-Farahidi's aruz system (c. 760 CE)
The original quantitative meter system from which Persian, Turkish, Kurdish, and Urdu prosody descend

Existing work? → YES (searched)

Comprehensive? → NO

No proof assistant formalization of Arabic prosody exists in Coq, Agda, Lean, or Isabelle.

Define boundaries:

Khalil's original Arabic system
The 15 canonical meters (16 with later addition)
The tent-pole terminology (watad, sabab, fāṣila)
The five circles organization
Excludes: Persian/Turkish adaptations, Arabic NLP, text processing

Identification of components:

Syllable weight classification
Building blocks (sabab, watad)
The eight tafāʿīl (feet)
The sixteen buḥūr (meters)
The five dawāʾir (circles)
Variation rules (zihāf, ʿilla)
Scansion

Atomization of components:

Layer 1: Syllable weight (Short: CV; Long: CVV or CVC)
Layer 2: Sabab khafīf, sabab thaqīl, watad majmūʿ, watad mafrūq
Layer 3: Eight tafāʿīl with weight patterns
Layer 4: Sixteen buḥūr in five circles
Layer 5: Zihāf and ʿilla modification rules
Layer 6: Scansion (deferred)

Existing conventions? → NO

No prosody formalization conventions in proof assistants.

Generate conventions:

Inductive types for weight, foot, meter, circle
Pattern as list weight
Decidable equality for pattern matching
Finite enumeration for closed sets

Identify proof libraries:

Stdlib.Lists.List
Stdlib.Logic.FinFun
Stdlib.Structures.Equalities

Landmark Planning:

Section 1: Foundations
Section 2: Building Blocks
Section 3: The Eight Feet
Section 4: The Sixteen Meters
Section 5: The Five Circles
Section 6: Variation Rules
Section 7: Scansion

Constrained Prompt (Sequential, No escape hatches, No time pressure, No simplifications):

Constraints:

One section at a time
Single file monolith
Clean compile before advancing
Witness, example, and counterexample for each proof
No admits, no axioms

Counterexample standard: A counterexample must test a failure mode or edge case that could plausibly arise from a bug in the definition. Trivial negations (e.g., X ≠ Y where X and Y are obviously different) do not qualify.

Workflow mode: Modern (Agentic, High velocity)

Progress

Section	Status
1. Foundations	Complete
2. Building Blocks	Complete
3. The Eight Feet	Complete
4. The Sixteen Meters	Complete
5. The Five Circles	Complete
6. Variation Rules	Complete
7. Scansion	Complete

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agentic Proof Engineering

Workflow

Example: khalil-verified

Workflow Traversal

Progress

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Agentic Proof Engineering

Workflow

Example: khalil-verified

Workflow Traversal

Progress

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages