Skip to content

maweiruc/proofcheck-stat-paper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

proofcheck-stat-paper

A toolkit for systematically checking the proofs in statistics / ML-theory paper appendices using coding agents (Claude Code, Codex, etc.).

Authors: Wei Ma

Directory

proofcheck-stat-paper/
  README.md       ← you are here
  guideline.md    ← principles, templates, severity system
  workflow.md     ← Pass 0-5 execution methodology
  bootstrap.md    ← prompt to auto-generate CHECK_PLAN + EXECUTION_ORDER
  scripts/        ← automation (index, crossref, scaffold)
  templates/      ← PROGRESS.md and other reusable templates
  tutorial/       ← step-by-step walkthrough (CBARA paper)
  papers/         ← put papers to check here
    your-paper/   ← one folder per paper
      audit/      ← created during check execution

How to Use

Everything happens inside this directory. Open Claude Code here.

Check a new paper

  1. Create a folder: papers/your-paper-name/
  2. Put the paper's LaTeX source into it: papers/your-paper-name/paper.tex
  3. Tell Claude:
Check the appendix proofs of the paper in papers/your-paper-name/paper.tex.
First read guideline.md and workflow.md to understand the methodology,
then read the paper and run bootstrap.md to generate a check plan.

Claude will:

  • Generate CHECK_PLAN.md and EXECUTION_ORDER.md inside papers/your-paper-name/
  • Create an audit/ directory there
  • Execute the phases (indexing → foundations → lemmas → theorems → final report)

All outputs stay in papers/your-paper-name/. The toolkit files at the root are never modified.

Re-check after author revisions

Tell Claude: "The paper in papers/your-paper-name/ has been updated. Re-run the checks."

Files

guideline.md

Read this first. Paper-agnostic. Defines:

  • Core objective, operating principles
  • Severity system: S0 (fatal) through S3 (minor)
  • Proof-unit templates, 19 common failure patterns
  • Agent prompt templates for every stage

workflow.md

The bridge from principles to execution. Covers:

  • How to extract proof architecture from a paper
  • How to build dependency-ordered execution plans
  • Workspace setup, phase-by-phase execution
  • Paper-type-specific adaptations (asymptotic, concentration, optimization, Markov chain, M-estimation)

bootstrap.md

One prompt → CHECK_PLAN.md + EXECUTION_ORDER.md drafts.

  • Mode A: fully automatic (best when paper has clear proof-strategy section)
  • Mode B: you provide a knowledge block with proof strategy insights

tutorial/

A step-by-step walkthrough using the CBARA paper. Shows what every file looks like at every stage — not a paper to check, but a reference for how the toolkit works.

Requirements

  • Claude Code (or similar coding agent with file read/write)
  • LaTeX source of the paper
  • Git (for version-tracking toolkit changes)

Notes

Tested with Claude Code using deepseek-v4-pro at maximum effort. Results may vary with other models or effort settings.

About

A toolkit for systematically checking proofs in statistics and ML-theory paper appendices using coding agents. Includes severity framework, phased parallel execution, auto-planning prompt, and a complete worked tutorial.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages