Lupine Science

Lupine Science is a public research program for understanding where interatomic potentials fail, why those failures have structure, and how that structure can guide correction.

This repository is written first for materials scientists, computational materials groups, MLIP builders, research software teams, and lab leaders who need to decide whether a model can be trusted outside the narrow conditions where it looked accurate.

The central claim is simple but demanding: prediction error is not just noise. Across potentials, elements, properties, and structure families, errors can form low-dimensional geometry. If that geometry is stable, it can tell us what a potential gets wrong, where the next failure is likely, and what correction or new benchmark would actually matter.

Lupine Science is not a finished product page. It is an operating research corpus: claims, evidence, refutations, formalization attempts, viewer artifacts, and a changelog that keeps the self-correction visible.

Why This Matters To A Materials Lab

Modern materials research increasingly depends on interatomic potentials and machine-learned interatomic potentials (MLIPs). The hard question is not only "which model has the lowest benchmark error?" It is:

Where does this potential fail?
Are the failures structured or idiosyncratic?
Does a foundation MLIP inherit the same error geometry as classical models?
Which apparent trends vanish under matched-sample tests or causal checks?
Can a claim be inspected, reproduced, refuted, and corrected without losing its provenance?

Lupine Science treats those questions as the scientific object. The goal is to turn model failure into a field of evidence that a human researcher can inspect and an agentic research system can keep extending.

The Science Spine

Layer	Scientific question	Repository evidence
Error geometry	Do prediction errors form a low-dimensional manifold across potentials and materials?	IMMI analysis, hyper-ribbon reports, LUPI views
Sloppy-model structure	Are stiff and sloppy directions visible in atomistic model error, not just parameter fitting?	`docs/sloppy_models_report.md`, Distill policy work
Cross-MLIP transfer	Do foundation MLIPs inherit, rotate, or escape the classical error geometry?	`mlip_immi/`, cross-MLIP alignment payloads
Causal and statistical validity	Which patterns survive confounder checks, bootstrap controls, and sample-size matching?	refutation notes, changelog, critique responses
Claim lifecycle	Which hypotheses are supported, refuted, corrected, or still open?	`CHANGELOG.md`, `docs/conjectures/ledger.md`, Library shelves
Formal specification	Which claims can be moved toward theorem-shaped validation?	`lean-spec/`, theory registry direction, proof templates
Agentic research loop	Can agents propose, test, broadcast, and correct claims against a durable ledger?	`glim-think/`, Phoenix traces, evidence campaigns

The important cultural point is that refutation is not treated as failure. Self-correction is part of the method. A claim that changes status should become more useful, not disappear.

How To Use Lupine Science

1. Read The Research Corpus

Start with the public Library:

library.lupine.science

The Library is the human knowledge surface for reports, claim status, evidence summaries, formal notes, and the working changelog. It is generated from this repository, so the corpus is the source of truth and the site is a readable view of it.

Useful local entry points:

Path	Use it for
`docs/ONBOARDING.md`	Start here if you are new — research-scientist and software-engineer tracks
`docs/ARCHITECTURE.md`	System map: how the roots connect into a closed scientific loop
`docs/navigation.md`	The 60-second path to the real science, error-geometry objects disambiguated, and honest status of recent additions
`docs/GLOSSARY.md`	Shared vocabulary for the science and the system
`docs/FAQ.md`	Common questions for scientists and engineers
`archive/swarm_preprint_review/research/immi_dim01_sloppy_theory.md`	The literature foundation: sloppy models, the hyper-ribbon, primary sources (Transtrum, Waterfall, Frederiksen, Kurniawan)
`lit-review.md`	Assembled review: sloppy theory + Simpson's-paradox/permutation methodology + benchmarking
`CHANGELOG.md`	The fastest way to see what changed, what was learned, and what was corrected
`docs/conjectures/ledger.md`	Current supported, refuted, and open hypotheses
`docs/research_evolution_2026_05_05.md`	Narrative of the research loop and corpus growth
`docs/science/SCIENCE_SPINE.md`	Canonical taxonomy for the full scientific program
`docs/sloppy_models_report.md`	Mathematical background for hyper-ribbon and sloppy-model framing
`docs/tda_error_landscapes_report.md`	Topological framing for error landscapes
`paper/`	IMMI manuscript source

2. Inspect Evidence In LUPI

LUPI is the browser-native viewer for atomistic evidence:

lupi.live

Use LUPI when the evidence has structures, trajectories, galleries, or visual inspection routes. The viewer is not the whole science; it is the inspectable surface for evidence that benefits from spatial or temporal inspection.

Local code lives under atlas/.

3. Run Or Extend The Work Locally

Build the Library locally:

cd library-site
npm install
npm run dev

Run the Rust scientific engine checks:

cargo test --manifest-path atlas-distill/Cargo.toml --bin atlas-distill
cargo clippy --manifest-path atlas-distill/Cargo.toml --bin atlas-distill -- -D warnings

Run the focused repo gates:

just think-lint
just engine-test
just live-build

On Windows, use Git Bash for Node and build tasks. The root justfile already does this with the explicit Git Bash path.

4. Add A Scientific Claim

New claims should be written as evidence-bearing research objects, not loose marketing copy.

Use these templates:

Template	Purpose
`docs/templates/publication.md`	Publication-ready claim, evidence, provenance, and citation structure
`docs/templates/proof-pack.md`	Evidence packet for a paper, benchmark, or collaboration review
`docs/templates/mlip-failure-geometry-audit.md`	Structured audit of where a potential or MLIP fails

Every serious claim should identify the model family, material set, property target, evidence path, status, known confounders, and the next test that could change its status.

What Is Established, Refuted, And Open

Lupine Science is explicit about epistemic status. The exact state changes over time, so treat CHANGELOG.md and the Library as the live record.

Status	Examples
Supported	Hyper-ribbon error geometry survives the classical-to-MLIP transition for most IMMI elements; de-myopization beyond elastic constants preserves structure in early tests
Refuted by us	The d-band hypothesis was confounded by sample size; the MEAM anomaly weakened under matched-sample bootstrap; the BCC/FCC causal shield was traced to data contamination
Open	Au escape under foundation MLIPs; Fe as a persistent outlier; prediction of cohesive energy and bulk modulus from the learned geometry

Repository Map

For the full root ownership ledger, including keep/elevate/remove-candidate decisions, see ROOTS.md.

Path	What it contains
`docs/ONBOARDING.md`	New contributors start here — research-scientist and software-engineer tracks
`docs/ARCHITECTURE.md`	System map: control plane, compute plane, evidence plane, and data flow
`docs/working-path.md`	Practical checkout, branch, worktree, and verification path
`archive/`	Retired surfaces and historical exports
`library-site/`	Static-site generator for the Lupine Library
`docs/`	Research corpus, reports, plans, runbooks, templates, and hypotheses
`mlip_immi/`	IMMI analysis code, benchmark data, and cross-MLIP evidence payloads
`lean-spec/`	Lean 4 proof/specification work
`paper/`	IMMI paper source
`atlas/`	LUPI viewer and atomistic evidence surfaces
`atlas-distill/`	Rust runtime for Distill scoring, policy, and fault-line extraction
`python/`	Active Python Distill packages: benchmarking, uplift, regime gate, instrumented runtime
`glim-think/`	Agentic research control plane, durable agenda, and ledger-backed loop

The old lupine-start/ marketing/start site, the distiller/ KB, the lupine-distill/ Rust crate, and the lupine-dspy/ package have been retired and archived under archive/. Public research should surface through the Library, the LUPI viewer, and the glim-think feed rather than through a second launch site.

For Contributors

docs/ONBOARDING.md — research-scientist and software-engineer tracks
CONTRIBUTING.md — contribution kinds, workflow, verification commands
scripts/bootstrap.ps1 / scripts/bootstrap.sh — install lightweight dev deps and run quick checks

For Collaborators And Observers

If you are evaluating the program, the best way to understand it is to watch the public evidence trail rather than look for a pitch surface.

Signal	What to watch
Library updates	Scientific throughput and clarity
Claim status changes	Whether the system corrects itself in public
LUPI evidence routes	Whether results are inspectable, not just asserted
MLIP audit templates	Whether the work can answer concrete model-trust questions
`CHANGELOG.md`	Whether progress is cumulative and honest about failure
Agent-readable files	Whether search engines and research agents can repeat the story accurately

Brand And Agent Contract

Public-facing surfaces use one naming contract:

Surface	Canonical name
Company / research program	Lupine Science
Browser viewer	LUPI
Viewer URL	lupi.live
Public library	Lupine Library

Avoid retired organization labels, legacy viewer labels, and retired viewer domains in new copy, metadata, links, and public docs.

Agent-readable files are first-class public artifacts:

File	Purpose
`brand.config.json`	Structured source of truth for names, roles, URLs, and retired-language categories
`docs/brand/narrative.md`	Human narrative spine for sites, docs, and publications
`docs/brand/agent/llms.txt`	Short agent/search guide served from public sites
`docs/brand/agent/llms-full.txt`	Full agent/search guide served from public sites
`docs/agent-index.md`	Repository-level orientation for coding and research agents
`docs/science/science-map.json`	Structured science taxonomy for generated docs and agents

After editing canonical agent files, run:

python scripts/sync_brand_agent_text.py

That republishes /llms.txt, /llms-full.txt, and /brand.json into the public static roots.

Citation

@unpublished{welcing2026causal,
  author  = {Welcing, Alexander},
  title   = {The Causal Geometry of Prediction Errors in Interatomic Potentials:
             A Hyper-Ribbon Manifold Analysis},
  year    = {2026},
  note    = {Working paper, in preparation}
}

License

MIT - see LICENSE.

Acknowledgments

This work builds on sloppy-model theory, causal inference, meta-analysis, materials benchmark infrastructure, OpenKIM/NIST-style potential corpora, and the broader computational materials community.

Name		Name	Last commit message	Last commit date
Latest commit History 935 Commits
.agents/skills/improve		.agents/skills/improve
.github		.github
Kimi_Agent_Draft Assistance Team Selected		Kimi_Agent_Draft Assistance Team Selected
archive		archive
atlas-distill		atlas-distill
atlas		atlas
cloudflare/cdn-proxy		cloudflare/cdn-proxy
data		data
deck		deck
docs		docs
gcp		gcp
glim-think		glim-think
kimi-workspace-export		kimi-workspace-export
lean-spec		lean-spec
library-site		library-site
lupine-ops		lupine-ops
mlip_immi		mlip_immi
paper		paper
paper2		paper2
plans		plans
python		python
raise		raise
replication/error-geometry		replication/error-geometry
scripts		scripts
tools		tools
.gcloudignore		.gcloudignore
.gcloudignore-library		.gcloudignore-library
.gitignore		.gitignore
.gitmodules		.gitmodules
.lupi_add_library.py		.lupi_add_library.py
.lupi_probe.py		.lupi_probe.py
AGENTS.md		AGENTS.md
ATLAS_Lean_Integration_Review.md		ATLAS_Lean_Integration_Review.md
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
GLOSSARY.md		GLOSSARY.md
LICENSE		LICENSE
README.md		README.md
ROOTS.md		ROOTS.md
benchmark_causal_audit.json		benchmark_causal_audit.json
benchmark_manifold.json		benchmark_manifold.json
brand.config.json		brand.config.json
justfile		justfile
lit-review.md		lit-review.md
nist_benchmark.csv		nist_benchmark.csv
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
skills-lock.json		skills-lock.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Lupine Science

Why This Matters To A Materials Lab

The Science Spine

How To Use Lupine Science

1. Read The Research Corpus

2. Inspect Evidence In LUPI

3. Run Or Extend The Work Locally

4. Add A Scientific Claim

What Is Established, Refuted, And Open

Repository Map

For Contributors

For Collaborators And Observers

Brand And Agent Contract

Citation

License

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Lupine Science

Why This Matters To A Materials Lab

The Science Spine

How To Use Lupine Science

1. Read The Research Corpus

2. Inspect Evidence In LUPI

3. Run Or Extend The Work Locally

4. Add A Scientific Claim

What Is Established, Refuted, And Open

Repository Map

For Contributors

For Collaborators And Observers

Brand And Agent Contract

Citation

License

Acknowledgments

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages