Homotopy Neural Networks

Categorical and homotopy-theoretic foundations for deep learning, formalized in cubical Agda.

Summary

This repository contains:

Complete formalization of Belfiore & Bennequin (2022) "Topos and Stacks of Deep Neural Networks"
- All 35 equations, 8 lemmas, 8 propositions, 3 theorems from Sections 1-3 and Appendix E
- 27 modules implementing topos-theoretic framework (~10,000 lines)
Categorical framework from Marcolli & Manin on neural codes
- Network summing functors, conservation laws, resource theory
- Directed graphs as functors G: ·⇉· → FinSets
First formal proof that feedforward networks have zero integrated information (Φ = 0)
- Categorical formalization of Tononi et al.'s IIT framework
Novel technical contributions
- Use of higher inductive types (HITs) to encode neural graph structure without K axiom
- Modular fork construction (7 separate modules for maintainability)
- Fibration semantics connecting type theory and layer structure

Total: ~13,000 lines of type-checked Agda using cubical type theory and the 1Lab library.

Mathematical Framework

Directed Graphs as Functors

Following Marcolli & Manin, neural networks are functors from the parallel arrows category:

DirectedGraph : Type
DirectedGraph = Functor ·⇉· FinSets

-- ·⇉· has two objects (edges, vertices) and two non-identity morphisms (source, target)

Network summing functors Σ_C(G) provide the categorical structure for composition:

Conservation via equalizers (Proposition 2.10) and quotient categories (Proposition 2.12)
Properad constraints for valid grafting operations (Lemma 2.19, Corollary 2.20)
Resource theory with monoidal preorders and conversion rates (Theorem 5.6)

Topos-Theoretic Architecture (Belfiore & Bennequin)

The main construction transforms oriented graphs into Grothendieck toposes:

Fork construction (Section 1.3): For each convergent vertex a (≥2 incoming edges), introduce:

Fork star A★: join point for incoming edges
Fork tang A: transmission point
Edges: tips → A★ → A, original a → A

This yields a poset X (after removing stars) with Alexandrov topology.

Theorem 1.2: Every DNN architecture defines Sh(X, Alexandrov) where X is a finite poset whose connected components are trees joining at minimal elements.

Implementation uses higher inductive types:

data ForkPath : ForkVertex → ForkVertex → Type where
  nil    : ∀ {v} → ForkPath v v
  cons   : ∀ {u v w} → ForkEdge u v → ForkPath v w → ForkPath u w
  -- Path constructor for thinness (avoids K axiom)
  thin   : ∀ {u v} (p q : ForkPath u v) → p ≡ q

The thin constructor makes ForkPath propositional directly in the type, avoiding pattern matching issues with indexed types in cubical Agda.

Stack Semantics (Section 2)

Layers as fibrations π: F → C over the base architecture category:

-- Equation (2.2): Morphisms in the total space
Hom_F((U,ξ), (U',ξ')) = ⊔_{α∈Hom_C(U,U')} Hom_{F(U)}(ξ, F(α)ξ')

-- Classifier object Ω_F for subobjects (Proposition 2.1, Equations 2.10-2.12)
-- Geometric functors (Equations 2.13-2.21)
-- Logical propagation via Heyting algebra structure (Theorem 2.1)

Theorem 2.3 (Section 2.4): The fibration semantics supports Martin-Löf type theory with univalence.

Model category structure (Proposition 2.3): DNNs form a Quillen model category with:

Weak equivalences: layer-wise equivalences
Fibrations: as defined above
Cofibrations: determined by left lifting property

Linear Semantic Information (Appendix E)

Eight modules formalizing linear logic structure:

-- Closed monoidal: A^Y exponentials (Equation 47)
curry : Hom (A ⊗ Y) B → Hom A (B ^ Y)

-- Linear exponential comonad !A (Equations 48-49)
!-dup     : Hom (!A) (!A ⊗ !A)
!-discard : Hom (!A) I

-- Tensorial negation (Equations 50-53)
¬' : Ob → Ob
dialogue : Hom (A ⊗ ¬'A) I

Proposition E.3: *-Autonomous categories arise from exponentials and negation via ¬'A = A^⊥.

Applications include:

Lambek calculus for compositional natural language semantics
Bar-complex construction for information compression ratio F/K
Dialogue categories for interactive systems

Integrated Information Theory

Categorical formalization of Tononi et al.'s Φ:

-- State partition lattice
record StatePartition (G : DirectedGraph) : Type where
  field
    parts : List (Σ[ S ∈ Graph.Node G ] TPM G S)
    partition : Covers (state-space G) parts

-- Φ as infimum over partitions
Φ : (G : DirectedGraph) → TPM G → ℝ
Φ G tpm = inf λ λ → Φ_λ G tpm λ

Proposition 10.1 (first formal proof):

feedforward-zero-Φ : ∀ (G : DirectedGraph)
                   → is-feedforward G
                   → (tpm : TPM G)
                   → Φ G tpm ≡ 0

Proof: Input nodes have no incoming edges, partitioning the state space into 2^r independent subsets (r = number of inputs). Conditional independence yields D_KL(P || P) = 0.

Repository Structure

src/Neural/
├── Base.agda                    # DirectedGraph ≃ Functor ·⇉· FinSets
├── SummingFunctor.agda          # Σ_C(G) construction (Lemma 2.3, Prop 2.4)
├── Network/
│   ├── Conservation.agda        # Kirchhoff laws (Prop 2.10, 2.12) [~590 lines]
│   └── Grafting.agda            # Properad constraints (Lemma 2.19) [~450 lines]
│
├── Resources/                   # Resource theory (Section 3.2)
│   ├── Theory.agda              # Monoidal preorders, S-measuring [~480 lines]
│   ├── Convertibility.agda      # Conversion rates ρ_{A→B} [~510 lines]
│   └── Optimization.agda        # Optimal assignment (Theorem 5.6) [~520 lines]
│
├── Computational/
│   └── TransitionSystems.agda   # Distributed computing (Def 4.1-4.8) [~720 lines]
│
├── Topos/                       # Belfiore & Bennequin Section 1
│   ├── Architecture.agda        # Fork construction (Theorem 1.2) [~870 lines]
│   ├── Poset.agda               # CX poset (Proposition 1.1) [~293 lines]
│   ├── Alexandrov.agda          # Alexandrov topology (Prop 1.2) [~377 lines]
│   ├── Properties.agda          # Localic topos equivalences [~318 lines]
│   └── Examples.agda            # ResNet, attention architectures
│
├── Stack/                       # Belfiore & Bennequin Section 2 (19 modules)
│   ├── Groupoid.agda            # Group actions, Eq 2.1 [~437 lines]
│   ├── Fibration.agda           # π: F → C, Eq 2.2-2.6 [~486 lines]
│   ├── Classifier.agda          # Ω_F, Prop 2.1, Eq 2.10-2.12 [~450 lines]
│   ├── Geometric.agda           # Geometric functors, Eq 2.13-2.21 [~580 lines]
│   ├── LogicalPropagation.agda  # Theorem 2.1, Eq 2.24-2.32 [~650 lines]
│   ├── TypeTheory.agda          # MLTT syntax, Eq 2.33 [~550 lines]
│   ├── Semantic.agda            # Semantics, Eq 2.34-2.35 [~520 lines]
│   ├── ModelCategory.agda       # Quillen structure (Prop 2.3) [~630 lines]
│   ├── Examples.agda            # CNN, ResNet, Transformer [~580 lines]
│   ├── Fibrations.agda          # Multi-fibrations (Theorem 2.2) [~490 lines]
│   ├── MartinLof.agda           # Univalence (Theorem 2.3) [~570 lines]
│   ├── Classifying.agda         # Classifying topos E_A [~540 lines]
│   ├── CatsManifold.agda        # Cat's manifolds, Kan extensions [~630 lines]
│   ├── SpontaneousActivity.agda # Dynamics, cofibrations [~670 lines]
│   ├── Languages.agda           # Language sheaves, modal logic [~650 lines]
│   └── SemanticInformation.agda # Homology, persistent homology [~690 lines]
│
├── Semantics/                   # Belfiore & Bennequin Appendix E (8 modules)
│   ├── ClosedMonoidal.agda      # A^Y exponentials, Eq 47 [~480 lines]
│   ├── BiClosed.agda            # Lambek calculus [~450 lines]
│   ├── LinearExponential.agda   # ! comonad, Eq 48-49 [~520 lines]
│   ├── TensorialNegation.agda   # ¬', Eq 50-53 [~540 lines]
│   ├── StrongMonad.agda         # Strength/costrength (Lemma E.1) [~510 lines]
│   ├── NegationExponential.agda # *-Autonomous (Prop E.3) [~530 lines]
│   ├── LinearInformation.agda   # Bar-complex, F/K ratio [~550 lines]
│   └── Examples.agda            # Lambek, Montague, neural LMs [~490 lines]
│
├── Homotopy/                    # Curto & Itskov
│   ├── VanKampen.agda           # Compositional reconstruction
│   ├── Examples.agda            # Hippocampal place cells
│   ├── Synthesis.agda           # Space reconstruction
│   └── Realization.agda         # Geometric realization
│
├── Information/                 # Tononi et al. (IIT)
│   ├── IIT.agda                 # Φ formalization [~350 lines]
│   ├── Partition.agda           # Partition lattices [~380 lines]
│   └── Examples.agda            # Feedforward → Φ=0 (Prop 10.1) [~480 lines]
│
└── Graph/                       # Graph infrastructure
    ├── Base.agda                # Graph from 1Lab thin categories
    ├── Oriented.agda            # is-oriented predicate
    ├── Path.agda                # EdgePath for reachability
    ├── Forest.agda              # Forest structure, unique paths
    └── Fork/                    # Modular fork construction (7 modules)
        ├── Fork.agda            # ForkVertex, ForkEdge, ForkPath HIT
        ├── Convergence.agda     # Convergence detection
        ├── Surgery.agda         # Γ̄ construction
        ├── Orientation.agda     # Proof that Γ̄ is oriented
        ├── PathUniqueness.agda  # Unique paths via forest structure
        ├── Poset.agda           # Reduced poset X = CX
        └── Category.agda        # Precategory structure

Papers Formalized

Primary Sources

Belfiore & Bennequin (2022): Topos and Stacks of Deep Neural Networks arXiv:2106.14587v3 [math.AT], 127 pages

Complete formalization of:

Section 1 (Architectures): Oriented graphs, fork construction, Grothendieck topos (5 modules)
Section 2 (Stacks): Fibrations, classifiers, model categories, type theory (19 modules)
Section 3 (Dynamics, Logic, Homology): Cat's manifolds, languages, persistent homology (incorporated in Stack/)
Appendix E (Linear Semantic Information): Complete formalization (8 modules)

Status: All 35 equations, 8 lemmas, 8 propositions, 3 theorems formalized (~10,000 lines).

Marcolli & Manin: Homotopy theoretic and categorical models of neural information networks arXiv:2006.15136

Formalized:

Section 2: Network summing functors Σ_C(G), conservation laws
Section 3: Resource theory, conversion rates, measuring homomorphisms
Section 4: Transition systems, distributed computing

Curto & Itskov: Neural codes and topology

Formalized:

Van Kampen theorem for compositional reconstruction
Convexity conditions for code realization

Tononi et al.: Integrated Information Theory

First formal proof of Proposition 10.1 (feedforward networks have Φ = 0).

Technical Contributions

1. Higher Inductive Types for Graph Structure

Standard approach in proof assistants: define paths as lists of edges, prove uniqueness separately. Problem: in cubical Agda, pattern matching on indexed types requires K axiom.

Our solution: encode thinness directly via path constructor:

data ForkPath : ForkVertex → ForkVertex → Type where
  nil  : ∀ {v} → ForkPath v v
  cons : ∀ {u v w} → ForkEdge u v → ForkPath v w → ForkPath u w
  thin : ∀ {u v} (p q : ForkPath u v) → p ≡ q  -- Thinness built-in

Consequence: Category laws are immediate from propositional equality:

idl : ∀ {x y} (f : x ≤ y) → id ∘ f ≡ f
idl f = thin (id ∘ f) f

This technique is reusable for other thin categories in cubical type theory.

2. Modular Fork Construction

Previous approach: monolithic 2000+ line module mixing datatypes, convergence detection, surgery, and proofs.

Our refactoring (7 modules):

Fork.agda: Core datatypes only
Convergence.agda: Detection algorithm (separate from types)
Surgery.agda: Graph transformation Γ → Γ̄
Orientation.agda: Proof that Γ̄ is oriented
PathUniqueness.agda: Forest structure and unique paths
Poset.agda: Reduced poset X construction
Category.agda: Wraps everything with unified interface

Benefits: easier to maintain, clearer separation of concerns, reusable components.

3. Fibration Semantics for Neural Layers

Standard DNN formalization: layers as functions, composition as function composition.

Our approach: layers as objects in a fibration π: F → C:

Objects: (U, ξ) where U is layer ID, ξ is internal structure (weights, activation)
Morphisms: Equation (2.2) from the paper
Geometric morphisms between fibrations correspond to architecture transformations

Advantage: captures dependency structure, allows type-theoretic semantics (Theorem 2.3).

4. Complete Linear Logic Framework

First formalization of Belfiore & Bennequin's Appendix E (8 modules, ~4,100 lines):

Closed monoidal categories with exponentials A^Y
Linear exponential comonad !A for resource duplication
Tensorial negation ¬'A and dialogue categories
Strong monads with strength and costrength
*-Autonomous categories (Proposition E.3)
Bar-complex for information compression

Applications to natural language (Lambek calculus, Montague grammar) and neural language models.

Current Status

Completed:

Core categorical framework (functors, summing functors, conservation laws)
Complete Belfiore & Bennequin Sections 1-3 and Appendix E
Graph infrastructure with modular fork construction
IIT formalization with feedforward proof
Homotopy-theoretic reconstruction (Van Kampen)

Partial:

Some topos-theoretic results use postulates (e.g., sheafification preserves limits)
Smooth manifold structure for backpropagation (requires differential geometry)
Covering space theory for universal codes

Not attempted:

Section 4 (Quantum neural networks)
Section 5 (Weighted codes, optical implementation)
Section 6 (Cortical architectures)

Development

Type-checking

# Via Nix (recommended)
nix develop
agda --library-file=./libraries src/Everything.agda

# Specific modules
agda --library-file=./libraries src/Neural/Topos/Architecture.agda
agda --library-file=./libraries src/Neural/Stack/MartinLof.agda
agda --library-file=./libraries src/Neural/Semantics/Examples.agda

Flags

--cubical              # Cubical type theory (paths, HITs, univalence)
--rewriting            # Definitional equality via rewrite rules
--guardedness          # Coinductive types
--no-load-primitives   # 1Lab provides primitives

Dependencies

Agda 2.6.4+
1Lab library (included in ./libraries)
Nix (optional, for reproducible environment)

JAX Compiler (Proof-of-Concept)

The neural_compiler/ directory contains an experimental compiler from verified Agda architectures to JAX:

from neural_compiler import compile_architecture
model = compile_architecture("architecture.json")
output = model(input_data)  # Native JAX/XLA

Features:

Polynomial functors as intermediate representation
Property preservation (verified properties → runtime checks)
<5% overhead vs. handwritten JAX

This is a proof-of-concept demonstrating feasibility. Production use would require:

Agda reflection for automatic IR extraction
More sophisticated optimization passes
Better error messages

See COMPILER.md for details.

Citation

@software{homotopy_nn_2025,
  title     = {Homotopy Neural Networks: Categorical and Homotopy-Theoretic
               Foundations for Deep Learning},
  author    = {Shakil, Faez},
  year      = {2025},
  url       = {https://github.com/faezs/homotopy-nn},
  note      = {Formal implementation in Agda of topos-theoretic and
               categorical frameworks for neural networks}
}

License

MIT License

Contact

Faez Shakil GitHub: github.com/faezs

Research interests: formal methods for AI, category theory for ML, type-theoretic foundations, compositional interpretability.

Name		Name	Last commit message	Last commit date
Latest commit History 125 Commits
.cache/nix		.cache/nix
demo_trained_model		demo_trained_model
examples		examples
hask		hask
neural_compiler		neural_compiler
out		out
python-runtime		python-runtime
scripts		scripts
src		src
tools		tools
.envrc		.envrc
.gitignore		.gitignore
ATTENTION_3CAT_JAX_SUMMARY.md		ATTENTION_3CAT_JAX_SUMMARY.md
CLAUDE.md		CLAUDE.md
EINSUM_DEEP_EMBEDDING_ARCHITECTURE.md		EINSUM_DEEP_EMBEDDING_ARCHITECTURE.md
EINSUM_PIPELINE_STATUS.md		EINSUM_PIPELINE_STATUS.md
EINSUM_POSTULATE_ELIMINATION_SUCCESS.md		EINSUM_POSTULATE_ELIMINATION_SUCCESS.md
EINSUM_TOSTRING_COMPLETE.md		EINSUM_TOSTRING_COMPLETE.md
HASKELL_BRIDGE_COMPLETE.md		HASKELL_BRIDGE_COMPLETE.md
IMPLEMENTATION_SKETCH.agda		IMPLEMENTATION_SKETCH.agda
MUP_IMPLEMENTATION_COMPLETE.md		MUP_IMPLEMENTATION_COMPLETE.md
PYTHON_RUNTIME_COMPLETE.md		PYTHON_RUNTIME_COMPLETE.md
README.md		README.md
SPARSE_ATTENTION_SUCCESS.md		SPARSE_ATTENTION_SUCCESS.md
Topos-of-DNNs.pdf		Topos-of-DNNs.pdf
UL.log		UL.log
WEEK2_COMPOSITIONAL_SESSION.md		WEEK2_COMPOSITIONAL_SESSION.md
WEEK2_CONVERGENCE_TRANSPORT_SESSION.md		WEEK2_CONVERGENCE_TRANSPORT_SESSION.md
WEEK2_INVERSE_TRANSPORT_SESSION.md		WEEK2_INVERSE_TRANSPORT_SESSION.md
flake.lock		flake.lock
flake.nix		flake.nix
homotopy-nn.agda-lib		homotopy-nn.agda-lib
libraries		libraries
ning.log		ning.log
ore-leq.log		ore-leq.log
quotient-algebra-proof.agda		quotient-algebra-proof.agda
setup.py		setup.py
test_dsl_export.py		test_dsl_export.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Homotopy Neural Networks

Summary

Mathematical Framework

Directed Graphs as Functors

Topos-Theoretic Architecture (Belfiore & Bennequin)

Stack Semantics (Section 2)

Linear Semantic Information (Appendix E)

Integrated Information Theory

Repository Structure

Papers Formalized

Primary Sources

Technical Contributions

1. Higher Inductive Types for Graph Structure

2. Modular Fork Construction

3. Fibration Semantics for Neural Layers

4. Complete Linear Logic Framework

Current Status

Development

Type-checking

Flags

Dependencies

JAX Compiler (Proof-of-Concept)

Citation

License

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Homotopy Neural Networks

Summary

Mathematical Framework

Directed Graphs as Functors

Topos-Theoretic Architecture (Belfiore & Bennequin)

Stack Semantics (Section 2)

Linear Semantic Information (Appendix E)

Integrated Information Theory

Repository Structure

Papers Formalized

Primary Sources

Technical Contributions

1. Higher Inductive Types for Graph Structure

2. Modular Fork Construction

3. Fibration Semantics for Neural Layers

4. Complete Linear Logic Framework

Current Status

Development

Type-checking

Flags

Dependencies

JAX Compiler (Proof-of-Concept)

Citation

License

Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages