C60.ai — Molecular Evolution AutoML

Named after Buckminsterfullerene (C60): a 60-carbon molecule whose highly stable, non-obvious lattice emerges entirely from self-organisation — never from top-down design. The same principle drives this framework.

C60.ai is a research-grade Automated Machine Learning (AutoML) framework that treats every machine learning pipeline as a graph molecule and evolves it with a genetic algorithm. Unlike every mainstream AutoML tool, C60.ai does not assume the pipeline has a fixed shape — it searches over arbitrary directed acyclic graphs, discovering topologies no human would design by hand.

Why C60.ai Exists

Every major AutoML framework (auto-sklearn, TPOT, H2O, Google AutoML) shares one hidden limitation:

They assume the pipeline shape is fixed.

The search space is always Preprocessor → FeatureSelector → Model. Systems search over which components fill the slots, not what the slots should be. This creates hard ceilings:

Problem with today's AutoML	C60.ai's answer
Fixed sequential topology	Pipelines are arbitrary DAGs — parallel branches, skip connections
Hyperparameter tuning only	Structural mutation: insert, delete, replace nodes; redirect edges
No memory across evaluations	EvaluationCache keyed by structure hash, FIFO-evicted
Black-box output	PipelineStory — human-readable narrative of the entire evolution
Manual feature engineering	Genetic operators discover useful subgraph patterns automatically

Benchmark Results

vs. 9 sklearn baselines — 7 datasets, 3-fold × 3 seeds

C60.ai achieves the highest mean accuracy (94.95%) across all 7 datasets, ranking #1 by mean accuracy and outperforming 7 of 9 baselines at p < 0.05 (Wilcoxon signed-rank).

Dataset	Samples	Features	Classes	C60.ai	Best Baseline	Δ
iris	150	4	3	96.00%	KNN-10 95.33%	+0.67 pp
digits	1 797	64	10	98.83%	SVM-RBF 97.86%	+0.97 pp
waveform	5 000	21	3	86.70%	VotEns 85.80%	+0.90 pp
pendigits	8 000	16	10	99.54%	SVM-RBF 99.56%	−0.02 pp
letter	8 000	16	26	92.84%	SVM-RBF 93.60%	−0.76 pp
wine	178	13	3	95.79%	SVM-RBF 97.75%	−1.97 pp
breast_cancer	569	30	2	94.99%	LR 98.33%	−3.34 pp

C60.ai leads on high-dimensional multi-class tasks where topology search matters most. It trails on simple low-dimensional datasets where linear models are near-optimal.

vs. 9 AutoML frameworks — 7 datasets, 2-fold × 2 seeds

9 systems evaluated across all 7 datasets (BayesSearchCV excluded from Letter/Waveform — each fold takes 600–1200 s; its 5-dataset mean is 97.07%).

System	Mean (7-ds)	Digits	Letter	Waveform
Optuna Ensemble	95.51%	98.58%	94.44%	86.44%
Greedy Ensemble	95.48%	98.66%	94.76% ‡	86.64%
AutoStack	95.24%	97.77%	94.12%	86.07%
Optuna Search	95.21%	97.61%	93.86%	86.61%
FeatEng AutoML	95.18%	98.50%	93.51%	86.50%
Broad Rand.	95.12%	97.52%	93.55%	86.51%
Hyperopt Search	95.05%	97.69%	93.75%	86.66%
C60.ai	94.95%	98.83% ★	92.84%	86.70% ★
Succ. Halving	94.44%	97.41%	91.01%	84.88%

★ Best accuracy on Digits and Waveform across all 9 fully-evaluated systems.
‡ Best on Letter.
No AutoML framework statistically significantly outperforms C60.ai on Digits or Waveform (Wilcoxon p > 0.05).

Full results, plots, and statistical analysis: benchmark/results/

Research Paper

This work is written up as an ICML 2026 workshop submission:
example_paper.tex / example_paper.bib

Figures: benchmark/results/paper_figures/

Installation

git clone https://github.com/aditirkrishna/c60.ai.git
cd c60.ai
python -m venv .venv && source .venv/bin/activate   # Windows: .venv\Scripts\activate
pip install -e ".[dev]"

Optional extras:

pip install torch        # hybrid neuro-symbolic nodes
pip install matplotlib   # evolution plots and pipeline visualisation

Quick Start

Fit on any dataset

from sklearn.datasets import load_iris
from c60.evolution.engine import EvolutionEngine

X, y = load_iris(return_X_y=True)

engine = EvolutionEngine(
    population_size=20,
    max_generations=10,
    task="classification",
    random_seed=42,
)
best_pipeline = engine.fit(X, y)
best_pipeline.fit(X, y)
print(f"Accuracy: {best_pipeline.score(X, y):.4f}")

Read the evolution story

from c60.explainability.story import PipelineStory

story = PipelineStory(
    engine.history(), best_pipeline,
    feature_names=["sepal_len", "sepal_wid", "petal_len", "petal_wid"],
)
print(story.narrate())

Output:

Evolution ran 10 generations, improving from 0.6133 to 0.9600 (+0.3467).
Best pipeline: StandardScaler -> PCA(n=3) -> SVC(C=8.2, kernel=rbf)
Top features: petal_len 0.486 | petal_wid 0.374 | sepal_len 0.087

CLI

c60 run data.csv --target label --task classification
c60 explain best_pipeline.pkl --data data.csv
c60 info --type classifier

REST API

uvicorn c60.api.server:app --reload

import requests

resp = requests.post("http://localhost:8000/jobs", json={
    "X": X.tolist(), "y": y.tolist(),
    "task": "classification",
    "population_size": 15,
    "max_generations": 8,
})
job_id = resp.json()["job_id"]
# poll GET /jobs/{job_id} until status == "complete"
result = requests.get(f"http://localhost:8000/jobs/{job_id}/result").json()
print(result["best_score"], result["pipeline_steps"])

How It Works

Dataset (X, y)
     |
     v
Population of random Pipeline DAGs
  A: Scaler -> PCA -> SVM
  B: Scaler -> SelectKBest -> GBT
  C: Scaler -> RandomForest

For each generation:
  1. EVALUATE  — cross-val accuracy per pipeline (cached by structure hash)
  2. SELECT    — tournament selection (higher score = more likely to reproduce)
  3. CROSSOVER — swap subgraphs between two parent pipelines
  4. MUTATE    — insert/delete/replace nodes; redirect edges; tweak hyperparams
  5. ELITISM   — best K individuals carry forward unchanged

     |
     v
Best pipeline found -> refit on full training data -> ready to predict

Steps 3 and 4 operate on graph structure — this is what distinguishes C60.ai from all template-based AutoML.

Architecture

src/c60/
  core/           Typed DAG pipeline, operation registry, data-type lattice
  evaluation/     Fitness evaluator (stratified k-fold + timeout), eval cache
  evolution/      Population, genetic operators, tournament selection, GA engine
  explainability/ Feature introspection, PipelineStory narrative, visualisation
  hybrid/         PyTorch autoencoder + MLP classifier as first-class pipeline nodes
  execution/      Parallel population evaluation (ThreadPoolExecutor)
  cli/            Click CLI: run / explain / info / version
  api/            FastAPI async job server with Pydantic models

benchmark/
  baselines.py    9 sklearn baselines + C60Estimator sklearn-compatible wrapper
  runner.py       BenchmarkRunner — nested CV, standard + OpenML datasets
  report.py       ResultsReporter — tables, Wilcoxon tests, plots
  _run.py         Executable benchmark script
  results/        results_full.csv, report.txt, summary.md, PNG charts

test/             250+ pytest tests, full suite completes in < 60 s
docs/             Full documentation (concept / theory / architecture / results)
research/         Original research document and open problems

Comparison with Other AutoML Frameworks

Feature	auto-sklearn	TPOT	H2O AutoML	C60.ai
Pipeline topology	Fixed	Fixed	Fixed	Arbitrary DAG
Search method	Bayesian	Genetic (DEAP)	Grid/random	Graph-level GA
Structural mutation	No	Limited	No	Yes (5 operators)
Explainability	Limited	No	No	PipelineStory
Neural hybrid nodes	No	No	No	Yes (PyTorch)
REST API	No	No	Yes	Yes (FastAPI)
Structure-hash cache	No	No	No	Yes

Documentation

File	Contents
docs/concept.md	What is AutoML? The molecular evolution metaphor for anyone
docs/theory.md	Mathematical formulation — DAGs, fitness, genetic operators
docs/architecture.md	Code organisation and design decisions
docs/algorithms.md	Selection, crossover, mutation, plateau detection in depth
docs/results.md	Full benchmark results with statistical analysis
docs/getting_started.md	Step-by-step tutorial: install → fit → explain → extend
docs/api_reference.md	Python API and REST API reference
research/molecular_concept.md	Original research document

Running Tests

pytest                                      # full suite (~60 s)
pytest test/core/test_evolution.py -v      # GA engine
pytest test/core/test_benchmark.py -v      # benchmark infrastructure
pytest --cov=src/c60 --cov-report=html     # coverage report

License

MIT — see LICENSE.

Citation

@software{c60ai2026,
  title  = {C60.ai: Molecular Evolution for Automated Machine Learning},
  author = {Ramakrishnan, Aditi},
  year   = {2026},
  url    = {https://github.com/aditirkrishna/c60.ai}
}

Name		Name	Last commit message	Last commit date
Latest commit History 102 Commits
.venv/Scripts		.venv/Scripts
benchmark		benchmark
docs		docs
src/c60		src/c60
test/core		test/core
.gitignore		.gitignore
README.md		README.md
conftest.py		conftest.py
example_paper.bib		example_paper.bib
example_paper.tex		example_paper.tex
overleaf_submission.zip		overleaf_submission.zip
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

C60.ai — Molecular Evolution AutoML

Why C60.ai Exists

Benchmark Results

vs. 9 sklearn baselines — 7 datasets, 3-fold × 3 seeds

vs. 9 AutoML frameworks — 7 datasets, 2-fold × 2 seeds

Research Paper

Installation

Quick Start

Fit on any dataset

Read the evolution story

CLI

REST API

How It Works

Architecture

Comparison with Other AutoML Frameworks

Documentation

Running Tests

License

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

C60.ai — Molecular Evolution AutoML

Why C60.ai Exists

Benchmark Results

vs. 9 sklearn baselines — 7 datasets, 3-fold × 3 seeds

vs. 9 AutoML frameworks — 7 datasets, 2-fold × 2 seeds

Research Paper

Installation

Quick Start

Fit on any dataset

Read the evolution story

CLI

REST API

How It Works

Architecture

Comparison with Other AutoML Frameworks

Documentation

Running Tests

License

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages