Havoc OCR Showdown

This project is an experiment to find the most effective model architecture for a 12-digit number recognition (OCR) task. We explore three different approaches, evolving from a simple linear model to a more complex Convolutional Recurrent Neural Network (CRNN), demonstrating why specialized architectures are critical for sequence-based problems. Kaggle Link

Presentation

Technical Approach

The experiment was conducted by building and training three distinct models on the same dataset.

1 `MCLv1` (Baseline: Simple Linear Model)

This model serves as our baseline. It's a simple Multi-Layer Perceptron (MLP) that takes the flattened image as input.

Architecture: havoc/registry/modelRegistry/MCLv1
Result: This model failed to learn. By flattening the image, it destroys all spatial and sequential information. It cannot tell the 1st digit from the 12th. As seen in the logs, its accuracy plateaus at ~10%.

2 `MCCv1` (Convolutional Model)

This model introduces Convolutional Neural Networks (CNNs) to extract meaningful spatial features before classification.

Architecture: havoc/registry/modelRegistry/MCCv1
Result: This model struggles but eventually learns. After being stuck at ~10% for over 20-30 epochs, it starts to "memorize" the scrambled feature patterns, peaking at ~78.08% accuracy.

3 `MCCLv1` (CNN + LSTM)

This specialized architecture, commonly known as a CRNN (Convolutional Recurrent Neural Network).

Architecture: havoc/registry/modelRegistry/MCCLv1
Result: This model is highly effective. It starts learning almost immediately and achieves a peak validation accuracy of 99.40%.

Results Summary

Model	Architecture	Peak Validation Accuracy
`MCLv1`	Simple MLP	~10.64%
`MCCv1`	CNN + MLP	78.08%
`MCCLv1`	CRNN (CNN + LSTM)	99.40%

Installation & Reproduction

This project uses uv for fast Python package management.

Prerequisites

You must have uv installed.

Instructions

Follow these steps to create the virtual environment and reproduce the results.

Clone the repository:

git clone https://github.com/SuriyaaMM/havoc
cd havoc

Create and sync the virtual environment:
```
uv venv
uv sync
```

Activate the environment and run an experiment:

source .venv/bin/activate
uv run -m havoc.experiment.Ev3

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
analysis		analysis
havoc		havoc
logs		logs
reports		reports
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
cfg.format.json		cfg.format.json
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Havoc OCR Showdown

Technical Approach

1 `MCLv1` (Baseline: Simple Linear Model)

2 `MCCv1` (Convolutional Model)

3 `MCCLv1` (CNN + LSTM)

Results Summary

Installation & Reproduction

Prerequisites

Instructions

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Havoc OCR Showdown

Technical Approach

1 MCLv1 (Baseline: Simple Linear Model)

2 MCCv1 (Convolutional Model)

3 MCCLv1 (CNN + LSTM)

Results Summary

Installation & Reproduction

Prerequisites

Instructions

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

1 `MCLv1` (Baseline: Simple Linear Model)

2 `MCCv1` (Convolutional Model)

3 `MCCLv1` (CNN + LSTM)

Packages