Kiri OCR 📄

Kiri OCR is a lightweight OCR library for English and Khmer documents. It provides document-level text detection, recognition, and rendering capabilities.

🚀 Try the Live Demo | 📚 Full Documentation

✨ Key Features

High Accuracy: Transformer model with hybrid CTC + attention decoder
Bi-lingual: Native support for English and Khmer (and mixed text)
Document Processing: Automatic text line and word detection
Streaming: Real-time character-by-character output (like LLM streaming)
Easy to Use: Simple Python API and CLI

📦 Installation

pip install kiri-ocr

💻 Quick Start

CLI Tool

kiri-ocr document.jpg

Python API

from kiri_ocr import OCR

# Initialize (auto-downloads from Hugging Face)
ocr = OCR()

# Extract text from document
text, results = ocr.extract_text('document.jpg')
print(text)

# Get detailed box-by-box results
for line in results:
    print(f"{line['text']} (confidence: {line['confidence']:.1%})")

Decoding Methods

Choose the decoding method based on your speed/quality tradeoff:

# Fast (CTC) - Fastest, good for batch processing
ocr = OCR(decode_method="fast")

# Accurate (Decoder) - Balanced speed and quality (default)
ocr = OCR(decode_method="accurate")

# Beam Search - Best quality, slowest
ocr = OCR(decode_method="beam")

Streaming Recognition

Get character-by-character output like LLM streaming:

from kiri_ocr import OCR

ocr = OCR(decode_method="accurate")

# Stream characters as they're decoded
for chunk in ocr.extract_text_stream_chars('document.jpg'):
    print(chunk['token'], end='', flush=True)
    if chunk['document_finished']:
        print()  # Done!

📚 Documentation

Full documentation is available on the Wiki:

📊 Benchmark

Results on synthetic test images (10 popular fonts):

📁 Project Structure

kiri_ocr/
├── core.py               # OCR class
├── model.py              # Transformer model
├── training.py           # Training code
├── cli.py                # Command-line interface
└── detector/             # Text detection
    ├── db/               # DB detector
    └── craft/            # CRAFT detector

☕ Support

If you find this project useful:

⭐ Star this repository
Buy Me a Coffee
ABA Payway

Join our Discord Community](https://discord.gg/Vcrw274RVC)

⚖️ License

Apache License 2.0

Name		Name	Last commit message	Last commit date
Latest commit History 103 Commits
.github/workflows		.github/workflows
assets		assets
benchmark		benchmark
kiri_ocr		kiri_ocr
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Kiri OCR 📄

✨ Key Features

📦 Installation

💻 Quick Start

CLI Tool

Python API

Decoding Methods

Streaming Recognition

📚 Documentation

📊 Benchmark

📁 Project Structure

☕ Support

⚖️ License

About

Uh oh!

Releases 26

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

Kiri OCR 📄

✨ Key Features

📦 Installation

💻 Quick Start

CLI Tool

Python API

Decoding Methods

Streaming Recognition

📚 Documentation

📊 Benchmark

📁 Project Structure

☕ Support

⚖️ License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 26

Contributors 1

Languages