ReceiptVision 📄✨

Advanced OCR Application for Bank Invoices and Consumer Receipts

ReceiptVision is a comprehensive Python-based OCR application that transforms receipts and invoices into structured data using advanced image processing and machine learning techniques. Built with a modern Apple-inspired UI and robust backend architecture.

🌟 Key Features

📁 Multi-Format Support

PDF Documents: Extract text and convert pages to images
Image Formats: PNG, JPG, JPEG, BMP, TIFF support
Automatic Detection: Smart file type recognition and processing

🧠 Advanced OCR Processing

Smart Data Extraction: Merchant names, dates, amounts, and itemized purchases
Confidence Scoring: Field-level and overall processing confidence metrics
Multi-Language Support: Configurable OCR language models

🖼️ Advanced Image Processing

Denoising: Multiple denoising algorithms for cleaner text extraction
Adaptive Thresholding: Gaussian and mean adaptive thresholding
Morphological Operations: Text cleanup and enhancement
Skew Correction: Automatic image rotation and alignment
Contrast Enhancement: CLAHE and custom enhancement algorithms

⚡ Batch Processing

Multiple File Upload: Process dozens of files simultaneously
Progress Tracking: Real-time processing status and progress bars
Job Management: Named batch jobs with detailed statistics
Error Handling: Individual file error tracking and reporting

🎨 Modern Web Interface

Apple-Inspired Design: Clean, modern UI following Apple's design principles
Responsive Layout: Works perfectly on desktop, tablet, and mobile
Real-Time Updates: Live progress tracking and notifications
Intuitive Navigation: Easy-to-use interface for all skill levels

🗄️ Data Management

PostgreSQL Storage: Robust database with full ACID compliance
Search & Filter: Advanced search capabilities across all receipts
Data Export: Multiple export formats for extracted data
Audit Trail: Complete processing history and metadata

🚀 Quick Start

Prerequisites

Python 3.8 or higher
PostgreSQL 12 or higher
Tesseract OCR engine
Git

Installation

Clone the Repository

git clone https://github.com/encoreshao/ReceiptVision.git
cd ReceiptVision

Set Up Virtual Environment

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install Dependencies
```
pip install -r requirements.txt
```

Install System Dependencies

macOS (using Homebrew):

brew install tesseract
brew install poppler  # For PDF processing

Ubuntu/Debian:

sudo apt-get update
sudo apt-get install tesseract-ocr
sudo apt-get install poppler-utils
sudo apt-get install libpq-dev  # For PostgreSQL

Windows:

Download and install Tesseract OCR
Download and install Poppler
Add both to your system PATH

Set Up PostgreSQL Database

# Create database
createdb receiptvision

# Create user (optional)
psql -c "CREATE USER receiptvision_user WITH PASSWORD 'your_password';"
psql -c "GRANT ALL PRIVILEGES ON DATABASE receiptvision TO receiptvision_user;"

Configure Environment Variables

cp env.example .env
# Edit .env with your database credentials and settings

Initialize Database
```
python migrations/init_db.py
```
Run the Application
```
python app.py
```
Access the Application Open your browser and navigate to http://localhost:5001

📖 Usage Guide

Single File Processing

Navigate to Upload Page: Click "Upload" in the navigation menu
Select File: Drag and drop or click to browse for your receipt/invoice
Process: Click "Process File" to start OCR processing
Review Results: View extracted data with confidence scores
View Details: Click "View Full Details" for complete information

Batch Processing

Navigate to Batch Page: Click "Batch" in the navigation menu
Add Files: Drag and drop multiple files or browse to select
Name Your Job: Optionally provide a descriptive name for the batch
Start Processing: Click "Start Batch Processing"
Monitor Progress: Watch real-time progress updates
Review Results: View completion statistics and individual file results

Managing Receipts

View All Receipts: Navigate to the "Receipts" page
Search & Filter: Use the search bar and filters to find specific receipts
View Details: Click on any receipt to see full extracted data
Export Data: Use export options to download data in various formats

🏗️ Architecture

Backend Components

ReceiptVision/
├── app.py                     # Flask application factory
├── models.py                  # SQLAlchemy database models
├── api/
│   ├── routes.py             # Main API blueprint registration
│   └── blueprints/           # Resource-specific route blueprints
│       ├── __init__.py       # Blueprint package initialization
│       ├── upload_routes.py  # File upload endpoints
│       ├── receipt_routes.py # Receipt management endpoints
│       ├── batch_routes.py   # Batch processing endpoints
│       ├── system_routes.py  # Health/statistics endpoints
│       └── utils.py          # Shared API utilities
├── web/
│   └── routes.py             # Web interface routes
├── services/
│   ├── file_processor.py     # File processing service
│   └── batch_processor.py    # Batch processing service
├── ocr/
│   ├── ocr_engine.py         # Main OCR processing engine
│   ├── image_processor.py    # Advanced image preprocessing
│   └── pdf_processor.py      # PDF handling and conversion
├── tests/
│   ├── conftest.py           # Pytest configuration
│   ├── test_api.py           # API endpoint tests
│   ├── test_models.py        # Database model tests
│   ├── test_ocr.py           # OCR processing tests
│   └── test_services.py      # Service layer tests
└── migrations/
    └── init_db.py            # Database initialization

Frontend Components

static/
├── css/
│   └── style.css         # Apple-inspired CSS styles
├── js/
│   └── main.js          # Core JavaScript functionality
templates/
├── base.html            # Base template with navigation
├── index.html           # Homepage with features showcase
├── upload.html          # Single file upload interface
├── batch.html           # Batch processing interface
├── receipts.html        # Receipt management interface
├── receipt_detail.html  # Individual receipt details
└── statistics.html      # Application statistics

Database Schema

receipts: File metadata and processing status
extracted_data: OCR results and structured data
batch_jobs: Batch processing job tracking

API Blueprint Architecture

The API is organized using Flask blueprints for better maintainability:

📤 Upload Routes (upload_routes.py): File upload and processing endpoints
📄 Receipt Routes (receipt_routes.py): Receipt management and retrieval
📦 Batch Routes (batch_routes.py): Batch job management and status
⚙️ System Routes (system_routes.py): Health checks and statistics
🔧 Utils (utils.py): Shared utilities and helper functions

Each blueprint is registered under the /api/v1 prefix and handles specific resource domains, making the codebase more modular and easier to maintain.

🔧 Configuration

Environment Variables

Variable	Description	Default
`DATABASE_URL`	PostgreSQL connection string	`postgresql://...`
`SECRET_KEY`	Flask secret key	`dev-secret-key`
`UPLOAD_FOLDER`	File upload directory	`uploads`
`MAX_CONTENT_LENGTH`	Maximum file size (bytes)	`16777216` (16MB)
`TESSERACT_CMD`	Tesseract executable path	`/usr/local/bin/tesseract`
`CORS_ORIGINS`	Allowed CORS origins	`http://localhost:3000`

OCR Configuration

The OCR engine can be configured for different languages and processing modes:

# In ocr/ocr_engine.py
custom_config = r'--oem 3 --psm 6 -l eng+fra+deu'  # Multiple languages

Image Processing Parameters

Fine-tune image processing in ocr/image_processor.py:

# Contrast enhancement
alpha = 1.2  # Contrast control (1.0-3.0)
beta = 10    # Brightness control (0-100)

# Denoising parameters
cv2.fastNlMeansDenoising(image, None, 10, 7, 21)

🧪 Testing

Run Tests

# Run all tests
pytest tests/ -v

# Run specific test files
pytest tests/test_api.py -v
pytest tests/test_models.py -v
pytest tests/test_ocr.py -v

# Run with coverage report
pytest --cov=. --cov-report=html --cov-report=term

# Run tests in parallel (faster)
pytest -n auto

Test Organization

test_api.py: API endpoint testing
test_models.py: Database model testing
test_ocr.py: OCR processing testing
test_services.py: Service layer testing
conftest.py: Shared pytest fixtures and configuration

📊 Performance Optimization

Database Optimization

Indexes on frequently queried columns
Connection pooling for high-traffic scenarios
Query optimization for large datasets

Image Processing Optimization

Multi-threading for batch processing
Image caching for repeated processing
Memory-efficient processing for large files

API Performance

Response caching for static data
Pagination for large result sets
Asynchronous processing for long-running tasks

🔒 Security Considerations

File Upload Security

File type validation and sanitization
Size limits to prevent DoS attacks
Temporary file cleanup after processing

Database Security

Parameterized queries to prevent SQL injection
User input validation and sanitization
Database connection encryption

API Security

Rate limiting for API endpoints
CORS configuration for cross-origin requests
Input validation for all endpoints

🚀 Deployment

Production Deployment with Docker

The project includes production-ready Docker configuration:

Using Docker Compose

# Start all services
docker-compose up -d

# View logs
docker-compose logs -f

# Stop services
docker-compose down

Services Configuration
- Web Application: Flask app with Gunicorn server on port 5000
- Database: PostgreSQL 13 with persistent data storage
- Reverse Proxy: Nginx for static files and SSL termination
- Health Checks: Built-in health monitoring for all services

Environment Variables Update the docker-compose.yml with your production settings:

environment:
  - DATABASE_URL=postgresql://your_user:your_pass@db:5432/receiptvision
  - SECRET_KEY=your-production-secret-key
  - FLASK_ENV=production

Cloud Deployment Options

AWS: EC2 + RDS + S3 for file storage
Google Cloud: App Engine + Cloud SQL + Cloud Storage
Azure: App Service + Azure Database + Blob Storage
Heroku: Web dyno + Heroku Postgres + Cloudinary

🤝 Contributing

We welcome contributions! Please see our Contributing Guide for details.

Development Setup

Fork the repository
Create a feature branch: git checkout -b feature-name
Make your changes and add tests
Run the test suite: pytest
Submit a pull request

Code Style & Development Guidelines

The project includes comprehensive development guidelines in .cursor/rules for:

Architecture Patterns: Flask application factory, blueprint organization, service layer
Code Standards: PEP 8 compliance, type hints, Google-style docstrings
API Development: RESTful conventions, error handling, response formats
Database Patterns: SQLAlchemy best practices, relationship management
Testing Guidelines: pytest patterns, fixture usage, coverage requirements
Security Considerations: Input validation, file upload security, SQL injection prevention

Key Conventions

Follow PEP 8 for Python code
Use type hints for all function parameters and return values
Add Google-style docstrings to all functions and classes
Organize routes by resource using blueprints
Implement business logic in service layer, not route handlers
Write comprehensive tests for new features
Use meaningful variable and function names
Implement proper error handling and logging

📝 API Documentation

Core Endpoints

Upload & Processing

# Upload single file
POST /api/v1/upload
Content-Type: multipart/form-data
file: [binary file data]

# Batch upload multiple files
POST /api/v1/batch-upload
Content-Type: multipart/form-data
files: [multiple binary files]
job_name: "Optional job name"

Receipt Management

# Get specific receipt
GET /api/v1/receipt/{receipt_id}

# List all receipts with pagination
GET /api/v1/receipts?page=1&per_page=10&status=completed

# Get detailed receipt information
GET /api/v1/receipts/{receipt_id}

# Download original receipt file
GET /api/v1/receipts/{receipt_id}/file

Batch Job Management

# Get batch job status
GET /api/v1/batch-job/{job_id}

# List all batch jobs
GET /api/v1/batch-jobs?page=1&per_page=10

System Information

# Health check
GET /api/v1/health

# Application statistics
GET /api/v1/statistics

Response Format

All API responses follow a consistent JSON structure:

{
  "success": true,
  "data": {...},
  "message": "Operation completed successfully"
}

Error responses:

{
  "error": "Description of the error",
  "code": "ERROR_CODE"
}

🐛 Troubleshooting

Common Issues

Tesseract not found:

# macOS
brew install tesseract
export TESSERACT_CMD=/usr/local/bin/tesseract

# Ubuntu
sudo apt-get install tesseract-ocr

PDF processing fails:

# Install poppler-utils
sudo apt-get install poppler-utils  # Ubuntu
brew install poppler  # macOS

Database connection errors:

Verify PostgreSQL is running
Check database credentials in .env
Ensure database exists and user has permissions

Low OCR accuracy:

Ensure images are high resolution (300+ DPI)
Check image quality and contrast
Try different OCR language models
Adjust image preprocessing parameters

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Tesseract OCR for optical character recognition
OpenCV for image processing capabilities
Flask for the web framework
PostgreSQL for robust data storage
Apple Design Guidelines for UI inspiration

📞 Support

Documentation: Wiki
Issues: GitHub Issues
Discussions: GitHub Discussions
Email: support@receiptvision.com

Built with ❤️ for accurate receipt processing

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.cursor		.cursor
.github		.github
api		api
migrations		migrations
ocr		ocr
services		services
static		static
templates		templates
tests		tests
web		web
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
app.py		app.py
docker-compose.yml		docker-compose.yml
env.example		env.example
models.py		models.py
nginx.conf		nginx.conf
pytest.ini		pytest.ini
requirements.txt		requirements.txt
setup.py		setup.py

Uh oh!

encoreshao/ReceiptVision

Folders and files

Latest commit

History

Repository files navigation

ReceiptVision 📄✨

🌟 Key Features

📁 Multi-Format Support

🧠 Advanced OCR Processing

🖼️ Advanced Image Processing

⚡ Batch Processing

🎨 Modern Web Interface

🗄️ Data Management

🚀 Quick Start

Prerequisites

Installation

📖 Usage Guide

Single File Processing

Batch Processing

Managing Receipts

🏗️ Architecture

Backend Components

Frontend Components

Database Schema

API Blueprint Architecture

🔧 Configuration

Environment Variables

OCR Configuration

Image Processing Parameters

🧪 Testing

Run Tests

Test Organization

📊 Performance Optimization

Database Optimization

Image Processing Optimization

API Performance

🔒 Security Considerations

File Upload Security

Database Security

API Security

🚀 Deployment

Production Deployment with Docker

Cloud Deployment Options

🤝 Contributing

Development Setup

Code Style & Development Guidelines

Key Conventions

📝 API Documentation

Core Endpoints

Upload & Processing

Receipt Management

Batch Job Management

System Information

Response Format

🐛 Troubleshooting

Common Issues

📄 License

🙏 Acknowledgments

📞 Support

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Languages

Packages