Web Application to extract text from image
-
Updated
May 19, 2023 - Python
Web Application to extract text from image
This web application utilizes OCR technology to recognize text in uploaded images and provides spelling correction and word performance improvement. Users can easily upload images containing text and receive accurate and enhanced text results.
Turn Old City Directory scans into searchable data. Automated pipeline handles column detection, OCR processing, and accuracy evaluation for historical document digitization.
Processing and hashing Slack communication to enable language modelling
A fast and easy-to-use Python toolkit for image processing with CLI tools for resizing, cropping, OCR, and optimization, including batch processing support.
A local GPU-accelerated Retrieval-Augmented Generation (RAG) pipeline for PDF question-answering with multi-LLM support and modular NLP components. Process documents locally with privacy-focused information retrieval.
PDF Liberation MCP Server - Break large PDFs into digestible chunks for Claude
DocuParse is a high-performance tool for converting PDF documents into clean, structured Markdown files. Designed for speed and accuracy, it extracts and formats content while minimizing errors like hallucinations and repetitions.
This repository contains a Python script to extract text from images using OpenAI's GPT-4 API. The script supports text extraction from both online image URLs and locally stored images (converted to base64). It ensures accurate and structured text extraction, making it a powerful tool for OCR-like tasks. The extracted text is saved to a file
This Python script automates the extraction of text from images using Tesseract OCR. It processes all images in the test_images/ folder and saves the extracted text as .txt files in the extracted_texts/ directory, maintaining the original image filenames.
Extract price amount and currency symbol from a raw text string
OCR tool to extract and structure text from images and scanned PDFs (outputs .docx / .txt) — FR/EN
package for ml training in GCP
Python tool for converting PDF files to text. Simplify your document processing tasks.
Retrieve data from two different websites, loading them into the PostgreSQL database using Python, and combine them to get and present new information
A Python-based application for live video text extraction using the Gemini 1.5 Flash API, hand gesture detection, and UI display.
Convert scrolling article videos into long images and extract text with OCR.
The project focuses on extracting and structuring key details from resumes, such as names, contact information, education, and work experience, into a user-friendly interface for efficient review and management.
A complete Python pipeline that automates the creation of structured datasets from natural language search queries. This tool searches the web for content matching your query, scrapes and cleans the content, and outputs a structured dataset in multiple formats.
Add a description, image, and links to the text-extraction topic page so that developers can more easily learn about it.
To associate your repository with the text-extraction topic, visit your repo's landing page and select "manage topics."