# ocr-recognition

Here are 541 public repositories matching this topic...

ricochetservice / Gemma3_OCR_Text_Extractor_LLM

Gemma-3 OCR exemplifies the confluence of abstruse computer vision and arcane NLP, leveraging Gemma-3 Vision’s neural framework for precise OCR and semantically refined text curation. Powered by Streamlit and Ollama, this hermetic system converts visual data into perspicuous, markdown-rendered output, ensuring maximal accuracy and confidentiality.

ocr base64 deep-learning image-processing transformers pillow text-extraction ocr-recognition streamlit text-extraction-from-image llm vision-language-model ollama gemma3

Updated Nov 11, 2025
Python

Y2marcos / DeepSeek-OCR-Studio

This tool helps you convert documents to Markdown, extract raw text, and locate specific content with bounding boxes. Markdown conversion takes roughly 20 seconds and a locate task roughly 3 seconds. Check the info at the bottom of the page for more information.

ocr ocr-recognition ocr-text-reader ocr-python deepseek deepseekocr

Updated Nov 10, 2025
Python

StabRise / ScaleDP

ScaleDP is an Open-Source extension of Apache Spark for Document Processing

nlp pdf machine-learning ocr spark data-extraction nlp-machine-learning vlm ocr-recognition pdf-document-processor ocr-python easyocr huggingface-models llm llm-inference suryaocr doctrocr

Updated Nov 10, 2025
Python

HTLinh0604 / Invoice-data-extraction

This project demonstrates a classic OCR pipeline: a Flask app takes an image, applies OpenCV preprocessing, and uses Tesseract OCR to digitize Vietnamese invoices (Bách Hóa Xanh).

python opencv flask numpy regex pandas tesseract-ocr ocr-recognition

Updated Nov 10, 2025
Python

HTLinh0604 / invoice_ai_automation

This project transforms messy invoice images into a structured, searchable knowledge base. The pipeline automatically extracts text with Tesseract, uses Google Gemini to parse fields (vendor, total, date), stores data in Milvus, and enables natural language queries via a LangChain-powered chatbot.

python tesseract-ocr ocr-recognition fastapi streamlit huggingface-transformers llm langchain rag-chatbot

Updated Nov 10, 2025
Python

alephpi / Texo

A minimalist SOTA LaTeX OCR model with only 20M parameters that runs in the browser. Includes the full training pipeline, suitable for self-study.

python formula machine-learning ocr latex computer-vision deep-learning math transformers pytorch hydra ocr-recognition pytorch-lightning distillation-model latex-ocr math-formula-recognition vision-encoder-decoder unimernet formulanet

Updated Nov 8, 2025
Python

ajaj123-debug / Nutriscan-food-labels-insights

Django app that uses Tesseract OCR and a Gemini-based analyzer to extract nutrition info from images, match harmful ingredients, and persist scan results.

python3 ocr-recognition render-deployment

Updated Nov 8, 2025
Python

shubhambhavsar / ThirdEyeVision

text-to-speech computer-vision ocr-recognition yolov8

Updated Nov 8, 2025
Python

themeetshah / screen-assistant

Screen Assistant is an intelligent, voice-driven desktop automation system.

python nlp ocr-recognition openai-whisper gemeni-api

Updated Nov 7, 2025
Python

rtr46 / meikiocr

High-speed, high-accuracy local OCR for Japanese video games

python video-games ocr computer-vision deep-learning japanese text-recognition ocr-recognition japanese-study onnx ocr-detection onnxruntime japanese-language-learners ocr-demo

Updated Nov 7, 2025
Python

dl-cv / labelme-ai

LabelmeAI by 深度视觉 (Deep Vision) is a smarter annotation tool, extensively redeveloped from the open-source version of LabelMe.

python computer-vision deep-learning image-annotation video-annotation annotations classification semantic-segmentation ocr-recognition instance-segmentation

Updated Nov 7, 2025
Python

prithivrajmu / extract-data-from-pdf

Extracts data from PDFs of all sorts: handwritten, structured, and unstructured documents

data-mining ocr ocr-recognition pdf-document-processor llm

Updated Nov 6, 2025
Python

wcosta-01 / SeniorCapstone

Using Pupil Labs Core eye-tracking glasses, we capture text in images, in videos, and in real time.

eye-tracking pupil-labs ocr-recognition eye-detection text-detection-recognition easyocr

Updated Nov 6, 2025
Python

Uli-Z / autoPDFtagger

autoPDFtagger is a Python tool designed for efficient home-office organization, focusing on digitizing and organizing both digital and paper-based documents. By automating the tagging of PDF files, including image-rich documents and scans of varying quality, it aims to streamline the organization of digital archives.

artificial-intelligence archive pdf-files ocr-recognition gpt-3 gpt-4 gpt-api gpt-vision

Updated Nov 5, 2025
Python

BrunoViola / ocr-with-knn

OCR in Python using k-Nearest Neighbors (k-NN) to recognize characters of the Yoruba alphabet

python machine-learning yoruba optical-character-recognition knn k-nearest-neighbours ocr-recognition ioruba

Updated Nov 4, 2025
Python

Ishita-kapoor / SmartEyeNPR

AI-driven number plate recognition web app built with Flask, YOLOv8, and EasyOCR. Instantly detects and reads license plates from images or live webcam, flags stolen vehicles via database lookup, and is powered by a custom Roboflow-annotated dataset. 🚗✨

python opencv computer-vision ocr-recognition yolov8

Updated Nov 3, 2025
Python

vikashmehta292511 / NavDoc-AI

AI-powered medical document analyzer with multilingual support and comprehensive medical report interpretation for my SIH-2025 project.

python machine-learning language-translation tesseract-ocr medical-image-processing ocr-recognition fastapi huggingface-transformers report-summarizer facebook-mbart

Updated Nov 3, 2025
Python

Ramakm / Deepseek-OCR

Deepseek OCR paper implementation.

machine-learning ocr ai ml python3 research-paper ocr-recognition mlops deepseek-llm deepseek

Updated Nov 3, 2025
Python

richardmatusch / rename_jpgs

Script that uses EasyOCR to scan .jpg files and rename them based on the values it reads

ocr-recognition jpg-images

Updated Nov 2, 2025
Python

supreetbhat / meter_reading

An automated meter reader (AMR) system to extract serial numbers and readings from meter images using deep learning.

tesseract-ocr ocr-recognition yolov8

Updated Nov 2, 2025
Python
