Skip to content
#

ocr-recognition

Here are 541 public repositories matching this topic...

Gemma-3 OCR exemplifies the confluence of abstruse computer vision and arcane NLP, leveraging Gemma-3 Vision’s neural framework for precise OCR and semantically refined text curation. Powered by Streamlit and Ollama, this hermetic system converts visual data into perspicuous, markdown-rendered output, ensuring maximal accuracy and confidentiality.

  • Updated Nov 11, 2025
  • Python

This project transforms messy invoice images into a structured, searchable knowledge base. The pipeline automatically extracts text with Tesseract, uses Google Gemini to parse fields (vendor, total, date), stores data in Milvus, and enables natural language queries via a LangChain-powered chatbot.

  • Updated Nov 10, 2025
  • Python

A minimalist SOTA LaTeX OCR model which contains only 20M parameters and runs in browser. Containing full training pipeline suitable for self-study. | 超轻量SOTA LaTeX公式识别模型,20M参数量,可在浏览器中运行。包含训练全流程代码,适合自学。

  • Updated Nov 8, 2025
  • Python

Improve this page

Add a description, image, and links to the ocr-recognition topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ocr-recognition topic, visit your repo's landing page and select "manage topics."

Learn more