GDDaucatmoi / pdf2epub-paddle

📚 Convert scanned PDFs into clean EPUB ebooks effortlessly with intelligent layout analysis and smart chapter detection.

react computer-science typescript ocr books pwa book mcp nextjs english reader translate document webui epubcheck epub-reader epub3 digital-publishing pdf2zh

Updated Mar 27, 2026
Python

emotech15 / GLM-OCR-Demo

python markdown ocr computer-vision pillow torch pytorch accelerate gradio opencv-python peft torchvision huggingface-transformers vlms glm-ocr

Updated Mar 27, 2026
Python

Papotewii / EpsteIn

🔍 Search Epstein court documents for mentions of your LinkedIn connections to uncover relevant insights into your network.

rust data ocr base64 csv cnn pytorch dataset rag epstein epstien epsteindidntkillhimself rag-pipeline rag-chatbot epstein-files epstein-files-download

Updated Mar 27, 2026
Python

Altumbilal / risk-platform

🚀 Assess transaction risk in real-time with a high-performance platform leveraging ensemble machine learning models for effective decision-making.

kubernetes flow ocr dashboard risk-analysis back-office grc case-management component-analysis vulnerability-detection hacktoberfest software-security risk-management bill-of-materials know-your-customer kyb liveliness

Updated Mar 27, 2026
Python

Kinose12 / OceanofPDFs_Tag_Remover

🧹 Remove OceanofPDFs.com watermarks and rename files in your PDF library efficiently, optimizing large collections with smart processing modes.

pdf ocr books ocrmypdf renamer pdf-document-processor pdf-processing vibe-coded pdf-cleaner document-cleaner oceanofpdf

Updated Mar 27, 2026
Python

27Harsh-Tamrakar / MedTrace

🔍 Inspect medicine strip authenticity with MedTrace, a forensic computer vision pipeline that detects tampering and ensures accurate drug information.

python opencv ocr computer-vision dapp truffle image-processing forensics testrpc solidity web3 web3js metamask pharmaceuticals ropsten tamper-detection easyocr inspection-system

Updated Mar 26, 2026
Python

sus112112 / PYDNS-Scanner

🕵️‍♂️ Scan millions of IP addresses efficiently with this high-performance DNS scanner featuring an intuitive Terminal User Interface (TUI).

Updated Mar 26, 2026
Python

MuhammadZainiRedha / ComfyUI-AnyDeviceOffload

🚀 Control model allocation on any device, reduce OOM errors, and enhance multi-GPU workflows with seamless memory management.

Updated Mar 26, 2026
Python

bus35hs / DeepSeek-OCR-2-Demo

🖥️ Utilize DeepSeek-OCR-2 to effortlessly execute advanced OCR tasks, converting documents to markdown and extracting text through an intuitive web app.

python ocr torch pytorch addict gradio ocr-text-reader torchvision huggingface-transformers tokenizers einops deepseek-v3 deepseek-ocr easydict

Updated Mar 26, 2026
Python

leadershop / marksheet-information-extraction-api

🎓 Extract and validate data from academic marksheets using AI for accurate JSON output, enhancing record-keeping and analysis.

docker ocr computer-vision json-api image-processing document-processing ai-api fastapi huggingface pdf-processing easyocr document-ai llm multimodal-ai google-gemini-api vision-llm marksheet-extraction

Updated Mar 26, 2026
Python

kiddo0001 / ComfyUI-PainterAI2V

🎨 Enhance video generation by syncing audio to visuals with ComfyUI-PainterAI2V. Create precise lip-syncing and seamless transitions using dual model workflows.

Updated Mar 26, 2026
Python

gadp27deabril / s2-document-intelligence

📄 Streamline document processing with S2 Document Intelligence: an open-source API for PDF, DOCX, and image text extraction, complete with OCR and layout analysis.

python api docker pdf ocr text-extraction document-analysis document-processing layout-analysis pymupdf open-core fast-api paddleocr

Updated Mar 26, 2026
Python

GamemodeG / ocr_scanner_gemini

📄 Scan documents easily with Gemini AI to extract text and align images in a web app. Perfect for quick and accurate OCR processing.

python nlp cli opencv flask ocr image-processing gemini openai data-extraction corner-detection perspective-transform document-processing anthropic google-gemini gemini-ai kubsu-university-project

Updated Mar 26, 2026
Python

punt-labs / quarry

A memory for Claude Code: Ingest PDFs, images, text files, source code and audio into a local vector database and serve semantic search over that content through the Model Context Protocol or CLI

ocr beta vector-search mcp-server claude-code-plugin

Updated Mar 26, 2026
Python

vimo-dgaf / RVisionT

🛠️ Build a stable, engineering-oriented OCR pipeline that adapts to various input styles using Python, OpenCV, and Tesseract.

python opencv engineering ocr research computer-vision experimental pipeline image-processing tesseract text-recognition heuristics robustness no-deep-learning

Updated Mar 26, 2026
Python

weiwei0011 / ComfyUI_RH_DreamID-V

🎭 Enable high-fidelity video face swapping with this ComfyUI plugin using Diffusion Transformer technology for seamless integration and creative control.

emulator machine-learning r ocr cpp pytorch copilot v cemu cemu-emulator pdf-parser comfy ai4science stable-diffusion generative-ai deepseek comfyui-nodes pdf-extractor-pretrain deepseek-v3

Updated Mar 26, 2026
Python

KhaiEr720 / AI-Skills

🤖 Enhance AI capabilities with modular Skills that provide expert knowledge, workflows, and integrations for any project.

github react ui-design documentation ocr ai landing-page web-scraping codex mobile-ui github-scraper ai-tools claude-desktop mcp-server ai-skills kiro anthropic-skills claude-skills-hub

Updated Mar 26, 2026
Python

cannsssff / preocr

🔍 Streamline your workflow by using PreOCR to detect and skip unnecessary OCR processing on already machine-readable files.

python pdf opencv ocr computer-vision python-library image-processing text-extraction document-classification text-detection file-analysis document-analysis pdf-parsing document-processing layout-analysis ocr-detection document-understanding pdf-analysis document-intelligence

Updated Mar 26, 2026
Python

sebastian1889 / MangaTrans

📖 Translate manga and manhwa pages easily with this maintainable CLI tool built on clean architecture for clear organization and efficiency.

linux cli open-source ocr translation ai computer-vision ocr-service comics translate segmentation manga-reader japanese-language manga-translator open-to-collaborate

Updated Mar 26, 2026
Python

redocto / image-text-structurizer

java ocr ffi transformers tesseract spacy metadata-extraction phi anonymization data-anonymization data-masking rag personally-identifiable-information pii-detection pdf-extraction document-intelligence

Updated Mar 26, 2026
Python

ocr

Here are 3,654 public repositories matching this topic...
