📚 Convert scanned PDFs into clean EPUB ebooks effortlessly with intelligent layout analysis and smart chapter detection.
Updated Mar 27, 2026 - Python
🔍 Search Epstein court documents for mentions of your LinkedIn connections to uncover relevant insights into your network.
🚀 Assess transaction risk in real-time with a high-performance platform leveraging ensemble machine learning models for effective decision-making.
🧹 Remove OceanofPDFs.com watermarks and rename files in your PDF library efficiently, optimizing large collections with smart processing modes.
🔍 Inspect medicine strip authenticity with MedTrace, a forensic computer vision pipeline that detects tampering and ensures accurate drug information.
🕵️ Scan millions of IP addresses efficiently with this high-performance DNS scanner featuring an intuitive Terminal User Interface (TUI).
🚀 Control model allocation on any device, reduce OOM errors, and enhance multi-GPU workflows with seamless memory management.
🖥️ Utilize DeepSeek-OCR-2 to effortlessly execute advanced OCR tasks, converting documents to markdown and extracting text through an intuitive web app.
🎓 Extract and validate data from academic marksheets using AI for accurate JSON output, enhancing record-keeping and analysis.
🎨 Enhance video generation by syncing audio to visuals with ComfyUI-PainterAI2V. Create precise lip-syncing and seamless transitions using dual model workflows.
📄 Streamline document processing with S2 Document Intelligence: an open-source API for PDF, DOCX, and image text extraction, complete with OCR and layout analysis.
📄 Scan documents easily with Gemini AI to extract text and align images in a web app. Perfect for quick and accurate OCR processing.
A memory for Claude Code: ingest PDFs, images, text files, source code, and audio into a local vector database, and serve semantic search over that content through the Model Context Protocol or a CLI.
🛠️ Build a stable, engineering-oriented OCR pipeline that adapts to various input styles using Python, OpenCV, and Tesseract.
🎭 Enable high-fidelity video face swapping with this ComfyUI plugin using Diffusion Transformer technology for seamless integration and creative control.
🤖 Enhance AI capabilities with modular Skills that provide expert knowledge, workflows, and integrations for any project.
🔍 Streamline your workflow by using PreOCR to detect and skip unnecessary OCR processing on already machine-readable files.
📖 Translate manga and manhwa pages easily with this maintainable CLI tool built on clean architecture for clear organization and efficiency.