extract-text
Here are 61 public repositories matching this topic...
Scripts engineered for R&D to extract text from audio, video, and websites necessary to improve their 'Unfold' app algorithm
-
Updated
Dec 19, 2022 - Python
A React-based web app that extracts text from images using Tesseract.js. Upload an image, and the app will process it automatically. Supports manual text extraction as well. 🚀
-
Updated
Feb 10, 2025 - TypeScript
AI-powered document organiser. Extracts text and/or sorts documents: Drop in a bunch of PDFs, DOCX files, or ebooks, and it extracts Document Text, identifies Title, Author, and Year, with a local or remote LLM, and moves them into folders, and/or keeps the extracted text.
-
Updated
Mar 17, 2026 - JavaScript
Extract specific paragraphs out of Joplin notes using keywords, hashtags or custom tags similar to Logseq block references. Also, refresh extracted notes if source notes change.
-
Updated
Jan 6, 2026 - TypeScript
A python-based application, developed using Kivy framework, for text extraction and text to speech synthesis.
-
Updated
Jul 2, 2020 - Python
Extract Text and Data from Document with OCR NER
-
Updated
Aug 29, 2023 - Jupyter Notebook
This is GroupDocs free consulting project that helps you to extract Text from Microsoft PowerPoint Presentation PPTX/PPT using GroupDocs.Parser Cloud SDK for JAVA. https://www.groupdocs.cloud
-
Updated
Sep 19, 2024 - Java
Image to text using Tesseract OCR.
-
Updated
May 16, 2022 - Python
Automatically extracts packages root name for monorepos
-
Updated
Aug 28, 2022 - JavaScript
Apple Shortcut to copy text of selected area (screenshot) to clipboard
-
Updated
Jun 10, 2025
Library that allows to extract text from RPG Maker files.
-
Updated
Mar 12, 2026 - Rust
Extract text from image using Pytesseract
-
Updated
Apr 26, 2020 - Python
An application to extract text from pdf files
-
Updated
Jan 20, 2022 - C#
A gem that parses positional text from hOCR output and provides convenience methods to find text.
-
Updated
Oct 20, 2022 - Ruby
A collection of tools for OCR (optical character recognition).
-
Updated
Oct 17, 2024 - C
tokyo, a REST API, when given any type of document 📄, Identifies mime-type 🧐. Suggests extension 🦔. Alas Extracts text 💪.
-
Updated
Jun 13, 2020 - Clojure
-
Updated
Apr 16, 2022 - Python
Improve this page
Add a description, image, and links to the extract-text topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the extract-text topic, visit your repo's landing page and select "manage topics."