text-extraction
Here are 9 public repositories matching this topic...
Golang PDF library for creating and processing PDF files (pure go)
Updated Oct 9, 2025 - Go
Highlight/colourise command output, logfiles (and anything else really) based on regex pattern matching
Updated Oct 1, 2025 - Go
A command-line tool in Go that extracts meaningful text from web pages, filters out unwanted elements, and outputs clean text for easy integration with AI applications, data mining, and web scraping.
Updated Sep 15, 2024 - Go
Extract text from plaintext, .docx, .odt and .rtf files. Pure go.
Updated Nov 25, 2023 - Go
Golang module for extracting text from XML-based MS Office documents
Updated Jan 29, 2023 - Go
This repository has moved! https://github.com/unidoc/unipdf
Updated May 23, 2019 - Go
[UNMAINTAINED] Extract values from strings and fill your structs with NLP.
Updated Sep 18, 2017 - Go