The fluent, lightweight and powerful .NET lexerless parsing library for language development (DSL) and data scraping.
-
Updated
Nov 6, 2025 - C#
The fluent, lightweight and powerful .NET lexerless parsing library for language development (DSL) and data scraping.
.NET 8 API for document file format identification, text/metadata/attachment/embedded object/sensitive item (PII/PHI)/entity extraction.
OCR feito em C# para recognição e extração de texto em pdfs, imagens, documentos word, planilhas, txt e de prints do clipboard.
.NET Core library to extract text from doc and convert Microsoft Office binary files (doc, xls and ppt) to Open XML (docx, xlsx and pptx).
[Thesis] Video Text Extraction
Add a description, image, and links to the text-extraction topic page so that developers can more easily learn about it.
To associate your repository with the text-extraction topic, visit your repo's landing page and select "manage topics."