.NET 8 API for document file format identification, text/metadata/attachment/embedded object/sensitive item (PII/PHI)/entity extraction.
-
Updated
Nov 4, 2025 - C#
.NET 8 API for document file format identification, text/metadata/attachment/embedded object/sensitive item (PII/PHI)/entity extraction.
OCR feito em C# para recognição e extração de texto em pdfs, imagens, documentos word, planilhas, txt e de prints do clipboard.
The fluent, lightweight and powerful .NET lexerless parsing library for language development (DSL) and data scraping.
.NET Core library to extract text from doc and convert Microsoft Office binary files (doc, xls and ppt) to Open XML (docx, xlsx and pptx).
[Thesis] Video Text Extraction
Add a description, image, and links to the text-extraction topic page so that developers can more easily learn about it.
To associate your repository with the text-extraction topic, visit your repo's landing page and select "manage topics."