#
ms-office
Here are 2 public repositories matching this topic...
Anyparser Typescript SDK for RAG/ETL Pipelines - File Content Extraction. Supports extraction from various file formats including PDF, Microsoft Office documents, OCR/Image to Text, Audio to Text, and Website to Text.
crawler ocr microsoft-word web-crawler text-extraction artificial-intelligence knowledgebase ms-office microsoft-office etl-pipeline rag pdf-extraction n8n-nodes langchain retrieval-augmented-generation graph-rag cache-augmented-generation anyparser
-
Updated
Feb 26, 2025 - TypeScript
Improve this page
Add a description, image, and links to the ms-office topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the ms-office topic, visit your repo's landing page and select "manage topics."