#
ms-office
Here are 5 public repositories matching this topic...
Anyparser Python SDK for RAG/ETL Pipelines - File Content Extraction. Supports extraction from various file formats including PDF, Microsoft Office documents, OCR/Image to Text, Audio to Text, and Website to Text.
python search-engine pdf crawler typescript ocr knowledge-graph openai knowledgebase ms-office etl-framework etl-pipeline rag n8n langchain llamaindex retrieval-augmented-generation crewai langgraph cache-augmented-generation
-
Updated
Feb 26, 2025 - Python
A MS OpenXML Format Fuzzing Framework
-
Updated
Apr 10, 2018 - Python
Gathers metadata information of MS Office files
-
Updated
Jan 2, 2018 - Python
Improve this page
Add a description, image, and links to the ms-office topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the ms-office topic, visit your repo's landing page and select "manage topics."