tika
Here are 147 public repositories matching this topic...
Extracts GPS coordinates from pdf files and Points/Polygons from kmz files to create a master kml file. 🌎
-
Updated
Jul 7, 2021 - HTML
Este proyecto consiste en la construcción de un sistema de recuperación de información que puede manipular documentos de diferentes formatos provenientes de un repositorio de información. La aplicación utiliza herramientas como Lucene y Tika para indexar y extraer información de los documentos.
-
Updated
Jun 23, 2024 - Java
Directory tree metadata parser using Apache Tika
-
Updated
May 3, 2024 - Python
A doc searcher of the documents on the local host that is based on: Tika+OCR, ElasticSearch and Kibana
-
Updated
Jan 23, 2021 - Java
WORK IN PROGRESS - Dataiku DSS plugin to extract text data from documents
-
Updated
Jan 11, 2021 - Makefile
The simple monolithic application demonstrates: the extraction of the images of the PDF document pages using Apache Tika, the storage of the images files into the local filesystem, the display of the pages using the ngx-swiper-wrapper library.
-
Updated
May 9, 2023 - Java
Early Buddhist texts from the Tipitaka (Tripitaka). Suttas (sutras) with the Buddha's teachings on mindfulness, insight, wisdom, and meditation.
-
Updated
Jul 6, 2023 - JavaScript
Information retrieval system for documents.
-
Updated
Feb 15, 2022 - HTML
The Information Retrieval Labolatories
-
Updated
Apr 16, 2018 - Java
Information Retrieval system for indexing and searching files stored on disk, with support for Romanian language
-
Updated
Mar 16, 2019 - Java
POC: azure-functions (kotlin, gradle, tika)
-
Updated
Feb 18, 2019 - Kotlin
Improve this page
Add a description, image, and links to the tika topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the tika topic, visit your repo's landing page and select "manage topics."