This is a simple Java project to perform a word search from a directory of documents. It can handle multiple Document types, from PDF to txt to XML.
-
Updated
Sep 8, 2017 - Java
This is a simple Java project to perform a word search from a directory of documents. It can handle multiple Document types, from PDF to txt to XML.
The implementation aims to harness the power of Watson's text classification capabilities by leveraging Stanford CoreNLP for natural language processing and Lucene.Net for text indexing and search.
GridU search course, Lucene, Solr, Elastic Search
nori analysis java application example
Spring-Boot web app for full text lyric search
Using Apache Lucene to index documents in AP89 corpus, perform retrieval on TREC topics and evaluate the performance of retrieval algorithms using different evaluation metrics
A java console application to index and search through files and directories
Information retrieval of documents using user generated query.
Small playground to explore the analysis API of Lucene 6.x
This is Spring Boot Email Client application, it is using Lucene to search and get different resources from server side.
DocClusterizer is a Java desktop application designed to analyze and cluster documents based on their content similarity. The application utilizes Lucene and Tika libraries to process various file extensions such as txt, pdf, docx, and pptx.
An implementation of an advanced movie search engine, using TMDB's data & Lucene's indexing. It is a desktop application, developed in Java
Provides compound word filters for lucene
Implementation of Daitch–Mokotoff Soundex for Solr/Lucene
Práctica realizada para la Asignatura de Ingeniería Informática de Recuperación de Información.
Add a description, image, and links to the lucene-analyzer topic page so that developers can more easily learn about it.
To associate your repository with the lucene-analyzer topic, visit your repo's landing page and select "manage topics."