-
Stealth
- San Francisco
- https://www.antoniomallia.it
- @antonio_mallia
- in/antoniomallia
Highlights
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
A collection of powerful data structures
fastutil extends the Java™ Collections Framework by providing type-specific maps, sets, lists and queues.
💎Collection of algorithms and data structures
Anserini is a Lucene toolkit for reproducible information retrieval research
Restful index server for teaching purposes. [I do not maintain this repository any more.]
hyan / kamikaze
Forked from javasoze/kamikazeDocId set compression and set operation library
Tools relating to the CC-News-En Collection
64-bit integer compression algorithms in Java
Indexing TREC corpora and Wikipedia using Lucene
A comparison of two search algorithms used to search queries in large datasets: TAAT (term at a time) and DAAT (document at a time).
Selective search partitions large scale dataset into subsets(shards) such that only few shards needs to be searched for a query, thus improving search efficiency and effectiveness
Binary delta encoding for series of floating point numbers
Document-at-a-time query processing with WAND optimization.
An efficient layer to perform query processing on top of Terrier
A tool to create D2SI format collections from the CC-NEWS crawl using Apache Tika and Lucene
Java tools for evaluating BitFunnel performance compared to an mg4j baseline.
Dhulipala, Laxman, et al. "Compressing Graphs and Indexes with Recursive Graph Bisection." arXiv preprint arXiv:1602.08820 (2016).
Natural Language Processing POS tagger using Viterbi Algorithm