Scalable construction of text indexes

T Bingmann, S Gog, F Kurpicz - arXiv preprint arXiv:1610.03007, 2016 - arxiv.org
… basis for many text indexes and string algorithms. Suffix array construction is theoretically …
and often limits the applicability of advanced text data structures on large datasets. While fast …

Scalable construction of text indexes with thrill

T Bingmann, S Gog, F Kurpicz - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
… In this article, we present five suffix array construction algorithms utilizing the new algorithmic
big data batch processing framework Thrill, which allows scalable processing of input sizes …

Efficient online index construction for text databases

N Lester, A Moffat, J Zobel - ACM Transactions on Database Systems …, 2008 - dl.acm.org
index construction, we assume that an index of n pointers is being built; that the in-memory
index … costs are significantly reduced, and more scalable. In particular, the time required to …

Building a distributed full-text index for the web

S Melink, S Raghavan, B Yang… - ACM Transactions on …, 2001 - dl.acm.org
… There has been recent interest, motivated by the Web, in designing scalable techniques
to speed up inverted index construction using distributed architectures. Ribeiro-Neto et al. [1999…

Scalable structural index construction for JSON analytics

L Jiang, J Qiu, Z Zhao - Proceedings of the VLDB Endowment, 2020 - dl.acm.org
… current design of structural index construction [44] involves … work is to scale the structural
index construction to larger and … in each step of the index construction, then develop specialized …

Scalable and robust construction of topical hierarchies

C Wang, X Liu, Y Song, J Han - arXiv preprint arXiv:1403.3460, 2014 - arxiv.org
… Automated generation of high-quality topical hierarchies for a text collection is a dream …
a scalable and robust algorithm is proposed for constructing a hierarchy of topics from a text

[PDF][PDF] Parallel text index construction

F Kurpicz - 2020 - d-nb.info
… As mentioned before, we focus on the parallel construction in shared and distributed memory
and provide construction algorithm that scale well. Another central point of our construction

Scalable thread based index construction using wavelet tree

AK Yadav, D Yadav, A Verma, M Akbar… - Multimedia Tools and …, 2023 - Springer
… It was a milestone in the field of compressed full-text indexing but little mentioned in this
paper. The research was extended by applying a wavelet tree for image processing, sets of …

Efficient single‐pass index construction for text databases

S Heinz, J Zobel - Journal of the American Society for …, 2003 - Wiley Online Library
Efficient construction of inverted indexes is essential to provision of search over large collections
of text data. In this article, we review the principal approaches to inversion, analyze their …

[PDF][PDF] Scalable construction of high-quality web corpora

C Biemann, F Bildhauer, S Evert, D Goldhahn… - Journal for Language …, 2013 - jlcl.org
… Large-scale web corpora as discussed in this article are often designed to replace and extend
a traditional general-language reference corpus such as the British National Corpus (BNC…