- This is my personal account
- Pittsburgh
- http://searchivarius.org/about
- @srchvrs
Stars
A library for efficient similarity search and clustering of dense vectors.
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
Parsing gigabytes of JSON per second: used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
Seamless operability between C++11 and Python
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
Header-only C++/Python library for fast approximate nearest neighbors
Playing around with "Less Slow" coding practices in C++20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handling, networking and user-space IO
SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
pytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.
GIZA++ is a statistical machine translation toolkit that is used to train IBM Models 1-5 and an HMM word alignment model. This package also contains the source for the mkcls tool which generates th…
Benchmark of Nearest Neighbor Search on High Dimensional Data
A word alignment tool based on the famous GIZA++, extended to support multi-threading, resuming training, and incremental training.
Fast and memory-efficient svmlight / libsvm file loader for Python.
FM-Index full-text index implementation using RRR Wavelet trees (libcds) and fast suffix sorting (libdivsufsort) including experimental results.
Seman is a set of linguistic tools to analyze Russian or German texts; it contains lexicons and grammars. The project is interesting as a baseline for many research projects in computer linguistic…
Learning M-Way Tree - Web Scale Clustering - EM-tree, K-tree, k-means, TSVQ, repeated k-means, bitwise clustering
Python bindings for the fast integer compression library FastPFor.
Transition-based joint syntactic dependency parser and semantic role labeler using a stack LSTM RNN architecture.
Fast implementations of the scancount algorithm: C++ header-only library
Example project presented at the Succinct Data Structure Tutorial at SIGIR 2016
Python binding to the KrovetzStemmer package (C++ version)
GPU-Accelerated Faster Decoding of Integer Lists