Stars
A Persistent Key-Value Store designed for Streaming processing
Apache Fluss is a streaming storage built for real-time analytics.
This is the code for our VLDB'24 paper "Oasis: An Optimal Disjoint Segmented Learned Range Filter"
This is the code for our self-designing range filter as described in our SIGMOD'22 paper of the same name.
An efficient external-memory algorithm for the construction of minimal perfect hash functions
Bloom-filter based minimal perfect hash function library
Fast web applications through dynamic, partially-stateful dataflow
MARISA: Matching Algorithm with Recursively Implemented StorAge
Apache Pinot - A realtime distributed OLAP datastore
HDFS based on Java implementation as a remote ObjectStore for DataFusion
The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query processing
Apache DataFusion Comet Spark Accelerator
A list of learning materials to understand databases internals
String map implementation through Fast Succinct Trie
Sux4J is an effort to bring succinct data structures to Java.
Java binding to Apache DataFusion
Http Connector for Apache Flink. Provides sources and sinks for Datastream , Table and SQL APIs.
A runtime implementation of data-parallel actors.