Stars
Universal Pasteboard Across Devices
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
TubeMQ has been donated to the Apache Software Foundation and renamed to InLong, please visit the new Apache repository: https://github.com/apache/incubator-inlong
chenjunjiedada / parquet-mr
Forked from apache/parquet-javaMirror of Apache Parquet
Apache Lucene and Solr open-source search software
An open-source columnar data format designed for fast & realtime analytic with big data.
Apache Spark - A unified analytics engine for large-scale data processing
MongoDB River Plugin for ElasticSearch
JDBC importer for Elasticsearch *** THIS REPOSITORY WILL BE DELETED WITHOUT ANY FURTHER NOTICE AFTER AUG 1 2026 ***
Spark RDD with Lucene's query and entity linkage capabilities
🐘 Elasticsearch real-time search and analytics natively integrated with Hadoop
Free and Open Source, Distributed, RESTful Search Engine
Apache Druid: a high performance real-time analytics database.
Apache Pinot - A realtime distributed OLAP datastore