- SF Bay Area
Stars
A fast, scalable, multi-language, and extensible build system
The official home of the Presto distributed SQL query engine for big data
Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.
The Metadata Platform for your Data and AI Stack
The missing bridge between Java and native C++
Source-agnostic distributed change data capture system
Stream summarizer and cardinality estimator.
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and bat…
Plugin to integrate Learning to Rank (aka machine learning for better relevance) with Elasticsearch
Fess is a very powerful and easily deployable Enterprise Search Server.
Fast and efficient batch computation engine for complex analysis and reporting of massive datasets on Hadoop
The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational m…
Connecting Apache Spark with different data stores [DEPRECATED]
Neural search transforms text into vectors and facilitates vector search both at ingestion time and at search time.