Stars
- All languages
- ActionScript
- Batchfile
- C
- C#
- C++
- CSS
- Clojure
- CoffeeScript
- Cuda
- Dart
- Dockerfile
- Emacs Lisp
- Erlang
- G-code
- Gherkin
- Go
- Groovy
- HTML
- Haskell
- Java
- JavaScript
- Jsonnet
- Jupyter Notebook
- Kotlin
- Makefile
- Mojo
- Objective-C
- PHP
- POV-Ray SDL
- Perl
- Prolog
- Python
- R
- Ruby
- Rust
- Scala
- Shell
- Starlark
- TypeScript
- Vim Script
Open Source Identity and Access Management For Modern Applications and Services
Apache Kafka - A distributed event streaming platform
Event Driven Orchestration & Scheduling Platform for Mission Critical Applications
Hystrix is a latency and fault tolerance library designed to isolate points of access to remote systems, services and 3rd party libraries, stop cascading failure and enable resilience in complex di…
The official home of the Presto distributed SQL query engine for big data
Apache Doris is an easy-to-use, high performance and unified analytics database.
Logstash - transport and process your logs, events, or other data
Apache Druid: a high performance real-time analytics database.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.
The Metadata Platform for your Data and AI Stack
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …
Spinnaker is an open source, multi-cloud continuous delivery platform for releasing software changes with high velocity and confidence.
AutoMQ is a diskless Kafka® on S3. 10x Cost-Effective. No Cross-AZ Traffic Cost. Autoscale in seconds. Single-digit ms latency. Multi-AZ Availability.
SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.
Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more
📈 Capturing JVM- and application-level metrics. So you know what's going on.
Alluxio, data orchestration for analytics and machine learning in the cloud
Upserts, Deletes And Incremental Processing on Big Data.
Apache Pinot - A realtime distributed OLAP datastore
Apache Kafka® running on Kubernetes
Maxwell's daemon, a mysql-to-json kafka producer
Example code from Learning Spark book
Open, Multi-modal Catalog for Data & AI
Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides great value to Kafka users by simplifying the operation of …
An uber-fast parallelized Java classpath scanner and module scanner.
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.