Stars
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
An AI-powered task-management system you can drop into Cursor, Lovable, Windsurf, Roo, and others.
Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface.
The leader in Customer Data Infrastructure
Empowering People Ethically 🚀 — Matomo is hiring! Join us → https://matomo.org/jobs Matomo is the leading open-source alternative to Google Analytics, giving you complete control and built-in priva…
Raccoon is a high-throughput, low-latency service to collect events in real-time from your web, mobile apps, and services using multiple network protocols.
Evaluation of Jaeger tracing within Kafka components
Amundsen is a metadata-driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
📝 Algorithms and data structures implemented in JavaScript with explanations and links to further readings
Scalable machine learning library for Apache Hive/Spark/Pig
This is a base for a rapid introduction to Python!
Storm Metrics module for reporting to statsd
A web console for Apache Kafka (retired)