Stars
Grafana Tempo is a high volume, minimal dependency distributed tracing backend.
Distributed DuckDB - dual execution and differential storage
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.
cigrainger / duckdb-hnsw-acorn
Forked from duckdb/duckdb-vssACORN-1 pre-filtered HNSW search for DuckDB
"CLI-Anything: Making ALL Software Agent-Native" -- CLI-Hub: https://clianything.cc/
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …
The Go implementation of Connect: Protobuf RPC that works.
DuckLake is an integrated data lake and catalog format
Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.
A Rust-based toolkit for building and packaging DuckDB extensions without Python dependencies.
Apache Spark - A unified analytics engine for large-scale data processing
A modular implementation of timely dataflow in Rust
Local AI assistant, dreaming explorable worlds.
The Naiad system provides fast incremental and iterative computation for data-parallel workloads
AliSQL is a MySQL branch originated from Alibaba Group. Fetch document from Release Notes at bottom.
Real-time analytics on Postgres tables
DuckDB is an analytical in-process SQL database management system
An implementation of differential dataflow using timely dataflow on Rust.
The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL
Event streaming platform for agentic AI. Continuously ingest, transform, and serve event streams in real time, at scale.
SGLang is a high-performance serving framework for large language models and multimodal models.
High-performance adaptive, durable, portable, transactional embeddable storage engine with optional tiered object storage for infinite scale. Designed for flash and RAM optimization.
Postgres extension for vector search (DiskANN), complements pgvector for performance and scale. Postgres OSS licensed.
pg_lake: Postgres with Iceberg and data lake access
The First Distributed Real-Time Search Analytics Database