Starred repositories
The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query processing
Fast, small, and fully autonomous AI personal assistant infrastructure, any OS, any platform — deploy anywhere, swap anything 🦀
[SIGMOD 2026] F3: The Open-Source Data File Format for the Future
🚀2.3x faster than MinIO for 4KB object payloads. RustFS is an open-source, S3-compatible high-performance object storage system supporting migration and coexistence with other S3-compatible platfor…
tpcds dataset generator and benchmark runner
Rust based high-performance Apache Uniffle shuffle-server
An extensible, state-of-the-art framework for columnar compression, and the fastest FOSS columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Linux…
AI-Native & Cloud-Native FS: A high-performance file semantic layer for cloud object storage, integrated with high-speed cache
GlareDB: A light and fast SQL database for analytics
Unofficial Rust plugin for IntelliJ IDEA Community Edition (fork of intellij-rust)
Multi-platform high-performance compute language extension for Rust.
Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…
The native Rust implementation for Apache Hudi, with C++ & Python API bindings.
✨ Setup Apache Spark in GitHub Action workflows
OLAP Database Performance Tuning Guide
A fast, non-cryptographic, minimally DoS-resistant hashing algorithm for Rust.