Highlights
Stars
A high-performance observability data pipeline.
Garnet is a remote cache-store from Microsoft Research that offers strong performance (throughput and latency), scalability, storage, recovery, cluster sharding, key migration, and replication feat…
Simple, reliable, and efficient distributed task queue in Go
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
PostgreSQL Monitoring, Metrics Collection and Alerting Resources from Crunchy Data
Apache Superset is a Data Visualization and Data Exploration Platform
If you want to become good at system design, join this newsletter now 👇
An extremely fast Python package and project manager, written in Rust.
A markup-based typesetting system that is powerful and easy to learn.
The fantastic ORM library for Golang, aims to be developer friendly
OCR, layout analysis, reading order, table recognition in 90+ languages
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …
OpenID Connect (OIDC) identity and OAuth 2.0 provider with pluggable connectors
Platform to build admin panels, internal tools, and dashboards. Integrates with 25+ databases and any API.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
A React Framework for building internal tools, admin panels, dashboards & B2B apps with unmatched flexibility.
MetaSeg: Packaged version of the Segment Anything repository
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
Extremely fast Query Engine for DataFrames, written in Rust
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
A powerful set of Python debugging tools, based on PySnooper
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…
Machine Learning Engineering Open Book
Generative Models by Stability AI