Stars
An Open Source Machine Learning Framework for Everyone
The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integra…
C++ based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
Apache Spark - A unified analytics engine for large-scale data processing
scikit-learn: machine learning in Python
🔒 End-to-end encrypted cloud for photos, videos and 2FA secrets.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Open Source Identity and Access Management For Modern Applications and Services
A native Rust library for Delta Lake, with bindings into Python
The fastest path to AI-powered full stack observability, even for lean teams.
Open, Multi-modal Catalog for Data & AI
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
⛓️ A Framework for Building High Value Public Blockchains ✨
A beautiful, simple, clean, and responsive Jekyll theme for academics
lakeFS - Data version control for your data lake | Git for data
Orchestrate everything - from scripts to data, infra, AI, and business - as code, with UI and AI Copilot. Simple. Fast. Scalable.
The fundamental package for scientific computing with Python.
Extremely fast Query Engine for DataFrames, written in Rust