Stars
Collection of library stubs for Python, with static types
scikit-learn: machine learning in Python
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
SQL databases in Python, designed for simplicity, compatibility, and robustness.
Data validation using Python type hints
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
FastAPI framework, high performance, easy to learn, fast to code, ready for production
Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy
An orchestration platform for the development, production, and observation of data assets.
Databricks framework to validate Data Quality of pySpark DataFrames and Tables
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
A client for connecting and running DDLs on hive metastore.
Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture
Open, Multi-modal Catalog for Data & AI
Cerbos is the open core, language-agnostic, scalable authorization solution that makes user permissions and authorization simple to implement and manage by writing context-aware access control poli…
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
Parses cron schedules to iterate over datetime objects.
An Open Standard for lineage metadata collection
Collect, aggregate, and visualize a data ecosystem's metadata
Open Source, Google Zanzibar-inspired database for scalably storing and querying fine-grained authorization data