Stars
FastAPI framework, high performance, easy to learn, fast to code, ready for production
scikit-learn: machine learning in Python
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
SQL databases in Python, designed for simplicity, compatibility, and robustness.
An orchestration platform for the development, production, and observation of data assets.
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
Fast, correct Python JSON library supporting dataclasses, datetimes, and NumPy
Collection of library stubs for Python, with static types
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
Uses tokenized query returned by python-sqlparse and generates query metadata
Parses cron schedules to iterate over datetime objects.
Databricks framework to validate the data quality of PySpark DataFrames and tables
Lightweight SQL DDL parser for extracting tables, columns, and schema metadata with broad multi-dialect support (HQL, TSQL, AWS Redshift, BigQuery, Snowflake and other dialects)