apache-superset
Here are 37 public repositories matching this topic...
📡 Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.
Updated Jan 18, 2025 - Python
This is my Apache Airflow Local development setup on Windows 10 WSL2/Mac using docker-compose. It will also include some sample DAGs and workflows.
Updated Feb 9, 2024 - Python
Updated Jun 25, 2018 - Python
Demonstrates Apache Superset integrated with a Django application, with a custom authentication layer
Updated Jul 16, 2018 - Python
An end-to-end ETL data pipeline that uses PySpark parallel processing to handle about 25 million rows of data from a SaaS application. Apache Airflow orchestrates the pipeline alongside various data warehouse technologies, and Apache Superset connects to the data warehouse to generate BI dashboards for weekly reports.
Updated Dec 7, 2022 - Python
A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from locally hosted Airflow containers. The end product is a Superset dashboard and a Postgres database, hosted on an EC2 instance at this address (powered down):
Updated May 14, 2022 - Python
Apache Solr dialect for SQLAlchemy
Updated Dec 16, 2025 - Python
Run an open-source data LakeHouse locally using Docker Compose
Updated May 31, 2024 - Python
A Smart Traffic Management System for Ho Chi Minh City, Vietnam leveraging batch and real-time data processing, intuitive dashboards, and monitoring tools to optimize traffic flow, enhance safety, and support sustainable urban mobility through advanced analytics and user-friendly applications.
Updated Jan 17, 2025 - Python
Updated Nov 13, 2025 - Python
Building a Data Lakehouse for Analyzing Elon Musk Tweets using MinIO, Apache Airflow, Apache Drill and Apache Superset
Updated Feb 4, 2023 - Python
Updated Oct 7, 2020 - Python
This project provides a SQLAlchemy driver for Apache Ignite. It was built to enable (ad-hoc) data exploration and visualization of datasets managed by Apache Ignite.
Updated Nov 3, 2021 - Python
This project was created as part of an assessment for DigitalXC AI. It demonstrates a cloud-based ELT pipeline using AWS MWAA, Airflow, dbt, PostgreSQL, and Superset. The pipeline automates data ingestion from S3, transformation with dbt, and visualization through Superset, following modern data engineering practices on a scalable AWS architecture.
Updated Jul 1, 2025 - Python
Apache Superset - Authentication Bypass
Updated Jun 24, 2024 - Python
Real‑time/historical analytics dashboard suite unifying multi‑source enterprise data (social media, server, HR, financial, marketing, transactions) using data integration, streaming, visualization and ML techniques
Updated Mar 27, 2025 - Python
Data engineering (DE) project to keep track of my personal health metrics in a data warehouse
Updated Feb 1, 2024 - Python