-
pytest-dbt-duckdb Public
Test your DBT code (i.e. Snowflake) using DuckDB as the isolated engine
-
lightdash Public
Forked from lightdash/lightdashSelf-serve BI to 10x your data team ⚡️
TypeScript Other UpdatedOct 22, 2025 -
-
OpenMetadata Public
Forked from open-metadata/OpenMetadataOpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
TypeScript Apache License 2.0 UpdatedMay 28, 2025 -
-
airflow-charts Public
Forked from airflow-helm/chartsThe User-Community Airflow Helm Chart is the standard way to deploy Apache Airflow on Kubernetes with Helm. Originally created in 2017, it has since helped thousands of companies create production-…
Shell Apache License 2.0 UpdatedSep 18, 2024 -
pytest-dbt-postgres Public
Unittest DBT Postgres projects
-
mini-data-platform Public
Mini Data Platform
-
-
airflow-aws-shared-secrets Public
SecretsManagerBackend with cross-account access
-
dbt-core Public
Forked from dbt-labs/dbt-coredbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Python Apache License 2.0 UpdatedJan 30, 2024 -
data-access-layer Public
Library to facilitate accessing Data from Databricks
Python Apache License 2.0 UpdatedFeb 17, 2023 -
kafdrop Public
Forked from obsidiandynamics/kafdropKafka Web UI
Java Apache License 2.0 UpdatedOct 7, 2022 -
helm-charts Public
Forked from lightdash/helm-chartsLightdash Community helm charts
Shell UpdatedJun 28, 2022 -
-
spark-json-schemas Public
Create Spark schemas using JSON-schemas
Scala Apache License 2.0 UpdatedFeb 27, 2022 -
rudderstack-helm Public
Forked from logankopas/rudderstack-helmOpen-source, warehouse-first Customer Data Pipeline and Segment-alternative. Collects and routes clickstream data and builds your customer data lake on your data warehouse.
Mustache MIT License UpdatedJan 31, 2022 -
datahub-helm Public
Forked from acryldata/datahub-helmRepository of helm charts for deploying DataHub on a Kubernetes cluster
Mustache Apache License 2.0 UpdatedAug 4, 2021 -
airbyte Public
Forked from airbytehq/airbyteAirbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Java MIT License UpdatedJun 30, 2021 -
aws-glue-data-catalog-client-for-apache-hive-metastore Public
Forked from awslabs/aws-glue-data-catalog-client-for-apache-hive-metastoreThe AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational m…
Java Apache License 2.0 UpdatedJun 2, 2021 -
datahub Public
Forked from datahub-project/datahubA Generalized Metadata Search & Discovery Tool
TypeScript Apache License 2.0 UpdatedMay 25, 2021 -
prefect Public
Forked from PrefectHQ/prefectThe easiest way to automate your data
Python Apache License 2.0 UpdatedJan 6, 2021 -
-
-
redis-poc Public
Notification System PoC with ZSETs using the time to send as Scores
Scala UpdatedAug 1, 2019 -
rabbitmq-poc Public
Notification System PoC with Delayed/Expired queues.
-
json-schema Public
Forked from everit-org/json-schemaJSON Schema validator for java, based on the org.json API
-
kafka-connect-field-and-time-partitioner Public
Forked from canelmas/kafka-connect-field-and-time-partitionerKafka Connect Store Partitioner by a custom field and time
Java Apache License 2.0 UpdatedJun 3, 2019 -
quinn Public
Forked from mrpowers-io/quinnpyspark methods to enhance developer productivity 📣 👯 🎉
Python UpdatedApr 8, 2019 -
awesome-spark Public
Forked from awesome-spark/awesome-sparkA curated list of awesome Apache Spark packages and resources.