-
Homerun / Armut
Stars
🏛 Python tool for export all your content of Notion page using official Notion API. Includes: all nested subpages, markdown files and HTMLs, nice urls, downloading locally all its content.
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
An Open Standard for lineage metadata collection
ilum-cloud / marquez
Forked from MarquezProject/marquezCollect, aggregate, and visualize a data ecosystem's metadata
Collect, aggregate, and visualize a data ecosystem's metadata
mozilla / redash
Forked from getredash/redashThis is a Mozilla fork of the re:dash project (https://redash.io/), where we do work to be contributed back to the upstream project and for our own custom needs.
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Make dbt great again! Extend dbt with plugins, local docs and custom adapters — fast, safe, and developer-friendly
Stream Processing and Complex Event Processing Engine
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
Open-Source Web UI for managing Apache Kafka clusters
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
The unofficial python package that returns response of Google Bard through cookie value.
A frictionless integrated platform for notebook
A curated list of data engineering tools for software developers
The best place to learn data engineering. Built and maintained by the data engineering community.
GitHub Classroom automates repository creation and access control, making it easy for teachers to distribute starter code and collect assignments on GitHub.