The Metadata Platform for your Data and AI Stack
Updated Apr 4, 2026 - Java
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column-level lineage, and seamless team collaboration.
Amundsen is a metadata-driven application for improving the productivity of data analysts, data scientists, and engineers when interacting with data.
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
Intake is a lightweight package for finding, investigating, loading and disseminating data.
📙 Awesome Data Catalogs and Observability Platforms.
🐳 The stupidly simple CLI workspace for your data warehouse.
Marmot is an open-source data catalog designed for teams who want powerful data discovery without enterprise complexity. Catalog every data asset, enrich it with the context that matters and make it accessible to your team and your AI tools.
Work with your web service, database, and streaming schemas in a single format.
Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and DataHub.
Meteor is a metadata collection agent that connects to databases, warehouses, dashboards, pipelines, and infrastructure to extract and deliver rich observations that power your organization's context graph.
An intake plugin for parsing an Earth System Model (ESM) catalog and loading assets into xarray datasets.
The GenAI-powered toolkit for automated data intelligence.
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
The World's Most Comprehensive, Authoritative, and Structured Open Source Data Source Knowledge Base
Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.
Reference Architectures for Datalakes on AWS
Sample code with integration between Data Catalog and RDBMS data sources.
End-to-end DataOps platform deployed by Terraform.