A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching methods.
-
Updated
Apr 27, 2026 - Python
A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching methods.
A python tool using XGboost and sentence-transformers to perform schema matching task on tables.
Match schema attributes of relational databases by value similarity. As a study assignment, this isn't well documented, but you can contact me for questions and I may even add docs, if I sense enough interest.
Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup
Python client for the Serene Data Integration software
Benchmark to evaluate schema matching approaches
[Information System] SMUTF: Schema Matching Using Generative Tags and Hybrid Features
Valentine scalable deployment for VLDB demo
Knowledge Graph-based Retrieval-Augmented Generation for Schema Matching
Master thesis - reproducing state-of-the-art schema matching algorithms
Projects for the course Data Engineering held by professor Paolo Merialdo at Roma Tre University.
Query classification (complexity, keywords, SQL type) + schema ranking for RAG systems. Uses DistilBERT for query analysis and Sentence-BERT for table/column relevance. Filters 500 tables in 7ms.
CLI tool for inserting SELECT query results into ClickHouse with automatic schema matching and type-safe casting. Ideal for ETL pipelines and SQL-driven data flows.
The Master Project of Aldi Doanta Kurnia - Master Computer Science student at the University of Twente.
🌮 Table-based KB Completer
Master thesis: Holistic Schema Matching at Scale
Deterministic key and join discovery for structured datasets
Add a description, image, and links to the schema-matching topic page so that developers can more easily learn about it.
To associate your repository with the schema-matching topic, visit your repo's landing page and select "manage topics."