Stars
Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.
Collect, aggregate, and visualize a data ecosystem's metadata
The premier open source Data Quality solution
Duke is a fast and flexible deduplication engine written in Java
Quick start: pip install jsoniq ⛈️ RumbleDB 2.0.0 "Lemon Ironwood" 🌳 for Apache Spark | Run queries on your large-scale, messy datasets (JSON, text, CSV, Parquet, Delta...) | Data Lakehouse with Up…
Shell script automation to support csv2rdf4lod converter
YANGDB Open-source, Scalable, Non-native Graph database (Powered by Elasticsearch)
A highly scalable RDF triple store with full-text and GeoSPARQL support
Java wrapper for the Microsoft Translator API
R2RML Parser is a tool that can export relational database contents as RDF graphs, based on an R2RML mapping document.
EEA ElasticSearch RDF River Plugin
BatchRefine adds batch processing capabilities to OpenRefine
SHACL validation UI, SHACL documentation generator, UML diagram generator from SHACL, SHACL generator from RDF.
HeFQUIN is a query federation engine for heterogeneous federations of graph data sources, including federations of knowledge graphs.
Nouvelle version du logiciel Opentheso avec un nouveau design
A Java implementation of the bloomier filter data structure
RDF storage and SPARQL processing on top of Apache Spark.