Flink CDC is a streaming data integration tool
-
Updated
Nov 6, 2025 - Java
Flink CDC is a streaming data integration tool
Maestro: Netflix’s Workflow Orchestrator
Hop Orchestration Platform
The premier open source Data Quality solution
A hackable data integration & analysis tool to enable non technical users to edit data processing jobs and visualise data on demand.
大数据采集,抽取平台,zdh_web是zdh系列服务的可视化管理平台,包含数据采集,调度,权限,审批流,私域营销等模块
ReplicaDB is open source tool for database replication, designed for efficiently transferring bulk data between relational and non-relational databases
Exchangis is a lightweight,highly extensible data exchange platform that supports data transmission between structured and unstructured heterogeneous data sources
An extensible Java framework for building event-driven applications that break up XML and non-XML data into chunks for data integration
🔗 A multipurpose Kafka Connect connector that makes it easy to parse, transform and stream any file, in any format, into Apache Kafka
All development now happens over here: https://github.com/cwensel/cascading. Cascading is a feature rich API for defining and executing complex and fault tolerant data processing workflows on various cluster computing platforms.
Categorical Query Language IDE
In-memory Java DataFrame library
A cross-platform command line tool for parallelised content extraction and analysis.
Metl is a simple, web-based integration platform that allows for several different styles of data integration including messaging, file based Extract/Transform/Load (ETL), and remote procedure invocation via Web Services. Read more at www.jumpmind.com/products/metl/overview
Bender - Serverless ETL Framework
数据可视化, 数据挖掘, 数据处理 ETL分析
Add a description, image, and links to the etl topic page so that developers can more easily learn about it.
To associate your repository with the etl topic, visit your repo's landing page and select "manage topics."