Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
-
Updated
Nov 8, 2024 - Java
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Taier is a big data development platform for submission, scheduling, operation and maintenance, and indicator information display
Schedulis is a high performance workflow task scheduling system that supports high availability and multi-tenant financial level features, Linkis computing middleware, and has been integrated into data application development portal DataSphere Studio
最好的大数据项目。《Titan数据运营系统》,本项目是一个全栈闭环系统,我们有用作数据可视化的web系统,然后用flume-kafaka-flume进行日志的读取,在hive设计数仓,编写spark代码进行数仓表之间的转化以及ads层表到mysql的迁移,使用azkaban进行定时任务的调度,使用技术:Java/Scala语言,Hadoop、Spark、Hive、Kafka、Flume、Azkaban、SpringBoot,Bootstrap, Echart等;
Ambari service for Azkaban
基于DataX的通用数据同步微服务,一个Restful接口搞定所有通用数据同步
Apache DolphinScheduler Kubernetes Operator.
📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
基于Spark的电影推荐系统
Define and schedule workflow, support Flink Jar/SQL, ClickHouse/Hive/Mysql SQL, Shell, etc.
springboot-azkaban job地址https://github.com/poemp/azkaban-data-push-job
Add a description, image, and links to the azkaban topic page so that developers can more easily learn about it.
To associate your repository with the azkaban topic, visit your repo's landing page and select "manage topics."