Skip to content
View kamalhakim's full-sized avatar

Organizations

@karamelchef @karamel-lab

Block or report kamalhakim

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 20,976 5,119 Updated Mar 30, 2026

Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...

Jupyter Notebook 645 301 Updated Dec 17, 2023

A command-line tool for launching Apache Spark clusters.

Python 651 119 Updated Dec 13, 2024

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 44,831 16,788 Updated Mar 30, 2026

Ephemeral Hadoop clusters using Google Compute Platform

Java 136 31 Updated Mar 31, 2022

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

Python 18,708 2,454 Updated Mar 18, 2026

Docker container orchestration platform

Java 2,203 230 Updated Sep 12, 2024