Skip to content
View kim-dabin's full-sized avatar
🐧
🐧

Organizations

@TheCopiens

Block or report kim-dabin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 1 Updated Aug 8, 2024

📡 Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.

Python 59 12 Updated Jan 18, 2025

Public repository containing sample code for how to improve ETL ingestion processes with Apache Iceberg

Python 5 2 Updated May 4, 2023

The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many mo…

TypeScript 72,144 13,451 Updated Feb 16, 2026

🇰🇷 한국어 사용자를 위한 서비스에 사용하기 위한 오픈 API 모음

Python 3,653 394 Updated Feb 7, 2024

🚢 Docker images and configuration for Citus

Dockerfile 267 102 Updated Feb 12, 2026

System Design Studying can be daunting. This gives you a table to study different problems, understand what components they require, their pros and cons, and how to deal with mitigations.

HTML 551 93 Updated Mar 16, 2025
TypeScript 4 1 Updated Jun 20, 2022

월간채널 — Monthly Channel

177 12 Updated May 7, 2024

Code for blog at: https://www.startdataengineering.com/post/docker-for-de/

C 40 14 Updated Apr 29, 2024
JavaScript 11 8 Updated Nov 16, 2022

Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.

Python 377 50 Updated May 19, 2025

最全的大数据大厂面试宝典,大数据面试题,大数据面试,王傲旗的大数据之路,大数据成神之路,Flink/Spark/Hadoop/Hbase/Hive/Impala/Hbase/MapReduce/YARN/HDFS/Kafka/Flume/Linux/Java/Scala...面试题

Java 64 14 Updated Dec 6, 2021
Python 221 95 Updated May 22, 2024

Making lecture videos readable

Python 76 14 Updated Feb 7, 2023

21 Lessons, Get Started Building with Generative AI

Jupyter Notebook 106,460 57,034 Updated Feb 16, 2026

Spark cluster in docker containers with sample training Jupyter notebooks

Jupyter Notebook 27 25 Updated Feb 24, 2023

The python source code for my Raspberry Pi 4 e-reader.

Python 7 2 Updated Dec 13, 2020

Awesome Docker Compose samples

HTML 43,933 8,024 Updated Feb 12, 2026

Solana Arbitrage Bot on pump.fun, Meteora, Raydium and Orca using Jito bundling, RPC and gRPC. Solana Arbitrage Bot Solana Arbitrage Bot Solana Arbitrage Bot Solana Arbitrage Bot Solana Arbitrage B…

TypeScript 496 213 Updated Nov 21, 2025

豆瓣电影/豆瓣读书 Scarpy 爬虫

Python 785 209 Updated Dec 4, 2023

Scrapy environment with Tor for anonymous ip routing and Privoxy for http proxy

Shell 20 8 Updated Jul 5, 2016

Scrapy middleware with TOR support for more robust scrapers or anonymous scraping.

Python 6 2 Updated Apr 12, 2022

webcrawler using a tor-proxy, elasticsearch and scrapy

Python 6 3 Updated Jan 10, 2023

Scrapy spider to recursively crawl for TOR hidden services

Python 11 4 Updated Oct 12, 2017

The official CLI for Amazon EKS

Go 5,177 1,483 Updated Feb 14, 2026

Airflow TimeTable for korean working days

Python 4 Updated Sep 19, 2022

This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.

Python 1,220 159 Updated Sep 8, 2025
Next