kyungjunleeme
🎯
Focusing

Organizations

@ens4

Starred repositories

Sentiment Analysis of Tweets in real time

Python 6 6 Updated Dec 18, 2025

A time-series database for high-performance real-time analytics packaged as a Postgres extension

C 21,164 1,015 Updated Dec 21, 2025

100+ RAG interview questions with answers.

83 15 Updated Dec 21, 2025

The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊

Clojure 45,263 6,114 Updated Dec 22, 2025

Go 4 1 Updated Dec 19, 2025

Jupyter Notebook 1 3 Updated Aug 26, 2025

The paper list of "Memory in the Age of AI Agents: A Survey"

422 18 Updated Dec 19, 2025

This repository contains the notebooks and presentations we use for our Databricks Tech Talks

HTML 733 445 Updated Jan 6, 2025

A repository of data on coronavirus cases and deaths in the U.S.

6,987 3,421 Updated Apr 2, 2024

Generate the ERD as a code from dbt artifacts

Python 287 38 Updated Dec 14, 2025

A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)

571 148 Updated Dec 2, 2025

MetricFlow allows you to define, build, and maintain metrics in code.

Python 1,417 137 Updated Dec 18, 2025

Python 41 13 Updated May 8, 2025

High-performance automatic differentiation of LLVM and MLIR.

LLVM 1,517 149 Updated Dec 21, 2025

The DuckDB Python package

Python 100 48 Updated Dec 21, 2025

Create full-fledged APIs for slowly moving datasets without writing a single line of code.

Rust 3,373 207 Updated Dec 8, 2025

Knowledge sharing - Material about data-lakes, data warehouses and data lake-houses

5 1 Updated Nov 13, 2025

A highly efficient daemon for streaming data from Kafka into Delta Lake

Rust 424 99 Updated May 5, 2025

Lakehouse (Delta Lake, Apache Iceberg & Apache HUDI)

Jupyter Notebook 5 2 Updated May 21, 2023

API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of BigQuery or Snowflake for AI Agents and Data Apps

C++ 61 3 Updated Nov 18, 2025

The leader in Customer Data Infrastructure

Scala 6,986 1,187 Updated Jun 4, 2025

A fully incremental model, that transforms raw web event data generated by the Snowplow JavaScript tracker into a series of derived tables of varying levels of aggregation.

Shell 64 20 Updated May 28, 2025

Useful macros when performing data audits

388 48 Updated Dec 17, 2025

Compare tables within or across databases

Python 2,992 298 Updated May 17, 2024

dbt Package for modeling raw data exported by Google Analytics 4. BigQuery support, only.

SQL 381 161 Updated Nov 26, 2025

A collection of Python agent samples built with the Google Agent Development Kit (ADK), demonstrating integrations with services like BigQuery and Vertex AI Search.

Jupyter Notebook 9 2 Updated Dec 15, 2025

Batch data ingestion into Amazon OpenSearch Service using AWS Glue

Jupyter Notebook 5 2 Updated Jan 10, 2025

Databricks framework to validate Data Quality of pySpark DataFrames and Tables

Python 358 74 Updated Dec 22, 2025