vkorukanti

Organizations

@apache

Open, Multi-modal Catalog for Data & AI

Java · 3,222 stars · 555 forks · Updated Dec 16, 2025

Machine Learning Engineering Open Book

Python · 16,059 stars · 987 forks · Updated Dec 10, 2025

A cross-platform way to express data transformation, relational algebra, standardized record expression and plans.

Python · 1,440 stars · 188 forks · Updated Dec 17, 2025

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala · 8,470 stars · 1,971 forks · Updated Dec 18, 2025

Nessie: Transactional Catalog for Data Lakes with Git-like semantics

Java · 1,381 stars · 168 forks · Updated Dec 18, 2025

A collective list of free APIs

Python · 386,274 stars · 41,232 forks · Updated Nov 4, 2025

💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline

Python · 63,666 stars · 4,468 forks · Updated Dec 18, 2025

Spark + HDFS cluster using Docker Compose

Shell · 48 stars · 47 forks · Updated Nov 6, 2018

Apache Calcite

Java · 5,005 stars · 2,453 forks · Updated Dec 18, 2025

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ · 16,270 stars · 3,943 forks · Updated Dec 18, 2025

Apache Hive

Java · 5,948 stars · 4,788 forks · Updated Dec 17, 2025

Apache Drill is a distributed MPP query layer for self-describing data

Java · 2,001 stars · 986 forks · Updated Nov 6, 2025