Skip to content
View irfanghat's full-sized avatar
๐Ÿก
๐Ÿก

Organizations

@lexara-prime @spendr-finance-tracker @Embra-Connect-ETL

Block or report irfanghat

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

pyspark pagerank script

Jupyter Notebook 3 1 Updated Jan 17, 2020

The open-source AI voice studio. Clone, dictate, create.

TypeScript 31,890 3,911 Updated Apr 26, 2026

IP addresses break, dial keys instead. Modular networking stack in Rust.

Rust 10,506 478 Updated Jun 22, 2026

๐Ÿ”ฅ Feature-rich interactive Jira command line.

Go 5,748 389 Updated Jan 20, 2026

Embed PostgreSQL database

Rust 378 34 Updated Jun 10, 2026

Venice, Derived Data Platform for Planet-Scale Workloads.

Java 610 120 Updated Jun 19, 2026

A pure-Ruby client for Apache Spark Connect: a PySpark-style DataFrame API over gRPC.

Ruby 4 1 Updated Jun 15, 2026

A cloud native embedded storage engine built on object storage.

Rust 3,131 250 Updated Jun 22, 2026

Repository for the Crafting Interpreters book

Java 12 2 Updated Apr 21, 2024

[CNCF Sandbox Project] Managing your Kubernetes clusters (including public, private, edge, etc.) as easily as visiting the Internet

Go 1,439 208 Updated Jun 22, 2026

Compare Icebug vs Graphframes

Python 1 Updated May 7, 2026

Website for Distributed Computing 4 Kids w/ coming soon

CSS 3 1 Updated May 2, 2026

Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews

Jupyter Notebook 235 222 Updated Dec 31, 2025

Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

Python 46,348 3,220 Updated Jun 22, 2026

DuckDB is an analytical in-process SQL database management system

C++ 38,940 3,350 Updated Jun 22, 2026

A post-modern modal text editor.

Rust 44,964 3,563 Updated Jun 22, 2026

Simplified DOM Trees for Transferable Attribute Extraction from the Web

Python 42 10 Updated Sep 27, 2024

An open protocol for secure data sharing

Scala 951 229 Updated Jun 13, 2026

Quanton Operator is a Kubernetes operator that extends kubeflow/spark-operator to run Apache Spark jobs using the Quanton compute engine by Onehouse. Quanton is a purpose-built query execution engiโ€ฆ

Python 17 7 Updated Jun 15, 2026

Official repository of the SOCI - The C++ Database Access Library

C++ 1,602 516 Updated Jun 19, 2026

The Feldera Incremental Computation Engine

Rust 1,932 133 Updated Jun 22, 2026

Zstandard - Fast real-time compression algorithm

C 27,272 2,514 Updated Jun 1, 2026

A microbenchmark support library

C++ 10,247 1,770 Updated Jun 22, 2026

A unified python SDK supports OceanBase or OceanBase seekdb, more efficient and easy-to-use.

Python 62 32 Updated Jun 20, 2026

MiniOB is a compact database that assists developers in understanding the fundamental workings of a database.

C++ 4,378 1,625 Updated Jun 8, 2026

The Fastest Distributed Database for Transactional, Analytical, and AI Workloads.

C++ 10,159 1,903 Updated Jun 19, 2026

GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs

Scala 1,187 268 Updated Jun 17, 2026

A virtual CPU emulator featuring a custom instruction set and a compiler that generates executable bytecode.

C++ 21 6 Updated Mar 7, 2026

Apache Spark Native DataSource for Safetensors

Scala 6 1 Updated Feb 24, 2026

Apache Spark Sandbox with patterns often focusing on Microsoft Fabric

TypeScript 15 3 Updated Jun 21, 2026
Next