Skip to content
View tahakashaf's full-sized avatar
🎯
Focusing
🎯
Focusing
  • ING Analytics
  • Amsterdam

Block or report tahakashaf

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Apache NiFi cluster running in Kubernetes

60 25 Updated Sep 16, 2025

Cotyledon provides a framework for defining long-running services.

Python 91 19 Updated Aug 31, 2025

Display and control your Android device

C 130,670 12,233 Updated Nov 4, 2025

Monitor the stability of a Pandas or Spark dataframe ⚙︎

Python 507 36 Updated Sep 4, 2025

English SDK for Apache Spark

Python 876 137 Updated Jun 12, 2024

Produce data for ITR tool using data from Data Commons

Jupyter Notebook 3 4 Updated Nov 3, 2025

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…

Rust 5,650 464 Updated Nov 5, 2025

Alluxio, data orchestration for analytics and machine learning in the cloud

Java 7,097 2,954 Updated Apr 29, 2025

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 12,112 3,371 Updated Nov 5, 2025

Apache Flink

Java 25,447 13,750 Updated Nov 5, 2025

MonitoFi: Health & Performance Monitor for your Apache NiFi

Python 67 14 Updated Aug 1, 2023

Make stream processing easier! Easy-to-use streaming application development framework and operation platform.

Java 4,223 1,046 Updated Nov 5, 2025

Magnificent app which corrects your previous console command.

Python 94,583 3,797 Updated Jul 19, 2024

Papers from the computer science community to read and discuss.

Shell 99,964 6,155 Updated Oct 10, 2025

Dedicated Resources for the Low-Level System Design. Learn how to design and implement large-scale systems. Prep for the system design interview.

7,365 2,395 Updated Jan 1, 2024

On the fly, translation of Spark programs to run natively on your Oracle DB. Your Spark programs require no changes.

Scala 35 10 Updated Apr 15, 2025

An attempt to answer the age old interview question "What happens when you type google.com into your browser and press enter?"

42,427 5,719 Updated Aug 19, 2024

A collection of useful .gitignore templates

170,500 83,001 Updated Sep 10, 2025

Mysqldump, writing in postgresql format

Ruby 716 154 Updated Aug 28, 2020

Tool for migrating/converting from mysql to postgresql.

Python 465 170 Updated Aug 4, 2021

Always know what to expect from your data.

Python 10,896 1,642 Updated Nov 5, 2025

Learn to build a basic machine learning model from scratch with this repo and tutorial series.

Jupyter Notebook 70 37 Updated Dec 8, 2022

Remote shuffle service for Apache Spark to store shuffle data on remote servers.

Java 336 100 Updated Sep 29, 2023

Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code

Scala 296 34 Updated Jan 31, 2025

How to systematically secure anything: a repository about security engineering

10,165 705 Updated Mar 7, 2023

AWS ECS Quickstart

Shell 2 Updated May 11, 2020

a curated list of awesome streaming frameworks, applications, etc

2,915 311 Updated Jul 24, 2025

STIG-Partitioned Enterprise Linux (spel)

Shell 102 63 Updated Oct 30, 2025
Next