Stars
🛁 Clean Code concepts adapted for Python
An opinionated framework for deploying, managing, and serving application workloads
CRAN R package: Impute missing values based on automated variable selection
CRAN R Package: Time Series Missing Value Imputation
Tabular data imputation and generation, with flexible modeling of quantitative features via hierarchical binning (TMLR, 2025)
Book repository for The Turing Way: a how-to guide for reproducible, ethical and collaborative data science
A tool for exploring each layer in a docker image
Public release of Telepathy, an OSINT toolkit for investigating Telegram chats.
Python code for "Probabilistic Machine Learning" book by Kevin Murphy
💥💻💥 A data-parallel functional programming language
cotainr - a user space Apptainer/Singularity container builder.
A technical explainer by @kognise of how your computer runs programs, from start to finish.
Apache Spark - A unified analytics engine for large-scale data processing
Code for the data memo `Censorship on YouTube During Russia’s Invasion of Ukraine`
GNU Radio – the Free and Open Software Radio Ecosystem
PySDR.org textbook source material, feel free to post issues/PRs
A Highly Accessible and Automated Virtualization Platform for Security Education
Full-day lesson material for teaching novices the basics of using an HPC cluster
Online resources that will help you prepare for taking the CNCF CKA 2020 "Kubernetes Certified Administrator" certification exam. With time, this is not likely to be a comprehensive, up-to-date list - p…
A complete computer science study plan to become a software engineer.
Docker Certification Associate preparation guide - a list of resources to help you prepare for a successful certification
Solutions for the Practical Data Science Specialization on Coursera (offered by deeplearning.ai)
ModelarDB: Model-Based Time Series Management from Edge to Client
Official repository for pygrametl - ETL programming in Python