Highlights
Stars
Decomposing Global AUC into Cluster-Level Contributions for Localized Model Diagnostics
Automated checks on the contents of a Python package (similar to R CMD check)
Integrated tool for model development and validation
A few public recipes for things I've wanted to do and have solved in dagster
R package to setup a standalone shiny application built with electron
Sources for the book "Machine Learning in Production"
Python based GBDT implementation on GPU. Efficient multioutput (multiclass/multilabel/multitask) training
Preswald is a WASM packager for Python-based interactive data apps: bundle full complex data workflows, particularly visualizations, into single files, runnable completely in-browser, using Pyodide…
A collection of research papers on decision, classification and regression trees with implementations.
a python interface to OC1 and other oblique decision tree implementations
Scikit-learn compatible decision trees beyond those offered in scikit-learn
An R package for modern methods for non-probability samples
A R library of pseudo-random number generators written in C++
A distributed agent orchestration framework for market agents
Causal Inference for the Brave and True. A light-hearted yet rigorous approach to learning about impact estimation and causality.
Materials for the the Analyzing Time Series at Scale with Cluster Analysis in R Workshop
Python interactive dashboards for learning data science
Precinct shapes (and vote results) for US elections past, present, and future
A lightweight version of R Markdown (without using Pandoc or knitr)
Free MLOps course from DataTalks.Club
Hacking & Cybersecurity class materials - Scott J. Shapiro & Sean O'Brien
Demo Project for Open Source MDS