Highlights
Stars
Learning the practice of Monte Carlo simulations in data science (Research Module in Econometrics and Statistics; Master's/PhD)
Python package for Recentered Influence Function (RIF) regression
Action to automatically create installer releases based on a shiny app
Decomposing Global AUC into Cluster-Level Contributions for Localized Model Diagnostics
Automated checks on the contents of a Python package (similar to R CMD check)
Integrated tool for model development and validation
A few public recipes for things I've wanted to do and have solved in dagster
[Prototype][Experimental] Setup a standalone Shiny application built with Electron to run as a Desktop application
Sources for the book "Machine Learning in Production"
Python based GBDT implementation on GPU. Efficient multioutput (multiclass/multilabel/multitask) training
Preswald is a WASM packager for Python-based interactive data apps: bundle full complex data workflows, particularly visualizations, into single files, runnable completely in-browser, using Pyodide…
A collection of research papers on decision, classification and regression trees with implementations.
a python interface to OC1 and other oblique decision tree implementations
Scikit-learn compatible decision trees beyond those offered in scikit-learn
An R package for modern methods for non-probability samples
A R library of pseudo-random number generators written in C++
An agent orchestration framework for economic agents
Causal Inference for the Brave and True. A light-hearted yet rigorous approach to learning about impact estimation and causality.
Materials for the the Analyzing Time Series at Scale with Cluster Analysis in R Workshop
Python interactive dashboards for learning data science
Precinct shapes (and vote results) for US elections past, present, and future
A lightweight version of R Markdown (without using Pandoc or knitr)