Stars
Download MeteoSwiss Open Government Data — weather stations, radar, hail, forecasts and climate series — as Parquet, with optional Databricks Delta ingestion
Swiss health insurance data
📍 Repel overlapping text labels away from each other in your ggplot2 figures.
Apache Superset is a Data Visualization and Data Exploration Platform
TabICLv2: A state-of-the-art tabular foundation model
A library to train, evaluate, interpret, and productionize decision forest models such as Random Forest and Gradient Boosted Decision Trees.
A fast, consistent tool for working with data-frame-like objects within Teradata and taking advantage of the Big Data and Machine Learning analytics capabilities of Vantage.
Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy
Lightweight and educational reimplementation of TabPFN https://arxiv.org/pdf/2511.03634
Causal Inference in R Workshop
missForest is a nonparametric, mixed-type imputation method for basically any type of data for the statistical software R.
Shapley Interactions and Shapley Values for Machine Learning
A Random Survival Forest implementation for Python inspired by Ishwaran et al. - Easily understandable, adaptable and extendable.
Equivalence of the offset and weights for Poisson, Gamma and Tweedie regression
Multivariate (multi-response) ensemble learning
Python wrapper around the Stan programming language, designed particularly for use in natural science applications.
Notebooks of the eXplainableAI working group of the German actuarial association