You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
🧪Yet Another ICU Benchmark: a holistic framework for the standardization of clinical prediction model experiments. Provide custom datasets, cohorts, prediction tasks, endpoints, preprocessing, and models. Paper: https://arxiv.org/abs/2306.05109
Hospital readmission prediction ML pipeline in R — 6 algorithms (LASSO, Random Forest, SVM, KNN, Naive Bayes, Decision Tree) on 69,984 diabetic patients using Tidymodels + SMOTE
Predicts 30-day hospital readmission risk in diabetic patients using ICD-9 code grouping, random oversampling, and HistGradientBoosting on 101,766 clinical encounters. Best model: F1 = 0.2818, AUC = 0.6704.
Stroke risk prediction using Random Forest in R — ROSE class balancing, 14.03% OOB error rate, confusion matrix evaluation & deployed as .rds model for clinical use
ETL pipeline converting Synthetic K-MIMIC (SYN-ICU) to MEDS format, with a transportable in-hospital mortality benchmark. Submitted to SD4H Workshop @ ICML 2026.
Machine learning pipeline for multi-class treatment prediction in lung adenocarcinoma (LUAD) using patient-level molecular profiles, featuring ensemble-based model aggregation, benchmarking across diverse classifier architectures, and systematic performance evaluation.
Evaluates data efficiency in lung cancer risk prediction using a super-stacking ensemble. Trains models on progressively reduced fractions of the PLCO dataset while keeping a fixed test set, analyzing performance stability, degradation, and robustness under limited data.