Stars
Automatic ICD coding benchmark based on the MIMIC dataset
Knowledge-Enriched Machine Learning for Tabular Data (NeuS '25)
A TTS model capable of generating ultra-realistic dialogue in one pass.
Our maintained PFN repository. Come here to train SOTA PFNs.
⚡ TabPFN: Foundation Model for Tabular Data ⚡
ML models + benchmark for tabular data classification and regression
A comprehensive toolkit and benchmark for tabular data learning, featuring 35+ deep methods, more than 10 classical methods, and 300 diverse tabular datasets.
Repository for CARTE: Context-Aware Representation of Table Entries
Uncertainty Toolbox: a Python toolbox for predictive uncertainty quantification, calibration, metrics, and visualization
code for "Multitask Vision-Language Prompt Tuning" https://arxiv.org/abs/2211.11720
Code for ICML 2017 paper, SplitNet: Learning to Semantically Split Deep Networks for Parameter Reduction and Model Parallelization
A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
Github for our paper "Typed Markers and Context for Clinical Temporal Relation Extraction"
Rich is a Python library for rich text and beautiful formatting in the terminal.
👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"
Improving Compositional Generalization in Classification Tasks via Structure Annotations, ACL 2021 Short
Flexible Python configuration system. The last one you will ever need.
✅ How Robust are Fact Checking Systems on Colloquial Claims?. In NAACL-HLT, 2021.
Code and results accompanying our paper titled RLSbench: Domain Adaptation under Relaxed Label Shift
Dilated Convolutional Attention Network for Medical Code Assignment from Clinical Text, ClinicalNLP workshop at EMNLP 2020