- Amsterdam
- dennisulmer.eu
- @dnnslmr.bsky.social
Highlights
- Pro
Stars
UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection
Source code of "Calibrating Large Language Models Using Their Generations Only", ACL2024
Obsidian tools - a Python package for analysing an Obsidian.md vault
Code for ICML 2025 paper | Joint Localization and Activation Editing for Low-Resource Fine-Tuning
A benchmark with locally sourced multilingual questions for 31 languages.
This repository collects papers that cover queer NLP
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"
What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets
Code accompanying paper "Set Norm and Equivariant Skip Connections: Putting the Deep in Deep Sets."
Train transformer language models with reinforcement learning.
IvoBrink / RACDH-old
Forked from oneal2000/MINDReal-time Attribution Classification to Detect Hallucinations
Clean Departure Board mainly for DB Trains running on JavaScript
Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
Uncertainty quantification with PyTorch
Pytorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
An extremely fast Python package and project manager, written in Rust.
A collection of large question answering datasets
PaCMAP: Large-scale Dimension Reduction Technique Preserving Both Global and Local Structure
Repository for "Uncertainty-Aware Machine Translation Evaluation", accepted to Findings of EMNLP 2021.
Code and data for the paper "Disentangling Uncertainty in Machine Translation Evaluation", accepted at EMNLP 2022.
Repository for "BLEU Meets COMET: Combining Lexical and Neural Metrics Towards Robust Machine Translation Evaluation", accepted at EAMT 2023.