Stars
The official implementation of NAACL 2025, "Are LLM-Judges Robust to Expressions of Uncertainty? Investigating the effect of Epistemic Markers on LLM-based Evaluation"
Compare two versions of the same API with Doxygen's XML output
Code and data for "Lost in the Middle: How Language Models Use Long Contexts"