We are a research community developing scientifically grounded research outputs and robust deployment infrastructure for broader impact evaluations.
Today, the EvalEval Coalition is beta launching Evaluation Cards, an open-source project for stakeholders across the ...
Our initial post launching the Science of Evaluations workstream outlined a research agenda to document the scientifi...
This workshop focuses on AI evaluation in practice, centering the tensions and collaborations between model developers and evaluation researchers, and aims to surface practical i...
A FAccT 2026 tutorial walking through Every Eval Ever (a community-governed open-source infrastructure unifying evaluation results under a shared metadata schema) and Evaluati...
Researchers, practitioners, and students are welcome to contribute to our mission. Send us an email to learn more about getting involved.
[email protected]