Skip to content
View aangelopoulos's full-sized avatar

Sponsoring

@KTibow

Highlights

  • Pro

Block or report aangelopoulos

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Helping humans ride the GenAI evaluation wave

Python 22 1 Updated Apr 17, 2026

BullshitBench measures whether AI models challenge nonsensical prompts instead of confidently answering them, created by Peter Gostev.

Python 1,483 55 Updated Apr 19, 2026

Source Code of Arena Leaderboard Methodology

Python 92 7 Updated Feb 20, 2026

Elo was a person

HTML 1 Updated Oct 27, 2024

Prompt-to-Leaderboard

Python 276 24 Updated May 9, 2025
Python 10 1 Updated Feb 20, 2025
Jupyter Notebook 16 3 Updated Feb 20, 2026
Jupyter Notebook 5 3 Updated Jan 31, 2025

🎨 ASCII art library for Python

Python 2,456 156 Updated Feb 24, 2026

a model to generate estimates of the number of outstanding votes on an election night based on the current results of the race

Python 79 7 Updated May 6, 2025
Jupyter Notebook 64 14 Updated May 13, 2025
Python 56 1 Updated Mar 1, 2026
Jupyter Notebook 17 Updated Jul 23, 2025
JavaScript 39 5 Updated Feb 11, 2026
TypeScript 364 28 Updated Dec 1, 2025

Data and analysis for 'Machine Bias'

Jupyter Notebook 682 289 Updated Jun 13, 2017

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 15,472 12,923 Updated Apr 8, 2026

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,973 2,423 Updated Apr 7, 2026
Jupyter Notebook 19 5 Updated Mar 6, 2024

A compositional diagramming and animation library as an eDSL in Python

Python 219 8 Updated Nov 25, 2024

This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral, Best Student Paper].

Python 251 25 Updated Jun 11, 2025

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 18,229 2,929 Updated Apr 14, 2026
Python 4,444 483 Updated Jul 31, 2025

OpenAI Assistants API quickstart with Next.js.

TypeScript 1,963 580 Updated Mar 7, 2025

Code for multistep feedback covariate shift conformal prediction experiments in "Conformal Validity Guarantees Exist for Any Data Distribution (and How to Find Them)" (ICML 2024)

Jupyter Notebook 28 3 Updated Jun 28, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,453 4,796 Updated Jun 2, 2025
Next