Skip to content
View aangelopoulos's full-sized avatar

Sponsoring

@KTibow

Highlights

  • Pro

Block or report aangelopoulos

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Helping humans ride the GenAI evaluation wave

Python 21 1 Updated Apr 3, 2026

BullshitBench measures whether AI models challenge nonsensical prompts instead of confidently answering them, created by Peter Gostev.

Python 1,365 54 Updated Apr 4, 2026

Source Code of Arena Leaderboard Methodology

Python 90 7 Updated Feb 20, 2026

Elo was a person

HTML 1 Updated Oct 27, 2024

Prompt-to-Leaderboard

Python 276 24 Updated May 9, 2025
Python 10 1 Updated Feb 20, 2025
Jupyter Notebook 16 3 Updated Feb 20, 2026
Jupyter Notebook 5 3 Updated Jan 31, 2025

🎨 ASCII art library for Python

Python 2,455 157 Updated Feb 24, 2026

a model to generate estimates of the number of outstanding votes on an election night based on the current results of the race

Python 79 7 Updated May 6, 2025
Jupyter Notebook 64 14 Updated May 13, 2025
Python 56 1 Updated Mar 1, 2026
JavaScript 39 5 Updated Feb 11, 2026
TypeScript 363 28 Updated Dec 1, 2025

Data and analysis for 'Machine Bias'

Jupyter Notebook 678 287 Updated Jun 13, 2017

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 15,421 12,876 Updated Apr 2, 2026

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,847 2,411 Updated Mar 20, 2026
Jupyter Notebook 19 5 Updated Mar 6, 2024

A compositional diagramming and animation library as an eDSL in Python

Python 219 8 Updated Nov 25, 2024

This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral, Best Student Paper].

Python 249 25 Updated Jun 11, 2025

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 18,129 2,912 Updated Mar 26, 2026
Python 4,423 481 Updated Jul 31, 2025

OpenAI Assistants API quickstart with Next.js.

TypeScript 1,962 588 Updated Mar 7, 2025

Code for multistep feedback covariate shift conformal prediction experiments in "Conformal Validity Guarantees Exist for Any Data Distribution (and How to Find Them)" (ICML 2024)

Jupyter Notebook 28 3 Updated Jun 28, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,450 4,789 Updated Jun 2, 2025
Next