Skip to content
View aangelopoulos's full-sized avatar

Sponsoring

@KTibow

Highlights

  • Pro

Block or report aangelopoulos

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Helping humans ride the GenAI evaluation wave

Python 25 1 Updated Apr 27, 2026

BullshitBench measures whether AI models challenge nonsensical prompts instead of confidently answering them, created by Peter Gostev.

Python 1,560 60 Updated Apr 28, 2026

Source Code of Arena Leaderboard Methodology

Python 90 6 Updated Feb 20, 2026

Elo was a person

HTML 1 Updated Oct 27, 2024

Prompt-to-Leaderboard

Python 277 23 Updated May 9, 2025
Python 10 1 Updated Feb 20, 2025
Jupyter Notebook 16 3 Updated Feb 20, 2026
Jupyter Notebook 5 3 Updated Jan 31, 2025

🎨 ASCII art library for Python

Python 2,460 156 Updated Apr 27, 2026

a model to generate estimates of the number of outstanding votes on an election night based on the current results of the race

Python 80 7 Updated May 6, 2025
Jupyter Notebook 63 13 Updated May 13, 2025
Python 56 1 Updated Mar 1, 2026
Jupyter Notebook 17 Updated Jul 23, 2025
JavaScript 40 5 Updated Feb 11, 2026
TypeScript 363 27 Updated Dec 1, 2025

Data and analysis for 'Machine Bias'

Jupyter Notebook 684 290 Updated Jun 13, 2017

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 15,507 12,947 Updated Apr 24, 2026

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 19,029 2,425 Updated Apr 7, 2026
Jupyter Notebook 19 5 Updated Mar 6, 2024

A compositional diagramming and animation library as an eDSL in Python

Python 219 8 Updated Nov 25, 2024

This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral, Best Student Paper].

Python 253 25 Updated Jun 11, 2025

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 18,299 2,931 Updated Apr 14, 2026
Python 4,465 484 Updated Apr 22, 2026

OpenAI Assistants API quickstart with Next.js.

TypeScript 1,964 580 Updated Mar 7, 2025

Code for multistep feedback covariate shift conformal prediction experiments in "Conformal Validity Guarantees Exist for Any Data Distribution (and How to Find Them)" (ICML 2024)

Jupyter Notebook 28 3 Updated Jun 28, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,460 4,797 Updated Jun 2, 2025
Next