Skip to content
View willeppy's full-sized avatar
⛷️
⛷️

Organizations

@cmudig

Block or report willeppy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A benchmark to evaluate AI Agents in social domains.

Python 16 4 Updated Jul 2, 2026

The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!

Python 5,546 763 Updated Jul 2, 2026

An agent benchmark with tasks in a simulated software company.

Python 736 119 Updated Nov 17, 2025

Magentic-Marketplace: Simulate Agentic Markets and See How They Evolve

Python 173 38 Updated Mar 1, 2026

Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.

TypeScript 4,848 304 Updated Jul 1, 2026

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …

HTML 15,062 1,262 Updated Jun 24, 2026
TypeScript 71 6 Updated Feb 11, 2026

🙌 OpenHands: AI-Driven Development

Python 79,201 10,078 Updated Jul 3, 2026

Python tool for converting files and office documents to Markdown.

Python 162,502 11,480 Updated Jun 24, 2026

LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR team.

JavaScript 528 56 Updated Feb 11, 2025
Python 3 Updated Oct 16, 2024

🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.

Python 603 69 Updated May 31, 2026

Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-level concepts to analyze unstructured text.

Python 163 23 Updated Jun 4, 2025

An Improved Langchain RAG Tutorial (v2) with local LLMs, database updates, and testing.

Python 959 608 Updated Aug 3, 2024

Visualize your text data with structured attributes

Svelte 34 Updated Apr 11, 2025

A programming framework for agentic AI

Python 59,447 8,949 Updated Apr 15, 2026
Python 10 2 Updated Jun 21, 2026

A Bulletproof Way to Generate Structured JSON from Language Models

Jupyter Notebook 4,931 183 Updated Feb 24, 2024

An extensible framework for linking databases and interactive views.

TypeScript 1,318 114 Updated Jul 2, 2026

Angler: Machine Translation Visualization (CHI 2023)

TypeScript 67 1 Updated Jul 12, 2023

prompt2model - Generate Deployable Models from Natural Language Instructions

Python 2,016 182 Updated Dec 29, 2024

automatic data slicing

Jupyter Notebook 35 9 Updated Aug 31, 2021

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Python 3,249 615 Updated Jul 19, 2024

An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collec…

Jupyter Notebook 2,824 142 Updated Jan 10, 2025

JupyterLab desktop application, based on Electron.

TypeScript 4,246 476 Updated Jul 2, 2026

Jupyter extensions that help you write code faster: Context aware AI Chat, Autocomplete, and Spreadsheet

Jupyter Notebook 2,639 205 Updated Jun 22, 2026

reusable widgets made easy

Python 902 70 Updated Jun 26, 2026

A Python library for anomaly detection across tabular, time series, graph, text, image, and audio data. 60+ detectors, benchmark-backed ADEngine orchestration, and an agentic workflow for AI agents.

Python 9,895 1,477 Updated Jun 17, 2026
Next