Lists (1)
Sort Name ascending (A-Z)
Stars
Function-first R package discovery. Search by what packages do, not what they're called.
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Making civic data contextualized and accessible
Lightweight eval framework for MCP servers, built on mcp-agent
🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.
The U.S. Census Bureau Data API MCP connects AI Assistants with official Census Bureau statistics.
MCP-based Agent Deep Evaluation System
A list of free LLM inference resources accessible via API.
This is a repo with links to everything you'd ever want to learn about data engineering
Tools for computing diversity, integration, and segregation metrics on demographic data.
censusdis is a Python package for discovering, loading and analyzing, U.S. Census demographic, economic, and geographic data and metadata. It is designed to be intuitive and Pythonic, giving users …
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
🐢 Open-Source Evaluation & Testing library for LLM Agents
A curated list of awesome academic research, books, code of ethics, courses, databases, data sets, frameworks, institutes, maturity models, newsletters, principles, podcasts, regulations, reports, …
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collec…
Synthetic data generators for tabular and time-series data
A machine learning toolkit for log parsing [ICSE'19, DSN'16]
Examples and guides for using the OpenAI API
A comprehensive set of fairness metrics for datasets and machine learning models, explanations for these metrics, and algorithms to mitigate bias in datasets and models.
Tensorflow's Fairness Evaluation and Visualization Toolkit