Skip to content
View Mattdl's full-sized avatar

Block or report Mattdl

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An agentic evaluation framework

Python 20 4 Updated Feb 11, 2026
Python 1,500 170 Updated Jun 12, 2026

Platform for stateful agents: AI with advanced memory that can learn and self-improve over time.

Python 23,313 2,485 Updated May 14, 2026

Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

Python 2,462 285 Updated Oct 5, 2025

Readymade evaluators for agent trajectories

Python 617 47 Updated Jun 8, 2026

A Docker sandbox template for running GitHub Copilot CLI in an isolated environment, similar to how Docker supports Claude Code and Gemini CLI via docker sandbox run

Shell 21 2 Updated Jun 13, 2026

Zotero MCP: Connects your Zotero research library with Claude and other AI assistants via the Model Context Protocol to discuss papers, get summaries, analyze citations, and more.

Python 3,791 331 Updated Jun 9, 2026

DSPy: The framework for programming—not prompting—language models

Python 35,013 2,976 Updated Jun 11, 2026

LLM Wiki is a cross-platform desktop application that turns your documents into an organized, interlinked knowledge base — automatically. Instead of traditional RAG (retrieve-and-answer from scratc…

TypeScript 11,391 1,389 Updated Jun 13, 2026

AI agents running research on single-GPU nanochat training automatically

Python 86,593 12,544 Updated Mar 26, 2026

From a goal to a task DAG, automatically. TypeScript-native multi-agent orchestration.

TypeScript 6,373 2,389 Updated Jun 11, 2026

Open-source Claude Code skills and Codex skills for AI-first work. Audit, re-engineer, and bootstrap projects with AI-first design principles.

TypeScript 82 2 Updated Jun 12, 2026

The Multilingual Entity Linking of Occupations (MELO) Benchmark

Python 5 2 Updated Jan 25, 2025

WorkRB: Work Research Benchmark

Python 36 7 Updated Jun 2, 2026

SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings

Perl 68 16 Updated Feb 13, 2025

šŸ¤— Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 161,570 33,498 Updated Jun 13, 2026

Late Interaction Models Training & Retrieval

Python 842 87 Updated Jun 12, 2026

GitHub Mirror of RecPack: Experimentation Toolkit for Top-N Recommendation (see https://gitlab.com/recpack-maintainers/recpack)

Python 22 3 Updated Dec 11, 2023

State-of-the-Art Embeddings, Retrieval, and Reranking

Python 18,805 2,807 Updated Jun 12, 2026

The code used to evaluate embedding models on the Massive Legal Embedding Benchmark (MLEB).

Python 39 6 Updated Feb 24, 2026

MTEB: Massive Text Embedding Benchmark

Python 3,302 623 Updated Jun 13, 2026

🄤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL

Python 1,165 102 Updated May 18, 2026

In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d.First, we start from a population model which has data from many users to learn user-agnostic representations…

Python 15 Updated May 18, 2026

PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.

Jupyter Notebook 1,865 346 Updated Nov 5, 2025

[Spotlight ICLR 2023 paper] Continual evaluation for lifelong learning with neural networks, identifying the stability gap.

Python 35 4 Updated Apr 2, 2023

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,898 402 Updated Mar 27, 2026

HiPlot makes understanding high dimensional data easy

TypeScript 2,800 149 Updated Jan 10, 2024

CVPR 2022 Continual Learning in Computer Vision Workshop Challenge

Python 27 5 Updated Dec 15, 2022

MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.

Cython 3,142 825 Updated Dec 10, 2023

Multi-Joint dynamics with Contact. A general purpose physics simulator.

C++ 13,859 1,575 Updated Jun 13, 2026
Next