Stars
A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enabling layer-wise analysis of hidden states and predictions.
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Web interface for browsing, search and filtering recent arxiv submissions
Discovering Data-driven Hypotheses in the Wild
Zero-shot clinical trial matching with LLMs
Code for KB construction from LLMs - ACL 2025 paper: "Enabling LLM Knowledge Analysis via Extensive Materialization""
SGLang is a high-performance serving framework for large language models and multimodal models.
Code for our WSDM 2022 paper. CLOCQ is a framework which allows efficient access to knowledge bases (KB) for functionalities related to question answering (QA). CLOCQ can retrieve a set of relevant…
Entity linking system for Wikidata updated by your edits in real time
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
Efficient Retrieval Augmentation and Generation Framework
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Examples from knowledge graphs tutorial paper
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
LLM-based ontological extraction tools, including SPIRES
A package for ontology engineering with deep learning and language models.
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https//arxiv.org/abs/2212.01349)
Library for unit extraction - fork of quantulum for python3
Toolkit for creating, sharing and using natural language prompts.
EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering