Stars
An official lightweight library for the RaBitQ algorithm and its applications in vector search.
[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
HAKES: Efficient Data Search with Embedding Vectors at Scale
A book about Ph.D. student and research career planning
[EMNLP 2025 Oral] MemoryOS is designed to provide a memory operating system for personalized AI agents.
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
A Survey on Data Selection for Language Models
The repo for the article: CSPG: Crossing Sparse Proximity Graphs for Approximate Nearest Neighbor Search
Cover Tree implementation in C++ for k-Nearest Neighbours and range search
AlayaLite – A Fast, Flexible Vector Database for Everyone.
A fast header-only graph-based index for approximate nearest neighbor search (ANNS). https://flatnav.net
Implementation of the paper "Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search" by Severo et al.
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
Companion repo for a NeurIPS 2022 paper "A Multilabel Classification Framework for Approximate Nearest Neighbor Search"
⚡ PDX: A Library for Fast Vector Search and Indexing on CPUs (x86, ARM) — for Python and C++. Index millions of vectors in seconds. Search them in milliseconds.
A curated list of Knowledge Graph related learning materials, databases, tools and other resources
A list of papers in the field of approximate nearest neighbor search on high-dimensional vectors.
An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.
Source code for the paper: "tau-LevelIndex: Towards Efficient Query Processing in Continuous Preference Space", SIGMOD 2022
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
UCB EECS126 : probability theory and random processes.
Code for paper: Towards Similarity Graphs Constructed by Deep Reinforcement Learning
Experimental Code for "Unleashing Graph Partitioning for Large-Scale Nearest Neighbor Search"