🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
-
Updated
Feb 17, 2025 - Python
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch
Parsing-free RAG supported by VLMs
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
Profile-Based Long-Term Memory for AI Applications
The official implementation of ICLR 2020, "Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering".
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)
Neural Search
Deep Recommenders
Add a description, image, and links to the retrieval topic page so that developers can more easily learn about it.
To associate your repository with the retrieval topic, visit your repo's landing page and select "manage topics."