Skip to content
View NohTow's full-sized avatar
🚤
shipping
🚤
shipping

Block or report NohTow

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MTEB: Massive Text Embedding Benchmark

Python 2,951 499 Updated Nov 4, 2025

Datamodels for hugging face tokenizers

Python 85 4 Updated Nov 5, 2025

RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

Python 3,499 350 Updated Nov 3, 2025

Fast State-of-the-Art Static Embeddings

Python 1,890 106 Updated Oct 11, 2025

State-of-the-art paired encoder and decoder models (17M-1B params)

Python 53 3 Updated Aug 6, 2025

Open-source personal bookmarks search engine

Python 692 37 Updated Nov 5, 2025

PyLate efficient inference engine

Rust 66 7 Updated Sep 11, 2025
Python 44 3 Updated Jul 10, 2025

The first dense retrieval model that can be prompted like an LM

Python 89 5 Updated May 8, 2025

A holistic framework to construct realistic evaluation datasets

Python 28 3 Updated Jun 16, 2025

High-Performance Engine for Multi-Vector Search

Rust 180 11 Updated Oct 29, 2025
Python 85 6 Updated Jul 4, 2025

Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.

Python 1,051 83 Updated Jul 5, 2025

Extract full next-token probabilities via language model APIs

Python 247 13 Updated Feb 23, 2024

Bringing BERT into modernity via both architecture changes and scaling

Python 1,559 129 Updated Jun 30, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,128 2,426 Updated Nov 5, 2025

Crispy reranking models by Mixedbread

Python 38 6 Updated Sep 17, 2025

Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library

Python 232 43 Updated Nov 5, 2025

XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.

Python 169 13 Updated May 3, 2025

Train Models Contrastively in Pytorch

Python 754 61 Updated Mar 26, 2025

A plug-&-play watermark for LLMs with no impact on text quality.

Python 8 Updated Sep 30, 2024

One-stop shop for running and fine-tuning transformer-based language models for retrieval

Python 59 17 Updated Nov 5, 2025

Schedule-Free Optimization in PyTorch

Python 2,229 68 Updated May 21, 2025

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 27,576 2,500 Updated Sep 30, 2025

Efficient BM25 with DuckDB 🦆

Python 58 2 Updated Dec 20, 2024

The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching of inference workloads.

Python 151 11 Updated Jul 14, 2025

Use late-interaction multi-modal models such as ColPali in just a few lines of code.

Python 828 92 Updated Jan 28, 2025

Toolkit for creating, sharing and using natural language prompts.

Python 2,960 377 Updated Oct 23, 2023

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Python 3,749 260 Updated May 17, 2025
Next