Skip to content
View yurymalkov's full-sized avatar

Organizations

@nmslib

Block or report yurymalkov

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

NanoGPT (124M) in 2 minutes

Python 4,577 606 Updated Feb 1, 2026

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 1,476 83 Updated Feb 4, 2026

Perplexica is an AI-powered answering engine.

TypeScript 28,706 3,055 Updated Jan 10, 2026

Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

Python 5,012 226 Updated Feb 4, 2026

The repo for In-context Autoencoder

Jupyter Notebook 165 20 Updated May 11, 2024

build your own vector database -- the littlest hnsw

Python 67 2 Updated Jan 7, 2025

Command line tool and async library to perform basic file operations on local paths, Google Cloud Storage paths and Azure Blob Storage paths.

Python 36 31 Updated Jan 26, 2026

πŸ›°οΈ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.

C++ 1,538 79 Updated Sep 25, 2025

utilities for decoding deep representations (like sentence embeddings) back to text

Python 1,060 115 Updated Dec 27, 2025

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Python 7,926 568 Updated Jul 11, 2025

JVector: the most advanced embedded vector search engine

Java 1,681 146 Updated Feb 3, 2026

A minimal implementation of diffusion models for text generation

Python 408 38 Updated May 11, 2023

Benchmark for vector databases.

Python 1,003 333 Updated Feb 3, 2026

The implementation of "TabR: Unlocking the Power of Retrieval-Augmented Tabular Deep Learning"

Python 318 34 Updated Nov 17, 2025

Image captioning from scratch (or pre-trained vision/language models) using transformers

Python 7 Updated Feb 13, 2025

πŸš€ efficient approximate nearest neighbor search algorithm collections library written in Rust πŸ¦€ .

Rust 2,657 76 Updated Jan 8, 2026

Hierarchical Navigable Small World (HNSW) algorithm for vector similarity search in PostgreSQL

C 576 27 Updated Dec 14, 2023

HSNW module for Redis

Rust 59 3 Updated Sep 1, 2020

Fast Open-Source Search & Clustering engine Γ— for Vectors & Arbitrary Objects Γ— in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram πŸ”

C++ 3,755 277 Updated Jan 27, 2026

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,088 122 Updated Jun 1, 2023

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Python 933 54 Updated Jul 6, 2024

Numbers every LLM developer should know

4,282 140 Updated Jan 16, 2024

Graph Library for Approximate Similarity Search

C++ 139 26 Updated Sep 9, 2025

hnswlib-wasm attempts to create a browser friendly version of hnswlib

C++ 64 14 Updated Jul 21, 2023

hnswlib-node provides Node.js bindings for Hnswlib

C++ 131 12 Updated Jan 31, 2026

A joint community effort to create one central leaderboard for LLMs.

Python 308 35 Updated Aug 23, 2024

ImageBind One Embedding Space to Bind Them All

Python 8,958 843 Updated Nov 21, 2025

Making large AI models cheaper, faster and more accessible

Python 41,338 4,538 Updated Jan 19, 2026

Universal LLM Deployment Engine with ML Compilation

Python 21,990 1,927 Updated Feb 3, 2026
Next