Skip to content
View yurymalkov's full-sized avatar

Organizations

@nmslib

Block or report yurymalkov

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

NanoGPT (124M) in 3 minutes

Python 3,970 520 Updated Dec 17, 2025

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 1,429 83 Updated Dec 1, 2025

Perplexica is an AI-powered answering engine. It is an Open source alternative to Perplexity AI

TypeScript 27,749 2,897 Updated Dec 19, 2025

Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

Python 5,000 225 Updated Dec 19, 2025

The repo for In-context Autoencoder

Jupyter Notebook 157 19 Updated May 11, 2024

build your own vector database -- the littlest hnsw

Python 67 2 Updated Jan 7, 2025

Command line tool and async library to perform basic file operations on local paths, Google Cloud Storage paths and Azure Blob Storage paths.

Python 35 29 Updated Dec 19, 2025

πŸ›°οΈ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.

C++ 1,526 80 Updated Sep 25, 2025

utilities for decoding deep representations (like sentence embeddings) back to text

Python 1,025 112 Updated Aug 5, 2025

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Python 7,878 567 Updated Jul 11, 2025

JVector: the most advanced embedded vector search engine

Java 1,662 143 Updated Dec 11, 2025

A minimal implementation of diffusion models for text generation

Python 407 37 Updated May 11, 2023

Benchmark for vector databases.

Python 968 305 Updated Dec 19, 2025

The implementation of "TabR: Unlocking the Power of Retrieval-Augmented Tabular Deep Learning"

Python 312 34 Updated Nov 17, 2025

Image captioning from scratch (or pre-trained vision/language models) using transformers

Python 7 Updated Feb 13, 2025

πŸš€ efficient approximate nearest neighbor search algorithm collections library written in Rust πŸ¦€ .

Rust 2,654 77 Updated Jan 31, 2024

Hierarchical Navigable Small World (HNSW) algorithm for vector similarity search in PostgreSQL

C 574 27 Updated Dec 14, 2023

HSNW module for Redis

Rust 59 3 Updated Sep 1, 2020

Fast Open-Source Search & Clustering engine Γ— for Vectors & Arbitrary Objects Γ— in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram πŸ”

C++ 3,479 249 Updated Nov 30, 2025

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,081 122 Updated Jun 1, 2023

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Python 929 53 Updated Jul 6, 2024

Numbers every LLM developer should know

4,275 140 Updated Jan 16, 2024

Graph Library for Approximate Similarity Search

C++ 136 24 Updated Sep 9, 2025

hnswlib-wasm attempts to create a browser friendly version of hnswlib

C++ 60 15 Updated Jul 21, 2023

hnswlib-node provides Node.js bindings for Hnswlib

C++ 126 12 Updated Dec 12, 2025

A joint community effort to create one central leaderboard for LLMs.

Python 308 36 Updated Aug 23, 2024

ImageBind One Embedding Space to Bind Them All

Python 8,905 835 Updated Nov 21, 2025

Making large AI models cheaper, faster and more accessible

Python 41,297 4,545 Updated Dec 8, 2025

Universal LLM Deployment Engine with ML Compilation

Python 21,764 1,889 Updated Dec 11, 2025
Next