Stars
Pure MLX implementations of UMAP, t-SNE, PaCMAP, TriMap, DREAMS, CNE, MMAE, and NNDescent for Apple Silicon. Metal GPU for computation and video rendering.
Beautiful, open source, WebGPU-based charting library
A fast multi-core implementation of the PLSCAN clustering algorithm.
Vector Index Benchmark for Embeddings (VIBE) is an extensible benchmark for approximate nearest neighbor search methods, or vector indexes, using modern embedding datasets.
Music visualization via UMAP of stable audio latents
IsUMap is a tool for manifold learning, dimension reduction and data visualization
A fast header-only graph-based index for approximate nearest neighbor search (ANNS). https://flatnav.net
Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.
Code repository for the NeurIPS 2024 paper "Navigating the Effect of Parametrization for Dimensionality Reduction".
Simplified implementation of a UMAP-like dimensionality reduction algorithm
Opinionated provides simple, clean stylesheets for plotting with matplotlib and seaborn.
R package implementing edge bundling algorithms
Creating beautiful plots of data maps
A 1D analogue of the MNIST dataset for measuring spatial biases and answering Science of Deep Learning questions.
Some hidden knowledge found in the 20 Newsgroups dataset
Approximate nearest neighbor search in Python.
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.
🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
Official repo for the SOC-Embedding blogpost: https://www.rpisoni.dev/posts/self-organizing-class-embeddings/
Utilities for decoding deep representations (like sentence embeddings) back to text
SIMD-accelerated distances, dot products, matrix ops, geospatial & geometric kernels for 16 numeric types — from 6-bit floats to 64-bit complex — across x86, Arm, RISC-V, and WASM, with bindings fo…
A specification for OpenInference, a semantic mapping of ML inferences
Companion repository to our Lause et al. (2023) preprint "Compound models and Pearson residuals for normalization of single-cell RNA-seq data without UMIs" (bioRxiv)