Stars
Multilingual embedding
4 repositories
A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.
MTEB: Massive Text Embedding Benchmark
Code repository for the paper - "Matryoshka Representation Learning"
A library for efficient similarity search and clustering of dense vectors.