embeddings-similarity

Here are 40 public repositories matching this topic...

casper-vdb / Casper

⚡ World’s Fastest Vector Database for AI & RAG

search search-engine machine-learning nearest-neighbor-search neural-networks image-search recommender-system search-engines similarity-search ai-search knn-algorithm mlops hnsw vector-search vector-database neural-search vector-search-engine embeddings-similarity ai-search-engine

Updated Dec 16, 2025
Python

mmilunovic / m2vdb

Star

vector db built by someone with no idea how to build a vector db

nearest-neighbor-search vector-search vector-database embeddings-similarity retrieval-augmented-generation

Updated Dec 5, 2025
Python

proxectonos / simil-eval

Star

Multilingual toolkit for evaluating LLMs using embeddings

similarity-measures embeddings-similarity llm surprisal llm-evaluation

Updated Dec 5, 2025
Python

cgast / embird

Star

An open-source project for crawling RSS feeds and websites, extracting news content, and storing it with vector embeddings for semantic search, clustering and visualization..

news-aggregator self-hosted embeddings-similarity

Updated Nov 3, 2025
Python

Senju14 / agentic-rag-001

Star

Learning project: modular RAG pipeline for legal document search & Q&A using SBERT, Pinecone, and FastAPI.

python3 rag embeddings-similarity pineconedb groq-ai-api

Updated Oct 16, 2025
Python

RAG Mini Project — Retrieval‑Augmented Generation chatbot with FastAPI backend (Docker on Hugging Face Spaces) and Streamlit frontend (Render), featuring document ingestion, vector search, and LLM‑powered answers

fastapi streamlit embeddings-similarity huggingface-spaces render-deployment retrieval-augmented-generation-rag vector-search-embeddings

Updated Aug 31, 2025
Python

tdeshazo / passage-probe

Star

A command-line tool to index and perform hybrid semantic & lexical search over text files

cli embeddings search-algorithm bm25 embeddings-similarity

Updated Jul 28, 2025
Python

TanuTiwari4722 / RAG_Application

Star

Demonstrating RAG with streamlit.

application pdf-document-processor sentence-transformers embeddings-similarity rag-chatbot

Updated Jul 22, 2025
Python

harehimself / pinecone-lab

Star

Experimenting with Pinecone as vector data continues to take center stage in AI-native systems. The purpose of this project is to explore the core capabilities, benchmark performance across different embedding models, and better understand what is possible with vector search in production environments.

vector embeddings embedding-models semantic-search retrieve-data pinecone rag retrieval-systems vector-search embedding-vectors vector-database embeddings-similarity vectordb vector-database-embedding retrieval-augmented-generation pineconedb pinecone-db retrival-augmented-generation

Updated Jun 30, 2025
Python

eu90h / semantic-dictionary

Star

A Python dictionary that uses semantic similarity for key matching instead of exact matches. This library allows you to retrieve values using keys that are semantically similar to the ones stored, making it ideal for natural language interfaces, etc.

python nlp machine-learning data-structures semantic-similarity embeddings-similarity agentic-ai

Updated Jun 30, 2025
Python

Adityarya11 / Semantic-search-transcript

Star

A Python-based semantic search system to find relevant transcript chunks based on user queries. Supports TF-IDF and Hugging Face LLM (llm2) search methods, with a Streamlit web interface and CLI for interactive querying. Outputs results in the format [timestamp], <chunk> and logs them to output/output.txt.

streamlit sklearn-metrics sentence-transformers embeddings-similarity langchain

Updated May 28, 2025
Python

jaypinho / transcript-accuracy

Star

A Streamlit app to evaluate the accuracy of automatic speech recognition (ASR) transcription services.

asr embeddings-similarity llms

Updated May 23, 2025
Python

EulerSearch / embedding_studio

Star

Embedding Studio is a framework which allows you transform your Vector Database into a feature-rich Search Engine.

search-engine embeddings semantic-similarity search-algorithm query-parser fine-tuning unstructured-data vector-database embeddings-similarity unstructured-search llm-inference search-query-parser

Updated Apr 24, 2025
Python

ragu-manjegowda / RAGify

Star

Contextual Code Exploration for Developers

python cpp rag treesitter embeddings-similarity llm

Updated Feb 20, 2025
Python

wetrocloud / WetroLearn-TextEmbeddings

Star

A Streamlit app to visualize text similarity using embeddings and cosine distance. Compare and analyze texts interactively!

machine-learning natural-language-processing artificial-intelligence plagiarism-detection embeddings-similarity

Updated Feb 15, 2025
Python

Med-Karim-Ben-Boubaker / localume

Star

Localume is a powerful desktop application that enables semantic search across your documents using advanced vector embeddings and retrieval technology. The application monitors specified directories in real-time, automatically indexing new and modified files to maintain an up-to-date searchable database.

search filesystem python3 multi-modal tkinter-gui vector-database embeddings-similarity faiss-vector-database

Updated Jan 22, 2025
Python

Masetto96 / music-collection-analyzer

Star

An essentia-based tool for extracting features from a collection of audio files. Two simple user interfaces, to create playlists and explore track similarities based on extracted audio features and embeddings.

audio-analysis essentia streamlit embeddings-similarity

Updated Dec 5, 2024
Python

comhendrik / vectorMatch

Star

Dockerized application that embeds text in a pgvecto.rs database and retrieves data with a similarity search to generate a response with an llm from ollama.

python nlp docker docker-compose vector postgresql vector-database embeddings-similarity ollama pgvecto-rs

Updated Dec 1, 2024
Python

MinLee0210 / evento

Star

Building an Event Retrieval System from Visual Data participating in Ho Chi Minh's AI Challenge in 2024

translations life-log embeddings-similarity retrieval-system event-retrieval

Updated Nov 17, 2024
Python

Babelscape / CroCoAlign

Star

A Cross-Lingual, Context-Aware and Fully-Neural Sentence Alignment System for Long Texts.

nlp machine-translation sentence-embeddings sentence-alignment bilingual-corpora multilinguality embeddings-similarity

Updated Sep 11, 2024
Python

Improve this page

Add a description, image, and links to the embeddings-similarity topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the embeddings-similarity topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

embeddings-similarity

Here are 40 public repositories matching this topic...

casper-vdb / Casper

mmilunovic / m2vdb

proxectonos / simil-eval

cgast / embird

Senju14 / agentic-rag-001

korupolujayanth2004 / Rag

tdeshazo / passage-probe

TanuTiwari4722 / RAG_Application

harehimself / pinecone-lab

eu90h / semantic-dictionary

Adityarya11 / Semantic-search-transcript

jaypinho / transcript-accuracy

EulerSearch / embedding_studio

ragu-manjegowda / RAGify

wetrocloud / WetroLearn-TextEmbeddings

Med-Karim-Ben-Boubaker / localume

Masetto96 / music-collection-analyzer

comhendrik / vectorMatch

MinLee0210 / evento

Babelscape / CroCoAlign

Improve this page

Add this topic to your repo