Embedding Studio is a framework which allows you transform your Vector Database into a feature-rich Search Engine.
-
Updated
Apr 24, 2025 - Python
Embedding Studio is a framework which allows you transform your Vector Database into a feature-rich Search Engine.
The Real Time Social Media Content Retrieval System fetches real-time LinkedIn posts based on user queries, offering multiple post retrieval and customization options. Although initially focused on LinkedIn, it can be expanded to incorporate other social media platforms, facilitating cross-channel post similarity searches.
asctb-ct-label-mapper: A package to recommend controlled vocabulary for annotations of scRNA-seq datasets. and thereby enable cross-dataset or cross-experiment comparison of annotations.
An essentia-based tool for extracting features from a collection of audio files. Two simple user interfaces, to create playlists and explore track similarities based on extracted audio features and embeddings.
Building representation in the vector space
A Streamlit app to evaluate the accuracy of automatic speech recognition (ASR) transcription services.
Multilingual toolkit for evaluating LLMs using embeddings
RemEz is a descriptive question based learning platform built for students in highly theoretical subjects. The Frontend and Backend of this platform is built with the MERN stack and tailwind. This repository contains nlp code for pdf processing and descriptive QA generation via a LLM along with a similarity assessment of two descriptive answers.
Dockerized application that embeds text in a pgvecto.rs database and retrieves data with a similarity search to generate a response with an llm from ollama.
Word Mini-Game : Guess the secret word ! Play here :
Learning project: modular RAG pipeline for legal document search & Q&A using SBERT, Pinecone, and FastAPI.
Contextual Code Exploration for Developers
Data Collection repository for Reverse Search Engine
RAG Mini Project — Retrieval‑Augmented Generation chatbot with FastAPI backend (Docker on Hugging Face Spaces) and Streamlit frontend (Render), featuring document ingestion, vector search, and LLM‑powered answers
Python library for correcting registry and spelling errors in user input when comparing with a database of texts.
Building an Event Retrieval System from Visual Data participating in Ho Chi Minh's AI Challenge in 2024
A Python dictionary that uses semantic similarity for key matching instead of exact matches. This library allows you to retrieve values using keys that are semantically similar to the ones stored, making it ideal for natural language interfaces, etc.
AI song recommendations based on the feel of a song
Python package for labeing data more efficiently
Add a description, image, and links to the embeddings-similarity topic page so that developers can more easily learn about it.
To associate your repository with the embeddings-similarity topic, visit your repo's landing page and select "manage topics."