using sentence-transformer model for embeddings and streamlit for frontend
-
Updated
Jan 14, 2024 - Jupyter Notebook
using sentence-transformer model for embeddings and streamlit for frontend
Exploring topic modeling techniques, sentiment analysis, and classification algorithms using the Kaggle dataset "Tripadvisor Reviews 2023"
最小構成で動く日本語RAGシステム | Minimal Japanese RAG with FastAPI + Chroma + SentenceTransformers + Ollama
Simple API to upload images, store them in S3 and find other similar images among the uploaded ones. Created for a take home interview assignment.
Collection of some of my works during my internship period at Salahkart for preview and educational purpose only.
Data ingestion pipeline using dlt to load data from a REST API into LanceDB.
A movie recommendation system that utilizes FastAPI and Sentence Transformers to suggest films based on plot similarity.
PageSage is a modular web scraping and semantic retrieval system using BeautifulSoup for extraction, SentenceTransformers for dense embeddings, and ChromaDB for scalable vector search—enabling efficient data indexing for advanced RAG pipelines.
A production-grade benchmarking suite that evaluates vector databases (Qdrant, Milvus, Weaviate, ChromaDB, Pinecone, SQLite, TopK) for music semantic search applications. Features automated performance testing, statistical analysis across 15-20 iterations, real-time web UI for database comparison, and comprehensive reporting with production.
AI-powered RAG agent to upload CSV, Excel, PDF & chat with your data using FastAPI, ChromaDB, and Google Gemini API.
A lightweight Retrieval-Augmented Generation (RAG) agent powered by Groq AI and local embeddings, built to process and understand text data efficiently. It retrieves relevant context from your own files and generates accurate, natural-language responses -all while keeping your data private and running locally.
This project demonstrates how to build a complete Retrieval-Augmented Generation (RAG) system using LangChain, ChromaDB, and Sentence Transformers.
“Automating PDF Interaction using LangChain and Open-Source LLMs,” aims to build an intelligent system that allows users to upload a PDF and interact with it through natural language queries.
Using different technique to deal with Text Classification for Sentimental Analsysi from the a base TF-IDF encoding to pre-trained sentence embeddings.
Simple self-hosted embedding model service
Chatbot to answer question from your own database
This repository is dedicated to exploring and implementing vector-based retrieval methods and reranking algorithms. It includes Jupyter notebooks with practical examples and code snippets that demonstrate how these techniques can be applied for efficient information retrieval in various datasets.
This repository is dedicated to exploring and implementing vector-based retrieval methods and reranking algorithms. It includes Jupyter notebooks with practical examples and code snippets that demonstrate how these techniques can be applied for efficient information retrieval in various datasets.
Add a description, image, and links to the sentence-transformers topic page so that developers can more easily learn about it.
To associate your repository with the sentence-transformers topic, visit your repo's landing page and select "manage topics."