Chat with your PDFs using AI! This Streamlit app uses RAG, LangChain, FAISS, and OpenAI to let you ask questions and get answers with page and file references.
-
Updated
May 29, 2025 - Python
Chat with your PDFs using AI! This Streamlit app uses RAG, LangChain, FAISS, and OpenAI to let you ask questions and get answers with page and file references.
The extended version of simhash supports fingerprint extraction of documents and images.
Semestrální práce z předmětu Information Retrieval
AI-powered hybrid search engine combining keyword, vector, and LLM-based contextual search using RAG with support for AI21, OpenAI or any other LLM.
NoteWeb is a local-first AI tool that semantically indexes and searches your documents using LLaMA 3 and vector embeddings.
Chat with your PDF documents using Streamlit, LlamaIndex, and Qdrant. Upload, embed, and search documents with a modern UI—containerized for easy deployment.
RAG-PDF Assistant — A simple Retrieval-Augmented Generation (RAG) chatbot that answers questions using custom PDF documents. It uses HuggingFace embeddings for text representation, stores them in a Chroma vector database, and generates natural language answers with Google Gemini. In this example, the assistant is powered by a few school policy doc
Open-source semantic document search (RAG) engine with FastAPI and instant self-hosted deployment
Stichwortfinder für Texte in Dokumenten eines Ordners / Keyword Finder for Texts in Documents of a Directory (for English, see README-en.md)
Information retrieval of text document using TF-IDF weighting & Cosine Similarity Algorithm.
AI-powered document search and summarisation with FastAPI and Docker
SmartRAG is a terminal-based RAG system using LangGraph. It processes queries by retrieving relevant content from markdown or PDFs, then responds using OpenAI GPT. Supports webpage-to-PDF conversion, vector DB search, and modular flow control.
AI-powered finance policy chatbot with English/Bahasa Malaysia support for hospital employees
dead simple document index and search, nothing fancy
PostgreSQL-native semantic search engine with multi-modal capabilities. Add AI-powered search to your existing database without separate vector databases, vendor fees, or complex setup. Features text + image search using CLIP embeddings, native SQL joins, and 10-minute Docker deployment.
📄 Empower document management with this FastAPI service that uploads, searches, and summarizes text documents using advanced NLP techniques.
A Python-based application that extracts and processes PDF content using a Retrieval-Augmented Generation (RAG) approach. Leverage vector embeddings to enable efficient querying of both text-based and scanned PDFs, and interact with your documents using a large language model.
CLI tools for Google Docs: AI chatbot with two-tier architecture for efficient multi-document queries and analysis
Add a description, image, and links to the document-search topic page so that developers can more easily learn about it.
To associate your repository with the document-search topic, visit your repo's landing page and select "manage topics."