document-search

Here are 39 public repositories matching this topic...

shreyansh-kothari / PDF-Querying-using-TF-IDF-from-Scratch

Given a set of PDFs and the query, the most relevant pdf can be found with the help of TF-IDF. The code has not used any library to implement TF-IDF

python glob pdf-converter python3 tf-idf querying pdfminer document-search pdf-search

Updated Oct 15, 2019
Python

Qyokizzzz / simhash

Star

The extended version of simhash supports fingerprint extraction of documents and images.

fingerprint simhash image-search image-deduplication document-search

Updated Aug 22, 2022
Python

neuml / cord19q

Star

COVID-19 Open Research Dataset (CORD-19) Analysis

python search nlp machine-learning medical scientific-papers document-search covid-19

Updated Nov 20, 2022
Python

kcubeterm / achoz

Star

Search through all your personal data efficiently like web search.

search-engine crawler websearch filesearch document-search

Updated Jan 31, 2023
Python

harishartanto / information-retrieval

Star

Information retrieval of text document using TF-IDF weighting & Cosine Similarity Algorithm.

information-retrieval tf-idf document-search cosine-similiarity

Updated Jun 2, 2023
Python

robindekoster / chatgpt-custom-knowledge-chatbot

Star

This open source chatbot project lets you create a chatbot that uses your own data to answer questions, thanks to the power of the OpenAI GPT-3.5 model.

python machine-learning ai chatbot python3 openai gpt knowledge-base document-search contextual-chatbot chatgpt chatgpt-api openai-chatgpt llama-index

Updated Jul 13, 2023
Python

lethalbit / bookwurm

Sponsor

Star

dead simple document index and search, nothing fancy

document-search document-indexing

Updated Mar 28, 2024
Python

kunjankanani / Document_Query_Search

Star

Retrieval-Augmented Generation, or RAG, is an innovative approach that enhances the capabilities of pre-trained large language models (LLMs) by integrating them with external data sources. This technique leverages the generative power of LLMs (Large Language Model), and combines it with the precision of specialized data search mechanisms.

document-search rag llm retrieval-augmented-generation document-query-search

Updated Jul 16, 2024
Python

EricSchoebel / DocSpector

Star

Stichwortfinder für Texte in Dokumenten eines Ordners / Keyword Finder for Texts in Documents of a Directory (for English, see README-en.md)

python pyqt5 document-search keyword-search

Updated Oct 7, 2024
Python

tomlin7 / AI-research-assistant

Star

Semantic document search system with pgvector and PGAI

postgres machine-learning natural-language-processing ai sentiment-analysis text-similarity postgresql assistant text-summarization summarization semantic-search sentence-embeddings document-search research-assistant sentence-transformers pgvector ollama pgai

Updated Nov 9, 2024
Python

capjamesg / jamesql

Sponsor

Star

An in-memory NoSQL database implemented in Python.

python nosql nosql-database web-search document-search

Updated Feb 10, 2025
Python

MichaelSykesUK / document-prompter

Star

An interactive GPT-style web application that lets you query folders of PDFs using open-source LLMs from Meta, Microsoft, Google, Mistral, and more.

nlp open-source pdf machine-learning query web-app prompt transform document-search llm

Updated Mar 24, 2025
Python

redis-developer / redis-arXiv-search

Star

Vector search demo with the arXiv paper dataset, RedisVL, HuggingFace, OpenAI, Cohere, FastAPI, React, and Redis.

react nlp redis machine-learning openai arxiv document-retrieval cohere document-search vector-search huggingface vector-database arxiv-papers

Updated Apr 15, 2025
Python

GoodGuyAdy / QueryBaseAI

Star

AI-powered hybrid search engine combining keyword, vector, and LLM-based contextual search using RAG with support for AI21, OpenAI or any other LLM.

elasticsearch django ai django-rest-framework openai document-search rag vector-search milvus llm

Updated May 3, 2025
Python

shekar369 / rag_local_pdfs

Star

Local Retrieval-Augmented Generation (RAG) pipeline using LangChain and ChromaDB to query PDF files with LLMs.

pdf document-search rag ai-agent generative-ai vector-db langchain local-llm cromadb local-llm-integration

Updated May 5, 2025
Python

udabin / knowflow

Star

Internal Knowledge Assistant powered by RAG + LangChain. Upload documents, ask natural language questions, and get contextual answers instantly.

openai document-search faiss rag fastapi llm langchain

Updated May 5, 2025
Python

aimaster-dev / chatbot-using-rag-and-langchain

Star

Chat with your PDFs using AI! This Streamlit app uses RAG, LangChain, FAISS, and OpenAI to let you ask questions and get answers with page and file references.

Updated May 29, 2025
Python

Jivl00 / KIV_IR

Star

Semestrální práce z předmětu Information Retrieval

information-retrieval web-crawler inverted-index tf-idf stemming lemmatization vector-model czech-language boolean-model document-search pyqt5-desktop-application

Updated May 29, 2025
Python

aimaster-dev / SmartRAG

Star

SmartRAG is a terminal-based RAG system using LangGraph. It processes queries by retrieving relevant content from markdown or PDFs, then responds using OpenAI GPT. Supports webpage-to-PDF conversion, vector DB search, and modular flow control.

Updated Jun 17, 2025
Python

laxmanclo / pany

Star

PostgreSQL-native semantic search engine with multi-modal capabilities. Add AI-powered search to your existing database without separate vector databases, vendor fees, or complex setup. Features text + image search using CLIP embeddings, native SQL joins, and 10-minute Docker deployment.

Updated Jul 4, 2025
Python

Improve this page

Add a description, image, and links to the document-search topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the document-search topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

document-search

Here are 39 public repositories matching this topic...

shreyansh-kothari / PDF-Querying-using-TF-IDF-from-Scratch

Qyokizzzz / simhash

neuml / cord19q

kcubeterm / achoz

harishartanto / information-retrieval

robindekoster / chatgpt-custom-knowledge-chatbot

lethalbit / bookwurm

kunjankanani / Document_Query_Search

EricSchoebel / DocSpector

tomlin7 / AI-research-assistant

capjamesg / jamesql

MichaelSykesUK / document-prompter

redis-developer / redis-arXiv-search

GoodGuyAdy / QueryBaseAI

shekar369 / rag_local_pdfs

udabin / knowflow

aimaster-dev / chatbot-using-rag-and-langchain

Jivl00 / KIV_IR

aimaster-dev / SmartRAG

laxmanclo / pany

Improve this page

Add this topic to your repo