document-search

Here are 84 public repositories matching this topic...

shreyansh-kothari / PDF-Querying-using-TF-IDF-from-Scratch

Given a set of PDFs and the query, the most relevant pdf can be found with the help of TF-IDF. The code has not used any library to implement TF-IDF

python glob pdf-converter python3 tf-idf querying pdfminer document-search pdf-search

Updated Oct 15, 2019
Python

HarshKothari21 / Natural-Language-Processing-Specialization

Star

NLP Course By Deep learning.io powered by @coursera. Taught by: Younes Bensouda Mourri, Instructor of AI at Stanford University and Łukasz Kaiser, Staff Research Scientist at Google Brain.

sentiment-analysis language-translation word-embeddings lsh pca document-search tweet-classification

Updated Aug 30, 2020
Jupyter Notebook

dileepgodithi / DistributedSearch

Star

Distributed document search using TF-IDF algorithm.

java http serialization distributed-systems networking service-discovery protocol-buffers distributed-computing http-client zookeeper http-server tf-idf leader-election document-search

Updated Oct 13, 2020
Java

mdietrichstein / ir-search-engine-rust

Star

Rust-based text search engine from scratch supporting multiple document similarity metrics (TF-IDF, BM25, BM25VA)

search nlp rust search-engine information-retrieval document-similarity document-search

Updated Jun 5, 2021
Rust

AI-STACK-dev / Covid19-Comorbidities-NLP-WEB

Star

COVID-19 comorbidities analysis platform based on Natural Language Processing(NLP)

python nlp document-search covid-19

Updated Nov 16, 2021
JavaScript

Qyokizzzz / simhash

Star

The extended version of simhash supports fingerprint extraction of documents and images.

fingerprint simhash image-search image-deduplication document-search

Updated Aug 22, 2022
Python

neuml / cord19q

Star

COVID-19 Open Research Dataset (CORD-19) Analysis

python search nlp machine-learning medical scientific-papers document-search covid-19

Updated Nov 20, 2022
Python

kcubeterm / achoz

Star

Search through all your personal data efficiently like web search.

search-engine crawler websearch filesearch document-search

Updated Jan 31, 2023
Python

liviobisogni / solr-ocr-indexing

Star

Apache Solr Document Search and Indexing Analysis with OCR

search search-engine pdf ocr solr tesseract indexing leptonica tesseract-ocr optical-character-recognition indexing-engine apache-solr document-search

Updated Apr 19, 2023
Java

zayedrais / DocumentSearchEngine

Star

Document Search Engine project with TF-IDF abd Google universal sentence encoder model

Updated May 1, 2023
Jupyter Notebook

harishartanto / information-retrieval

Star

Information retrieval of text document using TF-IDF weighting & Cosine Similarity Algorithm.

information-retrieval tf-idf document-search cosine-similiarity

Updated Jun 2, 2023
Python

robindekoster / chatgpt-custom-knowledge-chatbot

Star

This open source chatbot project lets you create a chatbot that uses your own data to answer questions, thanks to the power of the OpenAI GPT-3.5 model.

python machine-learning ai chatbot python3 openai gpt knowledge-base document-search contextual-chatbot chatgpt chatgpt-api openai-chatgpt llama-index

Updated Jul 13, 2023
Python

A highly efficient, isomorphic, full-featured, multilingual text search engine library, providing full-text search, fuzzy matching, phonetic scoring, document indexing and more, with micro JSON state hydration/dehydration in-browser and server-side.

nodejs multilingual search-engine browser trie fuzzy-matching full-text-search lucene tf-idf client-side phonetics text-processing bk-tree bm25 text-search document-search damerau-levenshtein-distance document-indexing state-hydration

Updated Jul 21, 2023
TypeScript

easonlai / chatbot_with_pdf_streamlit

Star

This code example shows how to make a chatbot for semantic search over documents using Streamlit, LangChain, and various vector databases. The chatbot lets users ask questions and get answers from a document collection. The code is in Python and can be customized for different scenarios and data.

python azure openai chroma embedding-models semantic-search pinecone vector-similarity document-search vector-search streamlit vector-database azure-cognitive-search gpt-3 azure-openai langchain gpt-35-turbo langchain-python

Updated Sep 3, 2023
Jupyter Notebook

ramezlahzy / search-in-docs

Star

search productivity search-engine documentation information-retrieval utilities indexing full-text-search text-processing document-search

Updated Nov 8, 2023
JavaScript

lethalbit / bookwurm

Sponsor

Star

dead simple document index and search, nothing fancy

document-search document-indexing

Updated Mar 28, 2024
Python

domwal / acervo-digital-pessoal

Star

Website in PHP to index all pdf content and easy way to find any text

javascript mysql css html bootstrap jquery php pdf ajax windows-10 indexing full-text-search document-search php73 linux-debian pdf-search

Updated Apr 15, 2024
PHP

krisluczka / OSSE

Sponsor

Star

Open Source Search Engine with built-in web/document crawler and an indexing method.

search-engine cpp web-crawler web-crawling indexing-engine document-search document-searching web-indexing web-indexer document-indexing

Updated May 4, 2024
C++

EmirhanSyl / TheBSTSearchEngine

Star

Mini desktop search engine with Binary Search Tree

datastructures binary-search-tree data-structures-and-algorithms document-search java-swing-application

Updated May 7, 2024
Java

kunjankanani / Document_Query_Search

Star

Retrieval-Augmented Generation, or RAG, is an innovative approach that enhances the capabilities of pre-trained large language models (LLMs) by integrating them with external data sources. This technique leverages the generative power of LLMs (Large Language Model), and combines it with the precision of specialized data search mechanisms.

document-search rag llm retrieval-augmented-generation document-query-search

Updated Jul 16, 2024
Python

Improve this page

Add a description, image, and links to the document-search topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the document-search topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

document-search

Here are 84 public repositories matching this topic...

shreyansh-kothari / PDF-Querying-using-TF-IDF-from-Scratch

HarshKothari21 / Natural-Language-Processing-Specialization

dileepgodithi / DistributedSearch

mdietrichstein / ir-search-engine-rust

AI-STACK-dev / Covid19-Comorbidities-NLP-WEB

Qyokizzzz / simhash

neuml / cord19q

kcubeterm / achoz

liviobisogni / solr-ocr-indexing

zayedrais / DocumentSearchEngine

harishartanto / information-retrieval

robindekoster / chatgpt-custom-knowledge-chatbot

kyr0 / clientside-search

easonlai / chatbot_with_pdf_streamlit

ramezlahzy / search-in-docs

lethalbit / bookwurm

domwal / acervo-digital-pessoal

krisluczka / OSSE

EmirhanSyl / TheBSTSearchEngine

kunjankanani / Document_Query_Search

Improve this page

Add this topic to your repo