castorini / pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Updated Dec 15, 2025
Python

jackfrancis1110 / Information-Retrieval-Systems-Course-Projects

css docker search-engine crawler machine-learning information-retrieval course retrieval ntlk thesaurus indexing information-extraction dbpedia scrapy recommender-system mysqli boolean-retrieval ranked-retrieval

Updated Dec 15, 2025
Python

onyx-dot-app / onyx

Open Source AI Platform - AI Chat with advanced features that works with every LLM

python information-retrieval ai nextjs enterprise-search chatui rag ai-chat llm chatgpt gen-ai llm-ui

Updated Dec 15, 2025
Python

amanuel1131417512 / arxiv-paper-curator

📚 Curate arXiv papers effectively using a modern AI approach with Retrieval-Augmented Generation to enhance your learning and research experience.

data-science machine-learning natural-language-processing information-retrieval automation text-analysis web-scraping user-interface arxiv document-management content-discovery academic-papers research-tools citation-management paper-curation

Updated Dec 15, 2025
Python

ToBeGreat5 / Mimir

🔒 Enable secure federated autoregressive inference for multiple parties using a shared model while keeping private inputs confidential.

cli ioc information-retrieval osint interface graph-algorithms vector intel prometheus indexing nmap graph-database codebase threatintel semantic-search observability honeydb multi-agent-system

Updated Dec 15, 2025
Python

Separative-involucre520 / SearchPaperByEmbedding

🔍 Search for similar academic papers using semantic search. Utilize local models or OpenAI API for high-quality results.

python api data-science machine-learning natural-language-processing information-retrieval deep-learning text-similarity embeddings neural-networks recommendation-systems vector-database academic-search research-tools search-papers

Updated Dec 15, 2025
Python

codefusion-24 / deep-research-ai

🧠 Boost research efficiency with Deep Research AI, an advanced multi-agent system that leverages cutting-edge reasoning techniques for smarter insights.

react python search information-retrieval research ai nextjs openai gpt agents hacktoberfest ai-agent llm hacktoberfest2025 chatgpt openrouter agent-builder exa-search

Updated Dec 15, 2025
Python

ggrbipin / AI-Research-Assistant

🤖 Enhance your research efficiency with an AI-powered assistant that analyzes documents and provides insights through a smart multi-agent system.

react agent information-retrieval web transformers openai free gpt language-model semantic-search ai-agents ai-research rag obsidian-plugin obsidian-md hacktoberfest2025 chatgpt agent-builder

Updated Dec 15, 2025
Python

britorbs / consciousdb

🗄️ Streamline data analysis with ConsciousDB, a vector database that integrates directly with your models for enhanced performance and ease of use.

information-retrieval coherence reranking graph-optimization energy-minimization explainable-ai rag vector-search vector-database retrieval-augmented-generation

Updated Dec 15, 2025
Python

Prashntgtm / chatbot-search-agent

🔍 Access multiple knowledge sources with this Streamlit chatbot powered by Groq LLM and LangChain for accurate and quick information retrieval.

search agent information-retrieval duckduckgo question-answering arxiv agent-based agents multi-agent-systems web-search chabot streamlit ai-agent large-language-models langchain chainlit retrieval-augmented-generation literalai

Updated Dec 15, 2025
Python

Naxh156 / claude-agent-sdk-python

🤖 Build and interact with Claude Agent using this Python SDK for seamless integration and efficient asynchronous querying.

python agent json information-retrieval streaming mcp developer-tools agents web-crawling rag exa-api agentic-workflow agentic-ai fastmcp research-agents claude-code exa-research exa-code

Updated Dec 15, 2025
Python

Omg1221 / search_evals

🔍 Evaluate web search APIs with our framework, testing accuracy and relevance across multiple AI agents and benchmarks for better information retrieval.

search machine-learning natural-language-processing information-retrieval evaluations performance-metrics data-analysis software-development data-collection user-feedback automated-testing text-search algorithm-evaluation search-optimization results-visualization

Updated Dec 15, 2025
Python

reformetech / haystack

🛠️ Build powerful search systems effortlessly with Haystack, a framework for developing end-to-end question answering and search applications.

nlp agent information-retrieval ai text-generation haystack gemini summarization semantic-search ai-tools openhaystack large-language-models llm generative-ai generative-qa retrieval-augmented-generation haystack-ai agentic-ai

Updated Dec 15, 2025
Python

NoliNobdon / TriStage-RAG

🎯 Optimize retrieval with TriStage-RAG, a 3-stage pipeline that enhances document discovery while overcoming the limits of single-vector embeddings.

open-source machine-learning natural-language-processing information-retrieval ai deep-learning text-generation cloud-computing data-retrieval document-processing conversational-ai rag optimized-performance retrieval-augmented-generation tristage

Updated Dec 15, 2025
Python

PaidXSmall / RAG-QA-demo

📄 Create a local, free Retrieval-Augmented Q&A system to easily extract answers from your personal documents in minutes.

python nlp information-retrieval jupyter-notebook embeddings tf-idf faiss rag vector-search retrival streamlit sentence-transformers vector-embeddings llm hybrid-retrieval retrieval-augmented-generation rag-pipeline document-question-answering

Updated Dec 15, 2025
Python

salems-3Dpov / ai-agent-pipeline

🐙 AI Agent Pipeline routes queries by intent to docs, weather, or chat, with LangGraph, ChromaDB, and LangSmith for modular, observable workflows across CLI and UI.