Mini RAG Assistant (Offline Retrieval System)

A minimal Retrieval-Augmented Generation (RAG)-style pipeline implemented in Python using TF-IDF vectorization and cosine similarity.

This project demonstrates the core concept behind retrieval-based AI systems without relying on external APIs.

🚀 Overview

This project simulates the retrieval component of a RAG system:

Load external knowledge base
Split knowledge into searchable units (sentences)
Convert text into vector representations (TF-IDF)
Convert user query into vector
Compute cosine similarity
Retrieve Top-K most relevant sentences
Build the response from the retrieved sentences

No LLM is used, so the retrieved sentences are returned directly, but the stages mirror the retrieval half of a real-world RAG pipeline.

🧠 Architecture

Knowledge Base
→ Sentence Splitting
→ TF-IDF Vectorization
→ Cosine Similarity Search
→ Top-K Retrieval
→ Response Generation

This is the retrieval layer that a full RAG system places in front of its language model.
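
The first two stages (loading the knowledge base and splitting it into sentences) could look roughly like the sketch below. The function names and the regex-based splitting rule are assumptions made for illustration; main.py may split sentences differently.

```python
import re
from pathlib import Path


def load_knowledge(path: str = "knowledge.txt") -> str:
    """Read the whole knowledge base file as one string."""
    return Path(path).read_text(encoding="utf-8")


def split_sentences(text: str) -> list[str]:
    """Split text after '.', '!' or '?' when followed by whitespace.

    This is a naive rule (hypothetical, not taken from main.py): it will
    also split after abbreviations such as "e.g." if a space follows.
    """
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [part.strip() for part in parts if part.strip()]


sample = (
    "RAG combines retrieval with generation. "
    "It grounds answers in documents! Does it need an LLM?"
)
sentences = split_sentences(sample)
for sentence in sentences:
    print(sentence)
```

Each returned sentence becomes one searchable unit for the vectorization stage.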

🛠 Tech Stack

Python
NumPy
Scikit-learn
TF-IDF Vectorizer
Cosine Similarity

📂 Project Structure

mini-rag-assistant/
│
├── main.py
├── knowledge.txt
├── requirements.txt
└── README.md

▶️ Run Locally

Install dependencies:

python3 -m pip install -r requirements.txt

Run the assistant:

python3 main.py

Type a question and press Enter. Type exit to quit.
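
The question loop described above might be structured like this minimal sketch. `chat_loop` and `respond` are hypothetical names, not taken from main.py; the loop takes its questions from any iterable so it can be exercised without a terminal.

```python
from typing import Callable, Iterable, Iterator


def chat_loop(
    questions: Iterable[str], respond: Callable[[str], str]
) -> Iterator[str]:
    """Yield one reply per question until the user types 'exit'."""
    for raw in questions:
        question = raw.strip()
        if question.lower() == "exit":
            break
        if not question:
            continue  # ignore blank lines
        yield respond(question)


def echo(question: str) -> str:
    """Stand-in for the real retrieval step (assumption for this demo)."""
    return f"You asked: {question}"


replies = list(chat_loop(["What is RAG?", "  ", "exit", "ignored"], echo))
print(replies)
```

In an interactive session, the `questions` argument could be fed by a generator that wraps `input()`.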

💬 Example Questions

What is RAG?
How does retrieval work?
What are embeddings?
What problem does RAG solve?
What are RAG use cases?
How does similarity search work?

🔍 How It Works (Technical Explanation)

The knowledge base is split into sentences.
Each sentence is converted into a TF-IDF vector.
The user query is vectorized using the same vocabulary.
Cosine similarity measures relevance between query and sentences.
The Top-2 most similar sentences are returned as the response.

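This mirrors the flow of embedding-based retrieval in real RAG systems, with sparse TF-IDF vectors standing in for dense embeddings. Matching is lexical rather than semantic, so a query retrieves a sentence only when the two share terms.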
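
The steps above can be sketched with scikit-learn as follows. This is an illustrative sketch, not the code in main.py: the `retrieve` function, its `top_k` parameter, and the sample sentences are assumptions, while `TfidfVectorizer` and `cosine_similarity` are the standard scikit-learn utilities.

```python
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity


def retrieve(query: str, sentences: list[str], top_k: int = 2):
    """Return the top_k (sentence, score) pairs, best match first."""
    vectorizer = TfidfVectorizer()
    # Learn the vocabulary and IDF weights from the knowledge base.
    sentence_vectors = vectorizer.fit_transform(sentences)
    # Reuse the same fitted vocabulary for the query.
    query_vector = vectorizer.transform([query])
    # One similarity score per sentence.
    scores = cosine_similarity(query_vector, sentence_vectors).ravel()
    # Sort indices by descending score and keep the first top_k.
    top_indices = np.argsort(scores)[::-1][:top_k]
    return [(sentences[i], float(scores[i])) for i in top_indices]


sentences = [
    "RAG combines retrieval with text generation.",
    "Cosine similarity compares the angle between two vectors.",
    "TF-IDF weights terms by how rare they are across documents.",
]
results = retrieve("How does cosine similarity work?", sentences)
for sentence, score in results:
    print(f"{score:.3f}  {sentence}")
```

Query words that never appear in the knowledge base are simply dropped by `transform`, which is why a question with no overlapping terms scores zero against every sentence.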

📈 Future Improvements

Replace TF-IDF with embedding models
Add a vector index or database (FAISS / Pinecone)
Integrate a real LLM for answer synthesis
Add similarity threshold filtering
Add an API interface
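
As one example from this list, similarity threshold filtering could be added with a few lines. The function names, the 0.1 cutoff, and the fallback message are all assumptions chosen for illustration; a suitable cutoff would need tuning against the actual knowledge base.

```python
def filter_by_threshold(scored, min_score=0.1):
    """Keep only (sentence, score) pairs whose score meets the cutoff."""
    return [(sentence, score) for sentence, score in scored if score >= min_score]


def build_answer(scored, min_score=0.1):
    """Join the surviving sentences, or admit that nothing matched."""
    kept = filter_by_threshold(scored, min_score)
    if not kept:
        return "I could not find anything relevant in the knowledge base."
    return " ".join(sentence for sentence, _ in kept)


scored = [
    ("Cosine similarity compares two vectors.", 0.41),
    ("TF-IDF weights rare terms more heavily.", 0.03),
]
print(build_answer(scored))
print(build_answer([("Unrelated sentence.", 0.0)]))
```

Without a cutoff, a Top-K retriever always returns K sentences, even when every score is zero.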

🎯 Purpose

Built as a portfolio project to demonstrate understanding of:

Information retrieval
Vector similarity
Retrieval-Augmented Generation concepts
NLP preprocessing
Modular AI pipeline design
