0% found this document useful (0 votes)

26 views11 pages

Natural Language Processing

The document discusses Retrieval-Augmented Generation (RAG), which combines retrieval systems with generative AI models to produce accurate responses, addressing issues like hallucination and data staleness. It outlines key components such as the retriever and generator, various RAG workflows (Standard, Corrective, Speculative, and Agentic), and compares RAG with fine-tuning methods. The document emphasizes the importance of RAG in applications requiring up-to-date and domain-specific information.

Uploaded by

fayazullah775

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

26 views11 pages

Natural Language Processing

Uploaded by

fayazullah775

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

19/06/2025

Natural Language Processing

Spring 2025
Prof. Dr. M. Fasih Uddin Butt

Building Generative AI Applications

To Your Needs

➢ What is RAG?
➢ Why we need RAG
➢ Important Terminologies in RAG (Key Components)
➢ How RAG works ? (WorkFlow in RAG)
➢ Types
➢ Comparison
➢ Fine Tuning (Alternative Of RAG)

1
19/06/2025

What is RAG?
➢ RAG stands for Retrieval-Augmented Generation.
➢ It combines retrieval systems with Generative AI
models to produce accurate and relevant responses.
➢ It is particularly useful for applications that require
up-to-date, fact-based, or domain-specific
responses.

Why we need RAG ?

➢ Halucination (Incorrect Information), when an AI model

generates incorrect or misleading results. This can happen
in any type of AI model, including natural language
processing (NLP) models and computer vision models.
➢ Data Staleness The model's inability to provide updated
information because it was trained on a fixed dataset that
does not include newer data.

2
19/06/2025

Important Terminologies in RAG (Key Components)

Retriever:
(But there is something which is done before, Let’s See that First)
➢ Searches for relevant information from external knowledge bases or
datasets.

Generator:

➢ Uses the retrieved information to create coherent and accurate

responses.

Feedback Loop: (Optional)

➢ Optional mechanism to refine outputs iteratively.

Preprocessing Before Retrieval

1. Chunking
● What it is:
Breaking large documents or datasets into smaller, manageable
pieces (chunks).
● Why it’s needed:
○ Large text blocks are difficult to process efficiently.
○ Helps maintain context and relevance in retrieval.
● Example:
○ A 10,000-word article might be divided into 500-word chunks.

3
19/06/2025

2. Tokenization
● What it is:
Splitting text into smaller units called tokens (e.g., words, phrases,
or characters).
● Why it’s needed:
○ Allows text to be processed numerically for embedding and search.
○ Prepares the text for the embedding model.
● Example:
○ "Retrieval-Augmented Generation" →
["Retrieval", "-", "Augmented", "Generation"]

3. Embedding
● What it is:
Converting text chunks into dense numerical vectors using pre-
trained models (e.g., Sentence Transformers, OpenAI Embedding
API).
● Why it’s needed:
○ Vectors represent semantic meaning, enabling efficient similarity
search.
○ These embeddings capture the context of the text.
● Where it's stored:
○ Store embeddings in vector databases (e.g., FAISS, Pinecone, Weaviate,
ChromaDB).
○ These databases allow quick and efficient similarity searches.

4
19/06/2025

Important Terminologies in RAG (Key Components)

Retriever

● The retriever is responsible for finding the most relevant information

from an external knowledge base, database, or document store.
● It uses methods like vector similarity search (e.g., FAISS,
ElasticSearch) or traditional keyword matching to locate data
relevant to the input query.
● Why it’s important:
○ Ensures the generative model has access to accurate and
contextually appropriate information to base its response.

Important Terminologies in RAG (Key Components)

Generator

● The generator is a pre-trained language model (e.g., GPT, BERT, T5

or from Groq) that creates responses by incorporating the retrieved
information.
● It synthesizes retrieved data and transforms it into human-like,
coherent text.
● Why it’s important:
○ Acts as the "voice" of the system, converting raw retrieved data
into usable, conversational, or actionable outputs.

5
19/06/2025

Important Terminologies in RAG (Key Components)

Feedback Loop (Optional)

● A mechanism to iteratively refine the output by re-querying the

retriever or adjusting the generator’s response based on user
feedback or model evaluation.
● Why it’s important:
○ Helps improve the accuracy and relevance of responses over
time.
○ Critical for applications requiring high precision, like healthcare
or legal advisory systems.

How RAG works

( WorkFlow Diagram )

6
19/06/2025

Standard RAG

➢ Combines retrieval with generation in a straightforward manner.

Workflow:

1. Input query.
2. Retrieve relevant documents.
3. Generate response using retrieved documents.

Use Case:

● Question answering using enterprise knowledge bases

Corrective RAG

➢ Enhances response accuracy by correcting errors in real-time.

Workflow:

1. Generate an initial response.

2. Identify errors using retrieval.
3. Correct errors based on retrieved facts.

Use Case:

● Customer support chatbots with high accuracy requirements.

7
19/06/2025

Corrective RAG

Speculative RAG

➢ Prioritizes efficiency by speculating which documents are relevant

without full retrieval.
Workflow:

1. Model predicts relevance without actual retrieval.

2. Generates speculative output.

Advantages:
● Faster responses at the cost of potential accuracy.

Use Case:
● Real-time conversational AI with high-speed requirements.

8
19/06/2025

Speculative RAG

Agentic RAG
➢ Adds decision-making capabilities to the RAG model.

Workflow:

1. Retrieve information.
2. Evaluate context and goals.
3. Generate adaptive and strategic responses.

Use Case:

● Virtual assistants for decision-making tasks.

9
19/06/2025

Agentic RAG

Comparison

Technique Focus Strengths Weaknesses

Standard RAG Simplicity Easy to implement Limited

adaptability

Corrective Accuracy Error correction in Slower responses

RAG real-time
Speculative Efficiency Faster responses Risk of
RAG inaccuracies

Agentic RAG Decision- Strategic outputs Higher

making complexity

10
19/06/2025

Fine Tuning versus RAG

Aspect Fine Tuning RAG

Definition Modifies a pre-trained model by

training it on new data.
Combines a pre-trained model with external
knowledge retrieval

Purpose Customizes the model for a specific

task
Enhances responses dynamically with
external information.

Data Requires training on task-specific

data.
Uses external data stored in a vector
database or index.
Dependency
Flexibility Requires retraining for updates or
new data.
Dynamically updates responses without
retraining.

Computational High, due to additional training

requirements, High GPU, CPU req.
Low, as it uses pre-trained models with
retrieval.
Cost

Example Use Creating a specialized application for

a specific domain e.g (health care)
Answering questions about frequently
updated knowledge (e.g., news, chatbot).
Case

Developing Retrieval Augmented Generation (RAG) Based LLM Systems From Pdfs - An Expert Report
No ratings yet
Developing Retrieval Augmented Generation (RAG) Based LLM Systems From Pdfs - An Expert Report
36 pages
What Is Retrieval-Augmented Generation (RAG)
No ratings yet
What Is Retrieval-Augmented Generation (RAG)
12 pages
Module 4 - RAG (Retrieval Augmented Generation) - PEC GenAI Course
No ratings yet
Module 4 - RAG (Retrieval Augmented Generation) - PEC GenAI Course
23 pages
Transcript For Explaining Retrieval-Augmented Generation (RAG) To Colleagues
No ratings yet
Transcript For Explaining Retrieval-Augmented Generation (RAG) To Colleagues
6 pages
Minor Proj
No ratings yet
Minor Proj
15 pages
RAG (Generative AI) - A "Rags To Riches" Moment For Artificial Intelligence - by Kanishk Khatter - Medium
No ratings yet
RAG (Generative AI) - A "Rags To Riches" Moment For Artificial Intelligence - by Kanishk Khatter - Medium
12 pages
Challenge
No ratings yet
Challenge
8 pages
RAG - Genai
No ratings yet
RAG - Genai
11 pages
Chapters
No ratings yet
Chapters
7 pages
Tyjt
No ratings yet
Tyjt
2 pages
Retrieval Augmented Generation (RAG) For Everyone
No ratings yet
Retrieval Augmented Generation (RAG) For Everyone
57 pages
RAG
No ratings yet
RAG
4 pages
Title
No ratings yet
Title
2 pages
A Comprehensive Guide To Building Agentic RAG Systems With LangGraph
No ratings yet
A Comprehensive Guide To Building Agentic RAG Systems With LangGraph
23 pages
RAG Seminar
No ratings yet
RAG Seminar
11 pages
Rag
No ratings yet
Rag
10 pages
RAG for NLP Experts
No ratings yet
RAG for NLP Experts
2 pages
Document 2
No ratings yet
Document 2
12 pages
Advanced Gen-AI Development
No ratings yet
Advanced Gen-AI Development
57 pages
RAG Architecture
100% (10)
RAG Architecture
52 pages
RAG and Vector Database Guide
No ratings yet
RAG and Vector Database Guide
18 pages
(Retrieval Augmented Generation) : by Uttam Grade
No ratings yet
(Retrieval Augmented Generation) : by Uttam Grade
6 pages
Learning: Gen Ai
No ratings yet
Learning: Gen Ai
6 pages
Understanding RAG AI
No ratings yet
Understanding RAG AI
6 pages
Building Blocks of Rag Ebook Final
100% (2)
Building Blocks of Rag Ebook Final
9 pages
Privacy First RAG Closed-Loop LLMs For Industrial Data Security
No ratings yet
Privacy First RAG Closed-Loop LLMs For Industrial Data Security
12 pages
RAG Deep-Dive Research Report
No ratings yet
RAG Deep-Dive Research Report
46 pages
Advanced RAG Architecture. What Is RAG - Advanced Topics & - by Uğur Özker - Medium
No ratings yet
Advanced RAG Architecture. What Is RAG - Advanced Topics & - by Uğur Özker - Medium
21 pages
Blue Futuristic Artificial Intelligence Presentation
No ratings yet
Blue Futuristic Artificial Intelligence Presentation
8 pages
The Ultimate Guide To GenAI RAG: Enhancing AI With Real-Time Data Retrieval
No ratings yet
The Ultimate Guide To GenAI RAG: Enhancing AI With Real-Time Data Retrieval
12 pages
WWW Oracle Com in Artificial-Intelligence Generative-Ai Retrieval-Augmented-Generation-Rag
No ratings yet
WWW Oracle Com in Artificial-Intelligence Generative-Ai Retrieval-Augmented-Generation-Rag
7 pages
RAG Understanding PDF
No ratings yet
RAG Understanding PDF
12 pages
A Powerful Technique For Improved Text Generation and Efficiency
No ratings yet
A Powerful Technique For Improved Text Generation and Efficiency
14 pages
RAG - The Future of LLMs - LinkedIn
No ratings yet
RAG - The Future of LLMs - LinkedIn
7 pages
Retrieval Augmented Generation - A Simple Introduction
No ratings yet
Retrieval Augmented Generation - A Simple Introduction
82 pages
Advanced RAG Techniques for LLM Apps
No ratings yet
Advanced RAG Techniques for LLM Apps
54 pages
???: ??? ??? ?? ??????? ?? ?????????!
No ratings yet
???: ??? ??? ?? ??????? ?? ?????????!
6 pages
RAG Developers Stack
No ratings yet
RAG Developers Stack
13 pages
Understanding Retrieval-Augmented Generation (RAG)
No ratings yet
Understanding Retrieval-Augmented Generation (RAG)
12 pages
GenAI PDF
No ratings yet
GenAI PDF
34 pages
Understanding RAG
No ratings yet
Understanding RAG
16 pages
RAG vs GPT: A Comprehensive Guide
No ratings yet
RAG vs GPT: A Comprehensive Guide
8 pages
RAG - A Simple Introduction
100% (6)
RAG - A Simple Introduction
75 pages
7 Agentic RAG System Architectures To Build AI Agents
100% (2)
7 Agentic RAG System Architectures To Build AI Agents
12 pages
LangChain & RAG - U1
No ratings yet
LangChain & RAG - U1
32 pages
RAG and Its Variants - Graph RAG Light RAG and Agentic RAG
No ratings yet
RAG and Its Variants - Graph RAG Light RAG and Agentic RAG
16 pages
RAG QUestions
No ratings yet
RAG QUestions
3 pages
26 RAG Concepts in Alphabetical Order
100% (1)
26 RAG Concepts in Alphabetical Order
15 pages
What Is Retrieval-Augmented Generation, Aka RAG?: Rick Merritt
No ratings yet
What Is Retrieval-Augmented Generation, Aka RAG?: Rick Merritt
9 pages
RAG Slide ENG
No ratings yet
RAG Slide ENG
41 pages
Rag PDF
No ratings yet
Rag PDF
10 pages
RAG Research Document Abhishek
No ratings yet
RAG Research Document Abhishek
2 pages
Github - Blog - Ai and ML - Generative Ai - What Is Retrieval Augmented Generation and What Does It Do For Generative Ai
No ratings yet
Github - Blog - Ai and ML - Generative Ai - What Is Retrieval Augmented Generation and What Does It Do For Generative Ai
14 pages
The Complete Guide To RAG
No ratings yet
The Complete Guide To RAG
27 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
20 pages
Retrieval Augmented Generation (Rag) For Precision Language Models
No ratings yet
Retrieval Augmented Generation (Rag) For Precision Language Models
10 pages
RAG Retrieval-Augmented Generation
No ratings yet
RAG Retrieval-Augmented Generation
12 pages
RAG Detailed Overview
No ratings yet
RAG Detailed Overview
3 pages
Agentic RAG: Survey on AI Advancements
No ratings yet
Agentic RAG: Survey on AI Advancements
39 pages
SIP-106 GHG Emissions Inventory For Asphalt Mix Production in The US - NAPA June 2022
No ratings yet
SIP-106 GHG Emissions Inventory For Asphalt Mix Production in The US - NAPA June 2022
34 pages
Quadratic Equation (Short Notes)
No ratings yet
Quadratic Equation (Short Notes)
3 pages
ABB Test Unit
No ratings yet
ABB Test Unit
21 pages
Factor Affecting Financial Performance of Commercial Bak
No ratings yet
Factor Affecting Financial Performance of Commercial Bak
11 pages
Allied - 3261 - Basic Anatomy (Including Histology) - Anttp (December-2020) - December-2020 (Oct-20)
No ratings yet
Allied - 3261 - Basic Anatomy (Including Histology) - Anttp (December-2020) - December-2020 (Oct-20)
2 pages
21 Greatest Athletes of The 21st Century (So Far) - FOX Sports
No ratings yet
21 Greatest Athletes of The 21st Century (So Far) - FOX Sports
22 pages
Business Negotiation Skills
No ratings yet
Business Negotiation Skills
32 pages
Jurnah Hubungan Kerja
No ratings yet
Jurnah Hubungan Kerja
11 pages
As Media Studies Coursework Blog
100% (2)
As Media Studies Coursework Blog
6 pages
Enhanced Retro Pay
100% (1)
Enhanced Retro Pay
22 pages
Phs CL - Notes
No ratings yet
Phs CL - Notes
15 pages
04-Ceragon-IP-10G Radio Configuration PDF
No ratings yet
04-Ceragon-IP-10G Radio Configuration PDF
16 pages
Multimodal Analysis
No ratings yet
Multimodal Analysis
26 pages
Palombini - 1993 - Machine Songs V Pierre Schaeffer From Research I
No ratings yet
Palombini - 1993 - Machine Songs V Pierre Schaeffer From Research I
7 pages
Civil Cad 2010 v-1.0 Eng
No ratings yet
Civil Cad 2010 v-1.0 Eng
11 pages
OTTO Primer 1216: Stone & Metal Adhesion
No ratings yet
OTTO Primer 1216: Stone & Metal Adhesion
2 pages
What Is Crypto Fintechzoom
No ratings yet
What Is Crypto Fintechzoom
2 pages
Family Presence During Resuscitation 1-S2.0-S0099176721001902-Main
No ratings yet
Family Presence During Resuscitation 1-S2.0-S0099176721001902-Main
4 pages
TOFD vs Radiography for Steel Welds
No ratings yet
TOFD vs Radiography for Steel Welds
6 pages
SSPC QP 5 - 2012
No ratings yet
SSPC QP 5 - 2012
10 pages
Member List - PEPC
No ratings yet
Member List - PEPC
13 pages
EC325 Week 2 Problem Set
No ratings yet
EC325 Week 2 Problem Set
1 page
Northbayou March 2024 Updated PL
No ratings yet
Northbayou March 2024 Updated PL
3 pages
Diaphargm Wall Design
80% (5)
Diaphargm Wall Design
24 pages
012 Tacr 01a PDF
No ratings yet
012 Tacr 01a PDF
399 pages
RRD24FR007 - Pigg.02 02 24 - Redacted Rel
No ratings yet
RRD24FR007 - Pigg.02 02 24 - Redacted Rel
35 pages
Forests 16 00164
No ratings yet
Forests 16 00164
32 pages
Political Philosophy of Thomas Hobbes & State of Nature
100% (2)
Political Philosophy of Thomas Hobbes & State of Nature
22 pages
Collaborative Feedback Form
No ratings yet
Collaborative Feedback Form
2 pages
Deck Slab Reinforcement Details: Nagpur Metro Rail Project
No ratings yet
Deck Slab Reinforcement Details: Nagpur Metro Rail Project
1 page

Natural Language Processing

Uploaded by

Natural Language Processing

Uploaded by

19/06/2025

Natural Language Processing

Building Generative AI Applications

Why we need RAG ?

➢ Halucination (Incorrect Information), when an AI model

Important Terminologies in RAG (Key Components)

➢ Uses the retrieved information to create coherent and accurate

Feedback Loop: (Optional)

➢ Optional mechanism to refine outputs iteratively.

Preprocessing Before Retrieval

Important Terminologies in RAG (Key Components)

● The retriever is responsible for finding the most relevant information

Important Terminologies in RAG (Key Components)

● The generator is a pre-trained language model (e.g., GPT, BERT, T5

Important Terminologies in RAG (Key Components)

Feedback Loop (Optional)

● A mechanism to iteratively refine the output by re-querying the

How RAG works

➢ Combines retrieval with generation in a straightforward manner.

● Question answering using enterprise knowledge bases

➢ Enhances response accuracy by correcting errors in real-time.

1. Generate an initial response.

● Customer support chatbots with high accuracy requirements.

➢ Prioritizes efficiency by speculating which documents are relevant

1. Model predicts relevance without actual retrieval.

● Virtual assistants for decision-making tasks.

Technique Focus Strengths Weaknesses

Standard RAG Simplicity Easy to implement Limited

Corrective Accuracy Error correction in Slower responses

Agentic RAG Decision- Strategic outputs Higher

Fine Tuning versus RAG

Definition Modifies a pre-trained model by

Purpose Customizes the model for a specific

Data Requires training on task-specific

Computational High, due to additional training

Example Use Creating a specialized application for

You might also like