Instance-Based Learning
AIML/ BDA
Topics covered
• K-Nearest Neighbors (K-NN) concept
• Distance metrics
• K-NN for classification
Machine Learning Classification
Instance-Based Learning
• Training instances are stored in memory.
• For a test (unseen) instance, compare it with the instances seen in training and give the result.
• Also known as memory-based learning.
• IBL algorithms build local rather than global approximations.
Instance-Based Learning
• IBL methods learn by simply storing the training data.
• When a new query instance is encountered, a set of similar
  related instances is retrieved from memory and used to classify
  the new query instance.
• For each distinct query, IBL constructs a different local
  approximation to the target function.
• These are local rather than global approximations.
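The store-then-compare idea above can be shown as a minimal sketch; the function name `knn_classify` and the use of Euclidean distance with majority voting are illustrative assumptions, not a specific library API.

```python
import math
from collections import Counter

def knn_classify(train, query, k=3):
    """Classify a query point by majority vote among its k nearest
    stored training instances (Euclidean distance).

    train: list of ((feature, ...), label) pairs - the "memorized" data.
    """
    # "Training" is just storage; all work happens at query time:
    # compute the distance from the query to every stored instance.
    ranked = sorted((math.dist(x, query), label) for x, label in train)
    # Take the labels of the k closest instances.
    votes = [label for _, label in ranked[:k]]
    # Majority vote decides the class of the query.
    return Counter(votes).most_common(1)[0][0]
```

Note that each query builds its own local neighbourhood, which is exactly the "local rather than global approximation" property.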
Advantages and Disadvantages of IBL
Advantage:
• Suitable for problems with very complex target functions.
Disadvantages:
• The cost of classifying new instances is high.
• All attributes of an instance are considered, so the cost grows as the number of dimensions increases.
Example
• A company produces tissues (used by biological labs).
• The company's objective is to predict how well its products are
  accepted by its clients.
• It conducted a survey with its clients to find the acceptance of
  the product. Quality is based on the acid durability and acid strength parameters.
Example
• The data set pertains to a company that produces tissues for use in biological
  labs.
      Name        Acid Durability    Acid Strength    Acceptability
      Type-1             7                 7               Low
      Type-2             7                 4               Low
      Type-3             3                 4               High
      Type-4             1                 4               High

      Test data: Type-5    Acid Durability = 3    Acid Strength = 7
      • Build a classifier to predict the class of the new type of tissue.
• Apply the Euclidean distance measure to find the distance of each instance from
  the new data point, Type-5.

       Name     Acid Durability   Acid Strength   Distance                    Neighbour Rank
       Type-1          7                7         √((7−3)² + (7−7)²) = 4            3
       Type-2          7                4         √((7−3)² + (4−7)²) = 5            4
       Type-3          3                4         √((3−3)² + (4−7)²) = 3            1
       Type-4          1                4         √((1−3)² + (4−7)²) ≈ 3.6          2
 • If k = 1: the single nearest neighbour is Type-3 = High, so the new type is classified as High.
 • If k = 2: the two nearest neighbours are Type-3 and Type-4, both High, so the new type is High.
 • If k = 3: the three nearest neighbours are Type-3 = High, Type-4 = High and Type-1 = Low; High
   has the majority, so the new type is classified as High.
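The worked table and votes above can be reproduced in a short script; the variable names are illustrative, and the data is exactly the tissue table from the example.

```python
import math
from collections import Counter

# Training instances from the table: (acid durability, acid strength) -> acceptability.
data = [
    ("Type-1", (7, 7), "Low"),
    ("Type-2", (7, 4), "Low"),
    ("Type-3", (3, 4), "High"),
    ("Type-4", (1, 4), "High"),
]
query = (3, 7)  # Type-5

# Rank the training instances by Euclidean distance to the query.
ranked = sorted(data, key=lambda row: math.dist(row[1], query))

# Majority vote among the k nearest neighbours for k = 1, 2, 3.
predictions = {}
for k in (1, 2, 3):
    votes = Counter(label for _, _, label in ranked[:k])
    predictions[k] = votes.most_common(1)[0][0]
```

Running this ranks Type-3 first (distance 3) and predicts High for k = 1, 2 and 3, matching the table.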
Example 2
• Assume a Boolean target function and a 2-dimensional instance space
  (shown in the figure).
• Determine how the k-Nearest Neighbour learning algorithm would
  classify the new instance xq for k = 1, 3, 5 and 7.
• The + and − signs in the instance space refer to positive and
  negative examples respectively.

     Distance from query instance    Classification
             1.00                          +
             1.35                          −
             1.40                          −
             1.60                          −
             1.90                          +
             2.00                          +
             2.20                          −
             2.40                          +
             2.80                          −
[Figure: 2-dimensional instance space with + and − examples scattered around the query point xq]

     k       Classification of xq
     1-NN            +
     3-NN            −
     5-NN            −
     7-NN            −
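The k-NN results above follow from a majority vote over the given distances; a minimal sketch (variable names are illustrative):

```python
from collections import Counter

# (distance from xq, class) pairs taken from the example's table.
neighbours = sorted([
    (1.00, "+"), (1.35, "-"), (1.40, "-"), (1.60, "-"), (1.90, "+"),
    (2.00, "+"), (2.20, "-"), (2.40, "+"), (2.80, "-"),
])

# Majority vote among the k nearest neighbours for each k.
results = {}
for k in (1, 3, 5, 7):
    votes = Counter(label for _, label in neighbours[:k])
    results[k] = votes.most_common(1)[0][0]
```

For k = 1 only the + at distance 1.00 votes; for k = 3, 5 and 7 the − examples hold the majority, so xq flips to −.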
Selection of the K value?
• Try many different values of K and see what works best for your
  problem.
• K should be an odd number (3, 5, 7, 9, etc.) so that majority votes
  cannot tie in binary classification.
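One common way to "try many values and see what works best" is to score each candidate K on held-out data; the sketch below uses leave-one-out evaluation with a toy dataset (the helper names `knn_predict`, `loo_accuracy` and the data values are illustrative assumptions).

```python
import math
from collections import Counter

def knn_predict(train, query, k):
    """Majority-vote prediction from the k nearest training instances."""
    ranked = sorted(train, key=lambda row: math.dist(row[0], query))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]

def loo_accuracy(data, k):
    """Leave-one-out accuracy: classify each point using all the others."""
    hits = sum(
        knn_predict(data[:i] + data[i + 1:], x, k) == y
        for i, (x, y) in enumerate(data)
    )
    return hits / len(data)

# Toy data: two well-separated clusters, labelled A and B.
data = [((0, 0), "A"), ((0, 1), "A"), ((1, 0), "A"),
        ((5, 5), "B"), ((5, 6), "B"), ((6, 5), "B")]

# Score each odd candidate K and keep the best-scoring one.
best_k = max((1, 3, 5), key=lambda k: loo_accuracy(data, k))
```

On this toy data k = 5 fails (each left-out point sees more opposite-cluster neighbours than same-cluster ones), illustrating why K must be tuned rather than fixed.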
How do the efficiency and accuracy of k-NN search
change as k increases?
• With a sufficiently large number of training examples, accuracy
  should increase.
• The computational complexity of KNN grows with the size of the
  training dataset.
     • The time to compute a prediction also increases.
     • In that sense, KNN becomes less efficient.
• Why is KNN a lazy learning algorithm?
• It builds no model during training.
• It simply "memorizes" the training dataset.
• In contrast, a decision tree (DT) algorithm learns its model during training time.
• Why is KNN a non-parametric algorithm?
• It makes no assumptions about the functional form of the
  problem being solved.
• Is KNN a supervised or unsupervised learning algorithm?
• KNN is a supervised learning algorithm: it uses labeled data for
  classification problems.
• Note: K-means is an unsupervised learning algorithm used for
  clustering problems.
Thank you