INSTANCE-BASED LEARNING
k-NEAREST NEIGHBOR LEARNING
The k-NEAREST NEIGHBOR algorithm is the most basic instance-based method.
The algorithm assumes all instances correspond to points in the n-dimensional space $\mathbb{R}^n$.
The nearest neighbors of an instance are defined in terms of the
standard Euclidean distance.
Let an arbitrary instance $x$ be described by the feature vector
$$\langle a_1(x), a_2(x), \ldots, a_n(x) \rangle$$
where $a_r(x)$ denotes the value of the $r$th attribute of instance $x$.
The distance between two instances $x_i$ and $x_j$ is defined to be $d(x_i, x_j)$, where
$$d(x_i, x_j) \equiv \sqrt{\sum_{r=1}^{n} \bigl(a_r(x_i) - a_r(x_j)\bigr)^2}$$
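A minimal sketch of this distance computation in Python (the function name and the use of plain lists for instances are illustrative assumptions, not from the slides):

    import math

    def euclidean_distance(xi, xj):
        """Standard Euclidean distance between two instances, each given
        as a sequence of n attribute values a_r(x)."""
        return math.sqrt(sum((a - b) ** 2 for a, b in zip(xi, xj)))

    # Example: distance between two 3-dimensional instances
    print(euclidean_distance([1.0, 2.0, 3.0], [4.0, 6.0, 3.0]))  # 5.0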
In nearest-neighbor learning the target function may be either discrete-valued or real-valued.
Discrete-Valued:
The target function is of the form $f : \mathbb{R}^n \rightarrow V$, where $V$ is the finite set $\{v_1, \ldots, v_s\}$.
Let $x_q$ be the query instance.
The value $\hat{f}(x_q)$ returned by the algorithm as its estimate of $f(x_q)$ is just the most common value of $f$ among the $k$ training examples nearest to $x_q$.
If we choose $k = 1$, the 1-NEAREST NEIGHBOR algorithm assigns to $\hat{f}(x_q)$ the value $f(x_i)$, where $x_i$ is the training instance nearest to $x_q$.
For larger values of k, the algorithm assigns the most common value
among the k nearest training examples.
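A rough sketch of this discrete-valued algorithm (the layout of the data as (instance, label) pairs and the name knn_classify are assumptions made for illustration):

    import math
    from collections import Counter

    def knn_classify(training_data, xq, k):
        """Return the most common label f(x) among the k training examples
        nearest to the query instance xq.
        training_data: list of (instance, label) pairs."""
        dist = lambda a, b: math.sqrt(sum((u - v) ** 2 for u, v in zip(a, b)))
        neighbors = sorted(training_data, key=lambda pair: dist(pair[0], xq))[:k]
        return Counter(label for _, label in neighbors).most_common(1)[0][0]

    # Toy example with a Boolean-valued target ('+' / '-')
    data = [([1, 1], '+'), ([2, 1], '+'), ([5, 5], '-'), ([6, 5], '-'), ([5, 6], '-')]
    print(knn_classify(data, [1.5, 1.0], k=3))  # '+'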
The figure shows the operation of the k-NEAREST NEIGHBOR algorithm for the case where the instances are points in a two-dimensional space and where the target function is Boolean-valued.
The positive and negative training examples are shown as '+' and '-' respectively.
A query point $x_q$ is shown as well.
The 1-NEAREST NEIGHBOR algorithm classifies $x_q$ as a positive example in this figure, whereas the 5-NEAREST NEIGHBOR algorithm classifies it as a negative example.
The k-NEAREST NEIGHBOR algorithm never forms an explicit general hypothesis $\hat{f}$ for the target function $f$.
It simply computes the classification of each new query instance as
needed.
The diagram shows the shape of the decision surface induced by 1-
NEAREST NEIGHBOR over the entire instance space.
The decision surface is a combination of convex polyhedra surrounding each of the training examples.
For every training example, the polyhedron indicates the set of query
points whose classification will be completely determined by that training
example.
Query points outside the polyhedron are closer to some other training example.
This kind of diagram is often called the Voronoi diagram of the set of training examples.
Continuous-Valued:
The k-NEAREST NEIGHBOR algorithm can be adapted to approximating a continuous-valued target function.
We calculate the mean value of the k nearest training examples rather than their most common value.
We approximate the real-valued target function $f : \mathbb{R}^n \rightarrow \mathbb{R}$ by:
$$\hat{f}(x_q) \leftarrow \frac{\sum_{i=1}^{k} f(x_i)}{k}$$
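A minimal sketch of this continuous-valued variant, using the same illustrative data layout as the classification sketch above (names are assumptions):

    import math

    def knn_regress(training_data, xq, k):
        """Estimate a real-valued target as the mean of f(x) over the
        k training examples nearest to xq.
        training_data: list of (instance, target_value) pairs."""
        dist = lambda a, b: math.sqrt(sum((u - v) ** 2 for u, v in zip(a, b)))
        neighbors = sorted(training_data, key=lambda pair: dist(pair[0], xq))[:k]
        return sum(y for _, y in neighbors) / k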
DISTANCE-WEIGHTED NEAREST NEIGHBOR ALGORITHM
One refinement of the k-NEAREST NEIGHBOR algorithm is to weight the contribution of each of the k neighbors according to its distance from the query point $x_q$.
This can be accomplished by
$$\hat{f}(x_q) \leftarrow \arg\max_{v \in V} \sum_{i=1}^{k} w_i \, \delta\bigl(v, f(x_i)\bigr)$$
where
$$w_i \equiv \frac{1}{d(x_q, x_i)^2}$$
and $\delta(a, b) = 1$ if $a = b$ and $0$ otherwise.
If the query point $x_q$ exactly matches one of the training instances $x_i$, the denominator $d(x_q, x_i)^2$ is zero; in this case we assign $\hat{f}(x_q)$ to be $f(x_i)$.
If there are several such training examples, we assign the majority classification among them.
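A sketch of this distance-weighted classification rule, including the exact-match special case just described (function and variable names are illustrative):

    import math
    from collections import defaultdict

    def weighted_knn_classify(training_data, xq, k):
        """Each of the k nearest neighbors votes for its label with weight
        w_i = 1 / d(xq, x_i)^2; exact matches (distance 0) decide directly."""
        dist = lambda a, b: math.sqrt(sum((u - v) ** 2 for u, v in zip(a, b)))
        scored = sorted(((dist(x, xq), y) for x, y in training_data),
                        key=lambda t: t[0])[:k]
        exact = [y for d, y in scored if d == 0]
        if exact:  # query coincides with training instance(s): majority label
            return max(set(exact), key=exact.count)
        votes = defaultdict(float)
        for d, y in scored:
            votes[y] += 1.0 / d ** 2
        return max(votes, key=votes.get)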
We can distance-weight the instances for real-valued target functions in a similar fashion by
$$\hat{f}(x_q) \leftarrow \frac{\sum_{i=1}^{k} w_i \, f(x_i)}{\sum_{i=1}^{k} w_i}$$
where the denominator is a constant that normalizes the contributions of the various weights.
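A corresponding sketch for the real-valued case (again with illustrative names; an exact match simply returns the stored target value):

    import math

    def weighted_knn_regress(training_data, xq, k):
        """Distance-weighted average of the k nearest target values,
        normalized by the sum of the weights w_i = 1 / d(xq, x_i)^2."""
        dist = lambda a, b: math.sqrt(sum((u - v) ** 2 for u, v in zip(a, b)))
        scored = sorted(((dist(x, xq), y) for x, y in training_data),
                        key=lambda t: t[0])[:k]
        exact = [y for d, y in scored if d == 0]
        if exact:
            return sum(exact) / len(exact)
        weights = [1.0 / d ** 2 for d, _ in scored]
        return sum(w * y for w, (_, y) in zip(weights, scored)) / sum(weights)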
All of these variants of the k-NEAREST NEIGHBOR algorithm consider only the k nearest neighbors when classifying the query point.
Once we add distance weighting, there is really no harm in allowing all training examples to influence the classification of $x_q$, because very distant examples have very little effect on $\hat{f}(x_q)$.
Disadvantage: the classifier will run more slowly.
If all training examples are considered when classifying a new query instance, we call the algorithm a global method; if only the nearest training examples are considered, we call it a local method.
Remarks:
The distance-weighted k-NEAREST NEIGHBOR algorithm is a highly effective inductive inference method for many practical problems.
The inductive bias corresponds to an assumption that the
classification of an instance xq will be most similar to the
classification of other instances that are nearby in Euclidean
distance.
The distance between instances is calculated based on all attributes
of the instance.
Although an instance may be described by a large number of attributes, only a few of them may be relevant to determining the classification.
In that case, instances that have identical values for the few relevant attributes may nevertheless be distant from one another in the n-dimensional space.
The similarity metric used by k-NN, because it depends on all attributes, will then be misleading.
The distance between two neighbors will be dominated by the large number of irrelevant attributes. This difficulty, which arises when many irrelevant attributes are present, is sometimes referred to as the curse of dimensionality.
We can overcome this problem by weighting each attribute differently when calculating the distance between two instances.
This corresponds to stretching the axes in the Euclidean space: shortening the axes that correspond to less relevant attributes and lengthening the axes that correspond to more relevant attributes.
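A minimal sketch of such an attribute-weighted (axis-stretched) distance; the scaling factors shown are illustrative and would in practice be chosen as discussed next:

    import math

    def weighted_distance(xi, xj, attr_weights):
        """Euclidean distance with one non-negative scaling factor per attribute.
        A weight of 0 effectively eliminates that attribute; larger weights
        stretch (lengthen) the corresponding axis."""
        return math.sqrt(sum(w * (a - b) ** 2
                             for w, a, b in zip(attr_weights, xi, xj)))

    # Only the first two attributes are treated as relevant here
    print(weighted_distance([1, 2, 9], [1, 5, 0], [1.0, 1.0, 0.0]))  # 3.0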
We can use cross-validation to automatically determine the amount by which each axis should be stretched.
An alternative is to eliminate the least relevant attributes from the instance space entirely, which amounts to setting some of the scaling factors to zero.
These approaches can be based on leave-one-out cross-validation, in which the set of m training examples is repeatedly divided into a training set of size m - 1 and a test set of size 1.
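A sketch of leave-one-out cross-validation, shown here selecting k for concreteness (the slides describe using it to choose the axis-stretching amounts); knn_classify is the earlier illustrative classifier, repeated so the snippet stands alone:

    import math
    from collections import Counter

    def knn_classify(training_data, xq, k):
        dist = lambda a, b: math.sqrt(sum((u - v) ** 2 for u, v in zip(a, b)))
        neighbors = sorted(training_data, key=lambda pair: dist(pair[0], xq))[:k]
        return Counter(label for _, label in neighbors).most_common(1)[0][0]

    def loocv_error(training_data, k):
        """Leave-one-out cross-validation: each of the m examples is held out
        once as a test set of size 1 while the remaining m - 1 examples are
        used for training; return the fraction misclassified."""
        m = len(training_data)
        mistakes = 0
        for i, (x, y) in enumerate(training_data):
            rest = training_data[:i] + training_data[i + 1:]
            if knn_classify(rest, x, k) != y:
                mistakes += 1
        return mistakes / m

    # Illustrative usage: pick the k with the lowest leave-one-out error
    # best_k = min([1, 3, 5], key=lambda k: loocv_error(training_data, k))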