Lecture 17 - KNN

KNN

Uploaded by

raoseshu

Transfer Functions

Supervised Learning – Classification

K-Nearest Neighbor Algorithm
Definition of Nearest Neighbor

[Figure: (a) 1-nearest neighbor, (b) 2-nearest neighbor, (c) 3-nearest neighbor of a record x]

The k-nearest neighbors of a record x are the data points that have the k smallest distances to x.
Basic Idea

• The k-NN classification rule assigns to a test sample the majority category label of its k nearest training samples.
• In practice, k is usually chosen to be odd to avoid ties (an odd k rules out ties only when there are two classes).
• The k = 1 rule is generally called the nearest-neighbor classification rule.
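
The majority-vote rule can be sketched in a few lines of Python. This is a minimal illustration, not an optimized implementation: it assumes Euclidean distance, and the function name `knn_classify` and the toy data are hypothetical.

```python
import math
from collections import Counter

def knn_classify(train, query, k=3):
    """Return the majority label among the k training samples nearest to query.

    train is a list of (feature_tuple, label) pairs; Euclidean distance is assumed.
    """
    # Sort all training samples by their distance to the query point.
    by_distance = sorted(train, key=lambda sample: math.dist(sample[0], query))
    # Count the labels of the k closest samples and return the most common one.
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]

# Hypothetical toy data: two small clusters labelled "A" and "B".
train = [((1.0, 1.0), "A"), ((1.2, 0.8), "A"), ((0.9, 1.1), "A"),
         ((4.0, 4.2), "B"), ((4.1, 3.9), "B")]

print(knn_classify(train, (1.1, 1.0), k=3))
print(knn_classify(train, (3.8, 4.0), k=1))
```

With k = 1 the function reduces to the nearest-neighbor rule.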
Nearest-Neighbor Classifiers: Issues

– The value of k, the number of nearest neighbors to retrieve
– Choice of distance metric to compute the distance between records
– Computational complexity
– Size of training set
– Dimension of data
Value of k
• Choosing the value of k:
  – If k is too small, the classifier is sensitive to noise points.
  – If k is too large, the neighborhood may include points from other classes.

Rule of thumb:
k = sqrt(N)
N: number of training points
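
As a small sketch, the rule of thumb can be turned into a starting value for k. Rounding to an integer and bumping even values up to the next odd number (to follow the odd-k preference stated under Basic Idea) are assumptions added here, and `rule_of_thumb_k` is a hypothetical helper name.

```python
import math

def rule_of_thumb_k(n_train):
    """Starting value for k: sqrt(N), rounded, then made odd (assumed convention)."""
    k = max(1, round(math.sqrt(n_train)))
    # An even k can produce ties in two-class problems, so move to the next odd value.
    if k % 2 == 0:
        k += 1
    return k

for n in (25, 100, 400):
    print(n, rule_of_thumb_k(n))
```

The result is only a starting point; in practice k is usually tuned on held-out data.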
Distance Metrics
Distance Measure: Scale Effects

• Different features may have different measurement scales.
  – E.g., patient weight in kg (range [50, 200]) vs. blood protein values in ng/dL (range [-3, 3])
• Consequences
  – Patient weight will have a much greater influence on the distance between samples.
  – This may bias the performance of the classifier.
Standardization

• Transform raw feature values into z-scores:

$$ z_{ij} = \frac{x_{ij} - m_j}{s_j} $$

  – $x_{ij}$ is the value for the ith sample and jth feature
  – $m_j$ is the average of all $x_{ij}$ for feature j
  – $s_j$ is the standard deviation of all $x_{ij}$ over all input samples
• The range and scale of the z-scores should be similar (provided the distributions of raw feature values are alike).
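
The z-score transform can be sketched with the standard library alone. The patient values below are hypothetical, chosen to span the ranges quoted above, and the use of the population standard deviation is an assumption (the slides do not say which estimator is meant).

```python
import statistics

def standardize(rows):
    """Column-wise z-scores for a list of equal-length feature rows."""
    columns = list(zip(*rows))
    means = [statistics.mean(col) for col in columns]
    # Population standard deviation is assumed; statistics.stdev is the sample version.
    # A constant column (standard deviation 0) would need special handling.
    stds = [statistics.pstdev(col) for col in columns]
    return [tuple((x - m) / s for x, m, s in zip(row, means, stds)) for row in rows]

# Hypothetical patients: (weight in kg, blood protein value).
patients = [(50.0, -3.0), (125.0, 0.0), (200.0, 3.0)]
z = standardize(patients)
for raw, scaled in zip(patients, z):
    print(raw, scaled)
```

Before scaling, the weight column spans 150 units and the protein column only 6, so weight dominates any Euclidean distance; after scaling, both columns contribute on the same footing.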
Additional Material
Voronoi Diagram

Properties:
1) Every point within a sample's Voronoi cell has that sample as its nearest neighbor.
2) For any sample, the nearest sample is determined by the closest Voronoi cell edge.
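Distance-weighted k-NN

Replace

$$ \hat{f}(q) = \arg\max_{v \in V} \sum_{i=1}^{k} \delta(v, f(x_i)) $$

by

$$ \hat{f}(q) = \arg\max_{v \in V} \sum_{i=1}^{k} \frac{1}{d(x_i, x_q)^2} \, \delta(v, f(x_i)) $$

Here $\delta(a, b)$ is 1 if $a = b$ and 0 otherwise, and $d(x_i, x_q)$ is the distance between training sample $x_i$ and the query $x_q$.

General kernel functions such as Parzen windows may be considered instead of inverse distance.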
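
The weighted vote can be sketched as follows. Euclidean distance, the handling of an exact match (returning its label directly to avoid division by zero), and the name `weighted_knn_classify` are assumptions made for this illustration.

```python
import math
from collections import defaultdict

def weighted_knn_classify(train, query, k=3):
    """Vote among the k nearest samples, each weighted by 1 / distance^2."""
    nearest = sorted(train, key=lambda sample: math.dist(sample[0], query))[:k]
    scores = defaultdict(float)
    for features, label in nearest:
        d = math.dist(features, query)
        if d == 0.0:
            # Assumed convention: an exact match decides the label outright.
            return label
        scores[label] += 1.0 / d ** 2
    # Return the label with the largest accumulated weight.
    return max(scores, key=scores.get)

# Hypothetical data: one close "A" sample and two distant "B" samples.
train = [((0.0, 0.0), "A"), ((3.0, 0.0), "B"), ((0.0, 3.0), "B")]
print(weighted_knn_classify(train, (0.5, 0.5), k=3))
```

In this toy set an unweighted 3-NN vote would pick "B" (two votes to one), whereas the inverse-square weights let the single close "A" sample outweigh the two distant "B" samples.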
Distance for Heterogeneous Data

Wilson, D. R. and Martinez, T. R., "Improved Heterogeneous Distance Functions," Journal of Artificial Intelligence Research, vol. 6, no. 1, pp. 1-34, 1997.
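
The slide gives only the citation. As one illustration of the kind of function that paper discusses, here is a sketch of the Heterogeneous Euclidean-Overlap Metric (HEOM), which the paper describes as a baseline: nominal attributes use a 0/1 overlap distance, numeric attributes use the absolute difference divided by the attribute's range, and a missing value counts as distance 1. The function signature, the `None` convention for nominal attributes, and the sample records are assumptions made here.

```python
import math

def heom(x, y, numeric_ranges):
    """HEOM distance between two records with mixed attribute types.

    numeric_ranges[i] is (max - min) of attribute i over the training data
    if the attribute is numeric, or None if it is nominal (assumed convention).
    """
    total = 0.0
    for xa, ya, rng in zip(x, y, numeric_ranges):
        if xa is None or ya is None:
            d = 1.0                        # missing value: maximal distance
        elif rng is None:
            d = 0.0 if xa == ya else 1.0   # nominal attribute: overlap distance
        else:
            d = abs(xa - ya) / rng         # numeric attribute: range-normalized difference
        total += d * d
    return math.sqrt(total)

# Hypothetical records: (weight in kg, sex, blood group); weight range assumed to be 150.
a = (70.0, "male", "O")
b = (100.0, "female", "O")
print(heom(a, b, (150.0, None, None)))
```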
Nearest Neighbour: Computational Complexity
• Expensive
  – To determine the nearest neighbour of a query point q, the distance to all N training examples must be computed.
    + Pre-sort training examples into fast data structures (kd-trees)
    + Compute only an approximate distance (locality-sensitive hashing, LSH)
    + Remove redundant data (condensing)
• Storage Requirements
  – Must store all training data
    + Remove redundant data (condensing)
    - Pre-sorting often increases the storage requirements
• High Dimensional Data
  – "Curse of Dimensionality"
    · Required amount of training data increases exponentially with dimension
    · Computational cost also increases dramatically
    · Partitioning techniques degrade to linear search in high dimension
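
A rough back-of-the-envelope sketch of the exponential growth, assuming data spread uniformly over a unit hypercube (an assumption not stated in the slides): a sub-cube that should capture a given fraction of the data needs edge length fraction^(1/d), and keeping the sampling density of n points per axis requires n^d points. Both helper names are hypothetical.

```python
def edge_length(fraction, dim):
    """Edge of a sub-cube holding `fraction` of uniformly spread data in `dim` dimensions."""
    return fraction ** (1.0 / dim)

def samples_for_same_density(n_per_axis, dim):
    """Number of points needed to keep n_per_axis points along every axis."""
    return n_per_axis ** dim

for d in (1, 2, 10, 100):
    print(d, round(edge_length(0.01, d), 3), samples_for_same_density(10, d))
```

As the dimension grows, a neighborhood holding even 1% of the data stretches across most of each axis, so the "nearest" neighbors are no longer local.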
KNN: Alternate Terminologies

• Instance-Based Learning
• Lazy Learning
• Case-Based Reasoning
• Exemplar-Based Learning
Discussions
• kNN can deal with complex and arbitrary decision boundaries.
• Despite its simplicity, researchers have shown that the classification accuracy of kNN can be quite strong, in many cases matching that of more elaborate methods.
• kNN is slow at classification time.
• kNN does not produce an understandable model.
Summary
• Applications of supervised learning are found in almost every field or domain.
• We studied 4 classification techniques.
• There are still many other methods, e.g.,
  – Bayesian networks
  – Neural networks
  – Genetic algorithms
  – Fuzzy classification
  This large number of methods also shows the importance of classification and its wide applicability.
• It remains an active research area.
