K-Nearest Neighbors (K-NN) is a supervised machine learning algorithm used for classification and regression, characterized by its non-parametric, lazy learning, and instance-based nature. The algorithm involves selecting a value for K, computing distances to training data, and making predictions based on the nearest neighbors, with various distance metrics like Euclidean and Manhattan. While K-NN is simple and effective for small datasets, it can be computationally expensive and sensitive to noise, requiring optimizations like feature scaling and dimensionality reduction for better performance.
K-Nearest Neighbours
Dr. Eva Patel
Associate Professor
Advanced Computer Science and Engineering Department
Introduction
* K-Nearest Neighbors (K-NN) is a supervised machine learning algorithm that can
be used for both classification and regression tasks.
* Key Characteristics:
  * Non-parametric: It does not make assumptions about the data distribution.
  * Lazy Learning: No explicit training phase; there is no model-learning step, and all
    computation is deferred until prediction time.
  * Instance-Based: Stores all training examples and makes decisions based on
    similarity.
  * Distance-Based: Classifies or predicts based on the distance to the nearest
    neighbors.
The KNN Algorithm
* The K-NN algorithm follows these steps (a from-scratch sketch follows the list):
1. Choose the value of K (number of nearest neighbors).
2. Compute the distance between the new data point and all training data
points.
3. Sort the distances and select the K nearest neighbors.
4. For classification: Assign the most common class among the K neighbors.
5. For regression: Compute the average (or weighted average) of the values of
the K neighbors.
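The numbered steps above map almost line-for-line onto code. The following is a minimal,
illustrative from-scratch sketch for the classification case, assuming NumPy arrays,
Euclidean distance, and an unweighted majority vote; the function name knn_predict and the
toy data are hypothetical.

import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_new, k=3):
    # Step 2: compute Euclidean distances from the new point to all training points
    distances = np.sqrt(((X_train - x_new) ** 2).sum(axis=1))
    # Step 3: sort the distances and take the indices of the K nearest neighbors
    nearest = np.argsort(distances)[:k]
    # Step 4: classification by majority vote among the K neighbors
    return Counter(y_train[nearest]).most_common(1)[0][0]

# Example usage on a tiny two-class toy dataset
X_train = np.array([[1.0, 2.0], [2.0, 3.0], [8.0, 9.0], [9.0, 8.0]])
y_train = np.array([0, 0, 1, 1])
print(knn_predict(X_train, y_train, np.array([1.5, 2.5]), k=3))  # predicts class 0

For regression (step 5), the majority vote would be replaced by y_train[nearest].mean() or
a distance-weighted average.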
Distance Metrics in K-NN
* The choice of distance metric affects the performance of K-NN.
A. Euclidean Distance
* Most commonly used:
  d(A, B) = \sqrt{\sum_{i=1}^{n} (A_i - B_i)^2}
* Suitable for continuous variables.
* Sensitive to different scales of data.
B. Manhattan Distance
  d(A, B) = \sum_{i=1}^{n} |A_i - B_i|
* Works well when data varies along axes (e.g., city-block distance).
Distance Metrics in K-NN
C. Minkowski Distance (Generalized Form)
  d(A, B) = \left( \sum_{i=1}^{n} |A_i - B_i|^p \right)^{1/p}
* When p = 2, it becomes Euclidean distance.
* When p = 1, it becomes Manhattan distance.
D. Cosine Similarity
  \cos\theta = \frac{A \cdot B}{\|A\| \, \|B\|}
* Used for text or high-dimensional data.
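As a quick check of the four formulas above, the sketch below computes each metric with
NumPy for two small example vectors; the variable names and values are illustrative only.

import numpy as np

A = np.array([1.0, 2.0, 3.0])
B = np.array([2.0, 4.0, 6.0])

euclidean  = np.sqrt(np.sum((A - B) ** 2))          # sqrt(sum (A_i - B_i)^2)
manhattan  = np.sum(np.abs(A - B))                  # sum |A_i - B_i|
p = 3
minkowski  = np.sum(np.abs(A - B) ** p) ** (1 / p)  # (sum |A_i - B_i|^p)^(1/p)
cosine_sim = A @ B / (np.linalg.norm(A) * np.linalg.norm(B))  # A.B / (||A|| ||B||)

print(euclidean, manhattan, minkowski, cosine_sim)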
Choosing the Right K Value
+ Small K (e.g., K = 1, K = 3): High variance, more sensitive to noise.
* Large K (e.g., K = 10, K = 20): High bias, smooth decision boundary.
* Common practice: Use odd values of K to avoid ties in classification.
* Rule of thumb: K ≈ √N (where N is the number of training examples); in practice, K is
  often tuned by cross-validation, as sketched below.
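Beyond these heuristics, K is usually tuned empirically. The sketch below shows one way to
do this with 5-fold cross-validation on the Iris data using scikit-learn; the variable
names are illustrative, and feature scaling is omitted here for brevity.

from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)
k_values = range(1, 21, 2)   # odd values of K to avoid ties
scores = [cross_val_score(KNeighborsClassifier(n_neighbors=k), X, y, cv=5).mean()
          for k in k_values]
best_score, best_k = max(zip(scores, k_values))
print("Best K:", best_k, "with cross-validated accuracy:", best_score)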
Applications
* Handwritten digit recognition (e.g., the MNIST dataset)
* Recommendation systems (e.g., movie or product recommendations)
* Anomaly detection in cybersecurity
* Medical diagnosis (e.g., cancer detection)
* Credit scoring and fraud detection
Advantages and Disadvantages of K-NN
Advantages:
* Simple and intuitive
* No training phase (fast to set up)
* Works well with small datasets
* Can handle multi-class classification
* Can be used for both classification and regression
Disadvantages:
* Computationally expensive at prediction time (slow for large datasets)
* Sensitive to irrelevant features and noise
* Performance depends on the choice of distance metric
* Struggles with high-dimensional data (curse of dimensionality)
Optimizing K-NN
* Feature Scaling: Normalize data using Min-Max Scaling or Standardization
  (Z-score normalization).
* Dimensionality Reduction: Use PCA (Principal Component Analysis) to reduce the
  number of features.
* Weighted K-NN: Give closer neighbors more weight in decision-making (see the
  sketch after this list).
* Efficient Search Algorithms:
  * KD-Tree (for low-dimensional data)
  * Ball Tree (for higher dimensions)
  * Approximate Nearest Neighbors (ANN) methods
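Several of these optimizations are exposed as options of scikit-learn's
KNeighborsClassifier. The sketch below combines feature scaling, distance-weighted voting,
and KD-tree search on the Iris data; the particular pipeline is illustrative rather than a
prescribed setup.

from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.neighbors import KNeighborsClassifier
from sklearn.datasets import load_iris

X, y = load_iris(return_X_y=True)

model = make_pipeline(
    StandardScaler(),                          # feature scaling (Z-score normalization)
    KNeighborsClassifier(n_neighbors=5,
                         weights='distance',   # weighted K-NN: closer neighbors count more
                         algorithm='kd_tree')  # KD-tree index for neighbor search
)
model.fit(X, y)
print(model.predict(X[:3]))   # predictions for the first three training samples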
K-NN in Python (using Scikit-Learn)
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.datasets import load_iris
# Load dataset
iris = load_iris()
X, y = iris.data, iris.target

# Split into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
                                                    random_state=42)

# Feature scaling (important for distance-based algorithms)
scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)

# Train K-NN model
knn = KNeighborsClassifier(n_neighbors=5, metric='euclidean')
knn.fit(X_train, y_train)

# Make predictions
y_pred = knn.predict(X_test)

# Evaluate accuracy
from sklearn.metrics import accuracy_score
print("Accuracy:", accuracy_score(y_test, y_pred))