Lecture 5

Summary: key points about performance measures for multi-class classification:
• The confusion matrix is extended to an N×N matrix, where N is the number of classes.
• Diagonal elements represent correctly classified samples for each class.
• Off-diagonal elements represent samples misclassified between classes.
• Accuracy is calculated as the sum of all diagonal elements divided by the total number of samples.
• Precision, recall, and F1 score can be calculated individually for each class.
• Overall precision/recall averages the scores across all classes.
• Specificity keeps the same definition as in binary classification.
The confusion matrix provides insight into which classes are most often confused or misclassified by the model. Focusing on improving predictions for those classes can enhance multi-class classification performance.


Lecture 5: Performance Measures
By: Dr. Eman Ahmed
Contents
• Confusion Matrix
• Accuracy
• Precision
• Sensitivity
• Specificity
• F1-Score
Confusion Matrix
• A confusion matrix is a performance evaluation tool in machine learning.
• It displays the number of true positives, true negatives, false positives, and false negatives.
• A confusion matrix is an N x N matrix used for evaluating the performance of a classification model, where N is the total number of classes. The matrix compares the actual target values with those predicted by the machine learning model.
Confusion Matrix
• For a binary classification problem, we would have a 2 x 2 matrix:

                          Total Actual Positive    Total Actual Negative
Total Predicted Positive           TP                        FP
Total Predicted Negative           FN                        TN

• The class variable has two values: Positive or Negative.
• The columns represent the classes' actual values.
• The rows represent the classes' predicted values.
• True Positive (TP)
• The predicted class matches the actual class.
• The actual class was positive, and the model predicted a positive class.

• True Negative (TN)


• The predicted class matches the actual class.
• The actual class was negative, and the model predicted a negative class.
• False Positive (FP) – Type I Error
• The predicted class was falsely predicted.
• The actual class was negative, but the model predicted a positive class.

• False Negative (FN) – Type II Error


• The predicted class was falsely predicted.
• The actual value was positive, but the model predicted a negative value.
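As a quick illustration of these four counts, here is a minimal Python sketch; the label lists are made up for the example and are not from the lecture:

```python
# Counting TP, TN, FP, FN for a binary classifier.
# The label lists below are made up purely for illustration.
actual    = ["pos", "pos", "neg", "neg", "pos", "neg", "pos", "neg"]
predicted = ["pos", "neg", "neg", "pos", "pos", "neg", "pos", "neg"]

tp = tn = fp = fn = 0
for a, p in zip(actual, predicted):
    if a == "pos" and p == "pos":
        tp += 1      # actual positive, predicted positive
    elif a == "neg" and p == "neg":
        tn += 1      # actual negative, predicted negative
    elif a == "neg" and p == "pos":
        fp += 1      # actual negative, predicted positive (Type I error)
    else:
        fn += 1      # actual positive, predicted negative (Type II error)

print(tp, tn, fp, fn)   # 3 3 1 1 for the labels above
```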
Example of Confusion Matrix (Sick or Healthy)
Example
• Get the values of TP, TN, FP, FN.
• How many samples are there in the test set?
Example
• True Positive (TP) = 560, meaning the model correctly classified
560 positive class data points.
• True Negative (TN) = 330, meaning the model correctly
classified 330 negative class data points.
• False Positive (FP) = 60, meaning the model incorrectly classified
60 negative class data points as belonging to the positive class.
• False Negative (FN) = 50, meaning the model incorrectly
classified 50 positive class data points as belonging to the
negative class.
• Total Number of samples = TP + TN + FN + FP = 1000
Performance Measures
• Accuracy
Accuracy = (TP + TN) / (TP + TN + FN + FP)
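Continuing the worked example above (TP = 560, TN = 330, FP = 60, FN = 50), a minimal sketch of the accuracy calculation:

```python
tp, tn, fp, fn = 560, 330, 60, 50          # counts from the example confusion matrix

accuracy = (tp + tn) / (tp + tn + fp + fn)
print(accuracy)                            # 0.89 -> 890 of the 1000 samples are classified correctly
```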
Performance Measures
• Precision: It tells us how many of the positively predicted cases
actually turned out to be positive.

Precision = TP / (TP + FP)

• Precision tells us how reliable the model's positive predictions are.
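A minimal sketch of the precision calculation with the same example counts:

```python
tp, fp = 560, 60                 # counts from the example confusion matrix

precision = tp / (tp + fp)
print(round(precision, 4))       # 0.9032 -> about 90% of the predicted positives are truly positive
```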


Performance Measures
• Sensitivity or Recall or True Positive Rate (TPR):
• It tells us how many of the actual positive cases we were able to
predict correctly with our model.
TPR = TP / (TP + FN)

Example: In medicine, sensitivity refers to a test's ability to classify an individual with a disease as positive. A highly sensitive test produces few false negative results, and thus fewer cases of disease are missed.
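A minimal sketch of recall (sensitivity) with the same example counts:

```python
tp, fn = 560, 50                 # counts from the example confusion matrix

recall = tp / (tp + fn)          # also called sensitivity or true positive rate (TPR)
print(round(recall, 4))          # 0.918 -> about 92% of the actual positives are detected
```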
Get Precision and Recall for the given confusion matrix
Result and Comment
Precision = 50%: among all the samples predicted as positive, 50% were actually positive.
Recall = 75%: among all the samples that are actually positive, 75% were correctly predicted as positive.

Precision is a useful metric in cases where False Positives are a higher concern than False Negatives.
Example: Music recommendation systems or e-commerce. There are two classes: recommended (positive) and not recommended (negative). Many false positives means treating music that should not be recommended as recommended. Customers will get bored and stop using the app, causing loss of business.

Recall is a useful metric in cases where False Negatives are a higher concern than False Positives.
Example: Medical applications. There are two classes: sick (+ve) or healthy (-ve). A false negative means considering a sick patient as healthy, putting their life at risk because they will not take medications.
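The confusion matrix for this exercise appears only as a figure, so the counts below are made up to be consistent with the stated results (50% precision, 75% recall); the sketch just reproduces the two calculations:

```python
tp, fp, fn = 30, 30, 10          # illustrative counts only, not taken from the slide

precision = tp / (tp + fp)       # 30 / 60 = 0.50
recall    = tp / (tp + fn)       # 30 / 40 = 0.75
print(precision, recall)         # 0.5 0.75
```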
Performance Measures
• F1-Score (for the positive class): the harmonic mean of the precision and recall scores obtained for the positive class.

F1 = 2 × (Precision × Recall) / (Precision + Recall)
In a binary classification model, an F1 score close to 1 indicates excellent precision and recall, while a low score indicates poor model performance. In general, a higher F1 score suggests better model performance.

It is used when it is not clear which of the precision or recall is most important for a given problem.
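A minimal sketch of the F1 computation, reusing the example counts from the earlier slides:

```python
tp, fp, fn = 560, 60, 50                              # counts from the example confusion matrix

precision = tp / (tp + fp)                            # ~0.9032
recall    = tp / (tp + fn)                            # ~0.9180
f1 = 2 * precision * recall / (precision + recall)    # harmonic mean of precision and recall
print(round(f1, 4))                                   # 0.9106
```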
Performance Measures
• Specificity: the number of samples correctly predicted to be in the negative class out of all the samples in the dataset that actually belong to the negative class. Also called the True Negative Rate (TNR).

Specificity = TN / (TN + FP)

Example: For a medical application, the specificity of a test is its ability to classify an individual who does not have a disease as negative.
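A minimal sketch of specificity with the same example counts:

```python
tn, fp = 330, 60                 # counts from the example confusion matrix

specificity = tn / (tn + fp)     # true negative rate (TNR)
print(round(specificity, 4))     # 0.8462 -> about 85% of the actual negatives are correctly identified
```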
Confusion Matrix for a Multi-class problem
• Assume you have 4 classes with the following confusion matrix (M), where each cell is named M_ij (rows = predicted class, columns = actual class).
[Figure: 4×4 confusion matrix, columns labeled Actual]
• What is the accuracy of this classifier?
• What is the class for which the
classifier has the best performance?
• What is the class for which the
classifier has the worst performance?
• The diagonal elements are the correctly predicted samples. A total of
145 samples were correctly predicted out of the total 191 samples.
Thus, the overall accuracy is 75.92%.
• M_24=0 implies that the model does not confuse samples originally
belonging to class-4 with class-2, i.e., the classification boundary
between classes 2 and 4 was learned well by the classifier.
• To improve the model’s performance, one should focus on the
predictive results in class-3. A total of 18 samples (adding the
numbers in the red boxes of column 3) were misclassified by the
classifier, which is the highest misclassification rate among all the
classes. Accuracy in prediction for class-3 is, thus, 58.14% only.
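The slide's 4×4 matrix itself appears only as a figure, so the cell values below are placeholders chosen to be consistent with the figures quoted above (191 samples, 145 on the diagonal, M_24 = 0, and 18 misclassified samples in the class-3 column); the real matrix may differ cell by cell. The sketch shows how the overall and per-class accuracies are computed:

```python
import numpy as np

# Placeholder 4-class confusion matrix (rows = predicted class, columns = actual class).
M = np.array([
    [50,  5,  6,  4],
    [ 3, 30,  8,  0],
    [ 4,  6, 25,  4],
    [ 1,  1,  4, 40],
])

overall_accuracy = np.trace(M) / M.sum()            # 145 / 191
per_class_accuracy = np.diag(M) / M.sum(axis=0)     # per actual class (column-wise)

print(round(overall_accuracy, 4))                   # 0.7592
print(per_class_accuracy.round(4))                  # [0.8621 0.7143 0.5814 0.8333]
```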
What is TP, TN, FP, FN for class 1?
[Figure: the 4×4 confusion matrix (columns = Actual), partitioned for class 1 into the 2×2 blocks
TP  FP
FN  TN]
What is TP, TN, FP, FN for class 2?
[Figure: the same 4×4 confusion matrix, partitioned for class 2]
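A minimal sketch (reusing the placeholder matrix above) of how TP, FP, FN, and TN are read off for a particular class when it is treated as the positive class:

```python
import numpy as np

# Same placeholder matrix as above (rows = predicted class, columns = actual class).
M = np.array([
    [50,  5,  6,  4],
    [ 3, 30,  8,  0],
    [ 4,  6, 25,  4],
    [ 1,  1,  4, 40],
])

def one_vs_rest_counts(M, k):
    """TP, FP, FN, TN for class index k, treating class k as the positive class."""
    tp = M[k, k]                    # predicted k and actually k
    fp = M[k, :].sum() - tp         # predicted k but actually another class
    fn = M[:, k].sum() - tp         # actually k but predicted as another class
    tn = M.sum() - tp - fp - fn     # everything not involving class k
    return int(tp), int(fp), int(fn), int(tn)

print(one_vs_rest_counts(M, 0))     # class 1 -> (50, 15, 8, 118)
print(one_vs_rest_counts(M, 1))     # class 2 -> (30, 11, 12, 138)
```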
Performance Measures for Multi-class Classification
