Lesson 7. Performance metrics
Data Science and Automation course
Master's Degree in Smart Technology Engineering
Teacher: Mirko Mazzoleni
Place: University of Bergamo
Outline
1. Metrics
2. Precision and recall
3. Receiver Operating Characteristic (ROC) curves
Metrics
It is extremely important to use quantitative metrics for evaluating a machine learning
model
• Until now, we relied on the cost function value for regression and classification
• Other metrics can be used to better evaluate and understand the model
• For classification: Accuracy, Precision, Recall, F1-score, ROC curves, …
• For regression: Normalized RMSE, Normalized Mean Absolute Error (NMAE), … (see the sketch below)
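As a minimal sketch (not from the slides), the snippet below computes normalized regression metrics on made-up data. Normalization conventions vary (range, mean, or standard deviation of the targets); the range is used here as one common choice.

```python
import numpy as np

# Hypothetical regression targets and predictions (made-up numbers)
y_true = np.array([2.0, 3.5, 4.0, 5.5, 7.0])
y_pred = np.array([2.2, 3.1, 4.3, 5.0, 7.4])

rmse = np.sqrt(np.mean((y_true - y_pred) ** 2))
mae  = np.mean(np.abs(y_true - y_pred))

# Normalize by the range of the observed targets (one common convention)
span = y_true.max() - y_true.min()
print("NRMSE:", rmse / span)
print("NMAE: ", mae / span)
```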
Classification case: metrics for skewed classes
Binary disease classification example
Train a logistic regression model ℎ(𝒙), with 𝑦 = 1 if disease, 𝑦 = 0 otherwise.
Suppose you find a 1% error on the test set (99% correct diagnoses).
However, only 0.50% of patients actually have the disease: the 𝑦 = 1 class has very few examples with respect to the 𝑦 = 0 class.
If we use a predictor that always predicts the 𝟎 class, we get 99.5% accuracy!
For skewed classes, the accuracy metric can be deceptive
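A minimal sketch of this effect, assuming a synthetic test set with roughly 0.5% positive cases: a classifier that always predicts 0 still reaches about 99.5% accuracy.

```python
import numpy as np
from sklearn.metrics import accuracy_score

# Synthetic skewed test set: roughly 0.5% positives (disease), 99.5% negatives
rng = np.random.default_rng(0)
y_true = (rng.random(10_000) < 0.005).astype(int)

# A trivial "classifier" that always predicts the majority class 0
y_pred = np.zeros_like(y_true)

print(accuracy_score(y_true, y_pred))  # about 0.995, yet no disease case is detected
```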
Precision and recall
Suppose that 𝑦 = 1 corresponds to a rare class that we want to detect.

Confusion matrix (rows: predicted class, columns: actual class):

                      Actual 1 (p)            Actual 0 (n)
Predicted 1 (Y)       True positive (TP)      False positive (FP)
Predicted 0 (N)       False negative (FN)     True negative (TN)

Precision (how precise we are in the detection)
Of all patients for which we predicted 𝑦 = 1, what fraction actually has the disease?

Precision = TP / # Predicted Positive = TP / (TP + FP)

Recall (how good we are at detecting)
Of all patients that actually have the disease, what fraction did we correctly detect as having the disease?

Recall = TP / # Actual Positive = TP / (TP + FN)
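A minimal sketch of these definitions using scikit-learn; the labels and predictions below are made up for illustration.

```python
from sklearn.metrics import confusion_matrix, precision_score, recall_score

# Made-up labels and predictions for a small test set
y_true = [1, 0, 1, 1, 0, 0, 0, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 0, 1, 0, 0]

# For binary problems, ravel() returns the entries in the order tn, fp, fn, tp
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print("Precision:", tp / (tp + fp), "==", precision_score(y_true, y_pred))
print("Recall:   ", tp / (tp + fn), "==", recall_score(y_true, y_pred))
```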
Trading off precision and recall
Logistic regression outputs 0 ≤ ℎ(𝒙) ≤ 1
• Predict 1 if ℎ(𝒙) ≥ 0.5
• Predict 0 if ℎ(𝒙) < 0.5
The threshold does not have to be 0.5: different thresholds correspond to different confusion matrices!
Suppose we want to predict 𝑦 = 1 (disease) only if very confident
• Increase threshold → Higher precision, lower recall
Suppose we want to avoid missing too many cases of disease (avoid false negatives).
• Decrease threshold → Higher recall, lower precision
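A minimal sketch of the trade-off, sweeping the decision threshold over made-up predicted probabilities and labels: raising the threshold increases precision and lowers recall.

```python
import numpy as np
from sklearn.metrics import precision_score, recall_score

# Hypothetical predicted probabilities h(x) and true labels (made up)
y_true  = np.array([1, 0, 1, 1, 0, 0, 1, 0, 0, 1])
y_score = np.array([0.95, 0.80, 0.75, 0.60, 0.55, 0.45, 0.40, 0.30, 0.20, 0.10])

# Higher threshold -> fewer predicted positives -> higher precision, lower recall
for thr in (0.3, 0.5, 0.7):
    y_pred = (y_score >= thr).astype(int)
    print(f"threshold {thr}: "
          f"precision={precision_score(y_true, y_pred):.2f} "
          f"recall={recall_score(y_true, y_pred):.2f}")
```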
F1-score
It is usually better to compare models by means of a single number. The F1-score can be used to combine precision and recall.

              Precision (P)   Recall (R)   Average   F1 score
Algorithm 1       0.5            0.4         0.45      0.444
Algorithm 2       0.7            0.1         0.40      0.175
Algorithm 3       0.02           1.0         0.51      0.0392

Algorithm 3 always predicts 𝟏. The average (incorrectly) says that Algorithm 3 is the best; the F1 score correctly identifies Algorithm 1 as the best.

Average = (P + R) / 2          F1 score = 2 · P · R / (P + R)

• P = 0 or R = 0 ⇒ F1 score = 0
• P = 1 and R = 1 ⇒ F1 score = 1
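A minimal sketch reproducing the table above; the helper function f1 below simply implements the formula and is not a library call.

```python
def f1(p, r):
    """F1 score: 2*P*R/(P+R). Returns 0 when P = R = 0 to avoid division by zero."""
    return 0.0 if (p + r) == 0 else 2 * p * r / (p + r)

# The three algorithms from the table above
for name, p, r in [("Algorithm 1", 0.5, 0.4),
                   ("Algorithm 2", 0.7, 0.1),
                   ("Algorithm 3", 0.02, 1.0)]:
    print(name, "average:", round((p + r) / 2, 3), "F1:", round(f1(p, r), 3))
```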
Summaries of the confusion matrix
Different metrics can be computed from the confusion matrix, depending on the class of
interest (https://en.wikipedia.org/wiki/Precision_and_recall)
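A minimal sketch of some of these summaries, computed from hypothetical confusion-matrix counts:

```python
# Common summaries derived from the four confusion-matrix entries (made-up counts)
tp, fp, fn, tn = 30, 10, 20, 940

precision   = tp / (tp + fp)                  # positive predictive value
recall      = tp / (tp + fn)                  # sensitivity, true positive rate
specificity = tn / (tn + fp)                  # true negative rate
fpr         = fp / (fp + tn)                  # false positive rate = 1 - specificity
accuracy    = (tp + tn) / (tp + fp + fn + tn)

print(precision, recall, specificity, fpr, accuracy)
```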
Ranking instead of classifying
Classifiers such as logistic regression can output a probability of belonging to a class (or
something similar).
• We can use this to rank the different instances and take action on the cases at the top of the list
• We may have a limited budget, so we have to target the most promising individuals
• Ranking enables the use of different techniques for visualizing model performance
Ranking instead of classifying

Example (adapted from [1]): a test set with 100 positive and 100 negative instances, ranked by decreasing classifier score.

Instance description   True class   Score
……………                       1        0.99
……………                       1        0.98
……………                       0        0.96
……………                       0        0.90
……………                       1        0.88
……………                       1        0.87
……………                       0        0.85
……………                       1        0.80
……………                       0        0.70

Different confusion matrices are obtained by changing the threshold on the score (rows: predicted Y/N, columns: actual p/n). For example:
• Threshold above all scores (predict always negative): TP = 0, FP = 0, FN = 100, TN = 100
• Threshold after the first instance: TP = 1, FP = 0, FN = 99, TN = 100
• Threshold after the first two instances: TP = 2, FP = 0, FN = 98, TN = 100
• Threshold after the first three instances: TP = 2, FP = 1, FN = 98, TN = 99
• Moving the threshold further down the list: TP = 6, FP = 4, FN = 94, TN = 96
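A minimal sketch of this idea, using only the nine instances shown above (so the counts differ from the 100/100 example in the slide): each cut of the ranked list gives a different confusion matrix.

```python
import numpy as np
from sklearn.metrics import confusion_matrix

# The nine ranked instances shown above (true class, score), sorted by decreasing score
y_true  = np.array([1, 1, 0, 0, 1, 1, 0, 1, 0])
y_score = np.array([0.99, 0.98, 0.96, 0.90, 0.88, 0.87, 0.85, 0.80, 0.70])

# Place the threshold at each score in turn: every cut of the ranked list
# produces a different confusion matrix
for thr in y_score:
    y_pred = (y_score >= thr).astype(int)
    tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
    print(f"threshold {thr:.2f}: TP={tp} FP={fp} FN={fn} TN={tn}")
```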
ROC curves
ROC curves are a very general way to represent and compare the performance of
different models (on a binary classification task)
[ROC plot: true positive rate (y-axis) vs. false positive rate (x-axis); the top-left corner is perfection, the diagonal line corresponds to random guessing]

Observations
• (0,0): predict always negative
• (1,1): predict always positive
• Diagonal line: random classifier
• Below the diagonal line: worse than a random classifier
• Different classifiers can be compared
• Area Under the Curve (AUC): probability that a randomly chosen positive instance will be ranked ahead of a randomly chosen negative instance
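A minimal sketch, assuming made-up labels and scores: scikit-learn's roc_curve and roc_auc_score compute the curve and the AUC, and the pairwise-ranking computation illustrates the probabilistic interpretation of the AUC.

```python
import numpy as np
from sklearn.metrics import roc_curve, roc_auc_score

# Hypothetical true labels and classifier scores (made up)
y_true  = np.array([1, 1, 0, 0, 1, 1, 0, 1, 0, 0])
y_score = np.array([0.99, 0.98, 0.96, 0.90, 0.88, 0.87, 0.85, 0.80, 0.70, 0.50])

# ROC curve points (one per threshold) and the area under the curve
fpr, tpr, thresholds = roc_curve(y_true, y_score)
print("AUC:", roc_auc_score(y_true, y_score))

# AUC as the probability that a random positive is ranked above a random negative
pos = y_score[y_true == 1]
neg = y_score[y_true == 0]
auc_by_ranking = np.mean([p > n for p in pos for n in neg])
print("AUC (pairwise ranking):", auc_by_ranking)
```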