Classification
CLASSIFICATION TYPES

• Binary classification: the task of classifying the elements of a given set into two groups on the basis of a classification rule.
• Multi-class classification: the task of classifying the elements of a given set into more than two groups on the basis of a classification rule.
Classification

• Can you separate the red class from the blue class?
Linear Boundary
• Straight line for two dimensions.
• Plane for three dimensions.
• Hyperplane for higher dimensions.
Confusion Matrix & Accuracy

                    Predicted Positive    Predicted Negative
Actual Positive     a (TP)                b (FN)
Actual Negative     c (FP)                d (TN)

Accuracy = percentage of correctly classified data points
Accuracy = (TP + TN) / (TP + FN + FP + TN)

Sensitivity = a / (a + b)        Specificity = d / (c + d)

Other error metrics
• Precision
• Recall
• F score
• ROC curve
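A minimal sketch of computing these metrics with scikit-learn; the label arrays here are purely illustrative:

```python
from sklearn.metrics import confusion_matrix, accuracy_score

# Illustrative labels: y_true are the actual classes, y_pred the model's output.
y_true = [1, 0, 1, 1, 0, 1, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 0, 1, 1, 0, 1, 0]

# For binary labels (0, 1), confusion_matrix returns [[TN, FP], [FN, TP]].
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()

accuracy    = (tp + tn) / (tp + fn + fp + tn)   # same as accuracy_score(y_true, y_pred)
sensitivity = tp / (tp + fn)                    # recall for the positive class
specificity = tn / (tn + fp)

print(accuracy, sensitivity, specificity)
```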
ZeroR Method

Algorithm
Construct a frequency table for the target and select its most frequent value. For classification this is the baseline model.

Disease:     Yes    No
Count:       9      6
Fraction:    0.6    0.4

Here ZeroR always predicts "Yes", giving a baseline accuracy of 0.6.
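ZeroR corresponds to scikit-learn's DummyClassifier with the "most_frequent" strategy; a minimal sketch using the illustrative counts from the table above:

```python
from sklearn.dummy import DummyClassifier

# 9 "Yes" and 6 "No" labels, as in the frequency table above.
X = [[0]] * 15                      # ZeroR ignores the features entirely
y = ["Yes"] * 9 + ["No"] * 6

baseline = DummyClassifier(strategy="most_frequent")
baseline.fit(X, y)

print(baseline.predict([[0]]))      # ['Yes'] for every input
print(baseline.score(X, y))         # 0.6 baseline accuracy
```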
Classification

The target variable is discrete (Target: 0/1).
(Figure: blood pressure data plotted against a binary 0/1 target.)
Can you fit a linear regression model?

A straight line fit to a 0/1 target predicts values outside [0, 1], so a different approach is needed.
Logistic Regression

Algorithm
Model the log-odds as a linear function of the independent variables, then convert the log-odds to a probability using the sigmoid (logistic) function.

Logistic function:
log(p / (1 - p)) = b0 + b1*x1 + ... + bn*xn
p = 1 / (1 + e^-(b0 + b1*x1 + ... + bn*xn))
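A minimal logistic regression sketch with scikit-learn; the 1-D synthetic data is illustrative:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Illustrative 1-D data: low values -> class 0, high values -> class 1.
X = np.array([[1.0], [2.0], [3.0], [4.0], [6.0], [7.0], [8.0], [9.0]])
y = np.array([0, 0, 0, 0, 1, 1, 1, 1])

model = LogisticRegression()
model.fit(X, y)

# predict_proba applies the sigmoid to the fitted log-odds b0 + b1*x.
print(model.predict_proba([[5.0]]))   # probability of each class at x = 5
print(model.predict([[5.0]]))         # hard 0/1 prediction
```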
Naive Bayes

Algorithm
Calculate the posterior probability, P(A|B), from P(A), P(B), and P(B|A). The Naive Bayes classifier assumes that the effect of the value of a predictor (x) on a given class (c) is independent of the values of the other predictors.
Bayes' theorem:

P(D|A) = P(A|D) * P(D) / P(A)

Applying the independence assumption to a disease D with predictors Alcohol (A), Smoking (S), and Age:

P(D | A & S & Age) ∝ P(A|D) * P(S|D) * P(Age|D) * P(D)
Naive Bayes

PROS
• Very easy and fast
• Can be used for multi-class prediction
• Performs well with categorical features
• If the features really are independent, NB gives superior predictions

CONS
• Features are not independent in most real-life examples
• The zero-frequency problem: a category not seen in the training data gets zero probability
• Assumes that numerical features follow a normal distribution

P(Y|X) ∝ P(X1|Y) * P(X2|Y) * ... * P(Xn|Y) * P(Y)
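A minimal sketch with scikit-learn's GaussianNB (the variant that assumes normally distributed numeric features, matching the last CON above); the data is illustrative:

```python
import numpy as np
from sklearn.naive_bayes import GaussianNB

# Illustrative numeric features (e.g. two measurements per patient).
X = np.array([[1.0, 2.1], [1.2, 1.9], [0.9, 2.2],
              [3.0, 4.1], [3.2, 3.9], [2.9, 4.2]])
y = np.array([0, 0, 0, 1, 1, 1])

model = GaussianNB()          # fits one Gaussian per class per feature
model.fit(X, y)

# Posterior class probabilities via Bayes' rule with the independence assumption.
print(model.predict_proba([[2.0, 3.0]]))
print(model.predict([[2.0, 3.0]]))
```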
Support Vector Machines (SVM)

Algorithm
SVM performs classification by finding a hyperplane that maximizes the separation margin between the two classes. The vectors that support (define) the hyperplane are the support vectors.

• Plot every data row as a point in N-dimensional space, where the N dimensions are the N features and each feature value is a coordinate.
• Find the hyperplane that separates these points into the different classes in the best way possible in that N-dimensional space.
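A minimal linear-kernel sketch with scikit-learn's SVC; the two clusters are illustrative:

```python
import numpy as np
from sklearn.svm import SVC

# Two linearly separable clusters (illustrative).
X = np.array([[1, 1], [1, 2], [2, 1],
              [4, 4], [4, 5], [5, 4]])
y = np.array([0, 0, 0, 1, 1, 1])

model = SVC(kernel="linear")    # maximum-margin separating hyperplane
model.fit(X, y)

print(model.support_vectors_)   # the points that define the margin
print(model.predict([[3, 3]]))
```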
Decision Boundary

Which one would you select? Why?
► The black line, because it classifies the points accurately with the highest margin.
Which one would you select? Why?
► Again the black line, because it classifies the points accurately with the highest margin.
Maximum Margin Classifier
► Classifies with the maximum margin.
What will you do in this case?
Can we calculate some other feature from X and Y and then try to separate the classes?
How about Z = X^2 + Y^2?
The same idea with features F1 and F2:
How about Z = F1^2 + F2^2? (See the sketch below.)
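A minimal sketch of this feature-map trick on illustrative circular data: adding Z = F1^2 + F2^2 makes the classes linearly separable, which is what kernels (e.g. RBF) do implicitly:

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)

# Illustrative data: class 0 inside a circle, class 1 in a ring outside it.
angles = rng.uniform(0, 2 * np.pi, 100)
radii  = np.concatenate([rng.uniform(0, 1, 50), rng.uniform(2, 3, 50)])
F1, F2 = radii * np.cos(angles), radii * np.sin(angles)
X = np.column_stack([F1, F2])
y = np.array([0] * 50 + [1] * 50)

# Explicit feature map: Z = F1^2 + F2^2 makes the classes linearly separable.
X_mapped = np.column_stack([F1, F2, F1**2 + F2**2])
print(SVC(kernel="linear").fit(X_mapped, y).score(X_mapped, y))  # ~1.0

# An RBF kernel does an equivalent mapping implicitly, without computing Z.
print(SVC(kernel="rbf").fit(X, y).score(X, y))
```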
Support Vector Machines

PROS
• Works very well for small, clean datasets
• Works well when there are clear separation margins
• Very effective in high-dimensional spaces
• Kernels give more flexibility

CONS
• Large datasets require a lot of training time, and performance eventually degrades
• Does not do a good job on noisy data (overlapping classes)
Decision Trees

Algorithm
A decision tree uses entropy and information gain to construct the tree: a top-down, greedy search through the space of possible branches, with no backtracking.

Entropy for one attribute: E(S) = -Σ p_i * log2(p_i)
Steps in Decision Trees

Step 1: Calculate the entropy of the target variable.
Step 2: Calculate the entropy for each branch (split by the various features).
Step 3: Calculate the information gain for each of the above splits.
Step 4: Choose the attribute with the largest information gain as the decision node.
Step 5: Check whether the entropy is zero; if not, continue further.
Step 6: Run recursively on all branches until all data is classified (branch entropy == 0).
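A minimal sketch of steps 1-3 in plain Python/NumPy; the tiny dataset is illustrative:

```python
import numpy as np

def entropy(labels):
    """E(S) = -sum(p_i * log2(p_i)) over the class proportions."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

# Illustrative target and one candidate splitting feature.
target  = np.array(["Yes", "Yes", "Yes", "No", "No", "Yes"])
feature = np.array(["Sunny", "Sunny", "Rain", "Rain", "Rain", "Sunny"])

# Step 1: entropy of the target.
base = entropy(target)

# Step 2: weighted entropy of each branch of the split.
branch = sum((feature == v).mean() * entropy(target[feature == v])
             for v in np.unique(feature))

# Step 3: information gain of splitting on this feature.
print("gain =", base - branch)
```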
Decision Trees

PROS
• Implicitly perform feature selection
• Discover nonlinear relationships
• Not strongly affected by outliers
• Easy to interpret and explain
• Generate rules that can be shared easily

CONS
• Do not work well if the true boundary is smooth
• "Super attributes" (features with many distinct values) yield inflated information gain
• Missing values are ignored
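A minimal scikit-learn sketch on the built-in Iris data; criterion="entropy" matches the information-gain procedure above (sklearn's default is Gini impurity):

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = load_iris(return_X_y=True)

# criterion="entropy" selects splits by information gain, as described above.
tree = DecisionTreeClassifier(criterion="entropy", max_depth=3, random_state=0)
tree.fit(X, y)

# The fitted tree is a set of human-readable rules, easy to share.
print(export_text(tree, feature_names=load_iris().feature_names))
```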
Bias-Variance Tradeoff

BIAS: how well the model fits the data.
VARIANCE: how much the model changes based on changes in the inputs.

Simpler models: stable (low variance), but they don't get close to the truth (high bias).
Complex models: more prone to being overfit (high variance), but expressive enough to get close to the truth (low bias).
Random Forest

Decision Trees + Bagging

01. If the number of cases in the training set is N, sample N cases at random, but with replacement, from the original data. This sample will be the training set for growing the tree.
02. If there are M input variables, a number m << M is specified such that at each node, m variables are selected at random out of the M, and the best split on these m is used to split the node.
03. The value of m is held constant while the forest is grown.
BAGGING
• Also called bootstrap aggregating
• Bagging combines the predictions of multiple similar learners, trained on different resampled datasets, by averaging their predictions
• It reduces variance and helps to avoid overfitting
• Mostly used with decision trees
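A minimal bagging sketch with scikit-learn, wrapping a decision tree as the base learner (in scikit-learn versions before 1.2 the parameter is base_estimator rather than estimator):

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import BaggingClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# 50 trees, each trained on a bootstrap resample of the data;
# their predictions are combined by voting/averaging.
bag = BaggingClassifier(estimator=DecisionTreeClassifier(),
                        n_estimators=50, random_state=0)
bag.fit(X, y)
print(bag.score(X, y))
```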
Features of Random Forests

01. Unexcelled in accuracy among current algorithms.
02. Runs efficiently on large data sets.
03. Can handle thousands of input variables without variable deletion.
04. Implicitly gives estimates of which variables are important in the classification.
05. Has an effective method for estimating missing data, and maintains accuracy when a large proportion of the data is missing.
06. Has methods for balancing error in data sets with unbalanced class populations.
07. Generated forests can be saved for future use on other data.
08. Offers an experimental method for detecting variable interactions.
09. Uses OOB (out-of-bag) samples for error calculation.
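A minimal random forest sketch with scikit-learn, showing the OOB error estimate and variable-importance features listed above:

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

X, y = load_iris(return_X_y=True)

# Each tree sees a bootstrap sample; at each node only a random subset of
# m << M features is considered, as in steps 01-03 above.
forest = RandomForestClassifier(n_estimators=200, oob_score=True, random_state=0)
forest.fit(X, y)

print("OOB accuracy:", forest.oob_score_)            # error estimate without a test set
print("feature importances:", forest.feature_importances_)
```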