MCSE0007: Machine Learning
Naïve Bayes Classification
Classification Problem
Classification consists of assigning a class label
to each tuple in a set of unclassified data.
More precisely, the classification problem can be
stated as follows:
Definition: Classification Problem
Given a database D = {t1, t2, …, tm} of tuples and a set of classes C =
{c1, c2, …, ck}, the classification problem is to define a mapping f : D → C,
where each ti is assigned to one class.
Note that each tuple ti ∈ D is defined by a set of attributes A = {A1, A2, …, An}.
Classification Techniques
⚫ A number of classification techniques are known; they can be broadly
grouped into the following categories:
1. Statistical-Based Methods
   • Regression
   • Bayesian Classifier
2. Distance-Based Classification
   • K-Nearest Neighbours
3. Decision Tree-Based Classification
   • ID3, C4.5, CART
4. Classification using Support Vector Machines (SVM)
5. Classification using Neural Networks (ANN)
Bayesian Classifier
⚫ Principle
⚫ If it walks like a duck, quacks like a duck, then it is probably a duck
Bayesian Classifier …
⚫ A statistical classifier
⚫ Performs probabilistic prediction, i.e., predicts class
membership probabilities
⚫ Foundation
⚫ Based on Bayes’ Theorem.
⚫ Assumptions
1. The classes are mutually exclusive and exhaustive.
2. The attributes are independent given the class.
⚫ Called “Naïve” classifier because of these assumptions.
⚫ Empirically proven to be useful.
⚫ Scales very well.
Bayesian Classifier …
⚫ In many applications, the relationship between the
attribute set and the class variable is non-deterministic.
o In other words, a test instance cannot be assigned a class label
with certainty.
o In such situations, classification can be carried out
probabilistically.
⚫ The Bayesian classifier is an approach to modelling the
probabilistic relationship between the attribute set and the
class variable.
⚫ More precisely, the Bayesian classifier uses Bayes’ Theorem of
probability for classification.
Theory of Probability
Simple Probability
Definition: Simple Probability
If there are n elementary events associated with a random experiment and m of
them are favourable to an event A, then the probability of occurrence of A is
P(A) = m / n
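As a quick illustration (not from the slides), the m/n definition can be checked by enumerating a small sample space in Python; the die example below is assumed.

```python
# Minimal sketch of P(A) = m / n, assuming a fair six-sided die.
outcomes = [1, 2, 3, 4, 5, 6]                     # n = 6 elementary events
event_A = [o for o in outcomes if o % 2 == 0]     # m = 3 events favourable to A = "even"

p_A = len(event_A) / len(outcomes)                # P(A) = m / n
print(p_A)                                        # 0.5
```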
⚫ Suppose, A and B are any two events and P(A), P(B) denote the
probabilities that the events A and B will occur, respectively.
⚫ Mutually Exclusive Events:
⚫ Two events are mutually exclusive, if the occurrence of one precludes the
occurrence of the other.
Examples: tossing a coin (two events: head, tail);
rolling a ludo cube, i.e., a die (six events)
Simple Probability …
⚫ Independent events: Two events are independent if the occurrence of one
does not affect the occurrence of the other.
Example: Tossing a coin and rolling a ludo cube together.
Definition: Joint Probability
If P(A) and P(B) are the probabilities of two events, then
P(A ∪ B) = P(A) + P(B) − P(A ∩ B)
If A and B are mutually exclusive, then P(A ∩ B) = 0
If A and B are independent events, then P(A ∩ B) = P(A) · P(B)
Thus, for mutually exclusive events:
P(A ∪ B) = P(A) + P(B)
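A small sketch (assumed die events, not from the slides) that verifies the addition rule P(A ∪ B) = P(A) + P(B) − P(A ∩ B) by direct enumeration:

```python
# Check the addition rule on one roll of a fair die.
sample_space = set(range(1, 7))
A = {o for o in sample_space if o % 2 == 0}       # "even": {2, 4, 6}
B = {o for o in sample_space if o > 4}            # "greater than 4": {5, 6}

p = lambda E: len(E) / len(sample_space)
print(p(A | B))                                   # P(A ∪ B) = 4/6
print(p(A) + p(B) - p(A & B))                     # also 4/6, by the addition rule
```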
Conditional Probability
Definition: Conditional Probability
If events are dependent, then their probability is expressed by conditional
probability. The probability that A occurs given that B has occurred is denoted
by P(A|B).
Suppose A and B are two events associated with a random experiment. The
probability of A under the condition that B has already occurred, where P(B) ≠ 0,
is given by
P(A|B) = (number of events in B which are favourable to A) / (number of events in B)
       = (number of events favourable to A ∩ B) / (number of events favourable to B)
       = P(A ∩ B) / P(B)
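The two ways of reading this definition (counting within B, or dividing probabilities) can be compared directly; the die events below are assumed for illustration.

```python
# P(A|B) computed from counts within B, and as P(A ∩ B) / P(B).
sample_space = set(range(1, 7))
A = {o for o in sample_space if o % 2 == 0}       # "even": {2, 4, 6}
B = {o for o in sample_space if o > 3}            # "greater than 3": {4, 5, 6}

p = lambda E: len(E) / len(sample_space)
print(len(A & B) / len(B))                        # favourable events in B / events in B = 2/3
print(p(A & B) / p(B))                            # P(A ∩ B) / P(B) = 2/3
```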
Conditional Probability …
Corollary: Conditional Probability
P(A ∩ B) = P(A) · P(B|A), if P(A) ≠ 0
or P(A ∩ B) = P(B) · P(A|B), if P(B) ≠ 0
For three events A, B and C:
P(A ∩ B ∩ C) = P(A) · P(B|A) · P(C|A ∩ B)
For n events A1, A2, …, An, if all events are mutually independent of each other:
P(A1 ∩ A2 ∩ … ∩ An) = P(A1) · P(A2) ⋯ P(An)
Note:
P(A|B) = 0 if the events are mutually exclusive
P(A|B) = P(A) if A and B are independent
In general, P(A|B) · P(B) = P(B|A) · P(A), since P(A ∩ B) = P(B ∩ A)
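A quick numerical check (same assumed die events as above) of the corollary P(A ∩ B) = P(A)·P(B|A) = P(B)·P(A|B), which is exactly the symmetry used by Bayes’ theorem later:

```python
# Verify P(A ∩ B) = P(A)·P(B|A) = P(B)·P(A|B) on the die example.
sample_space = set(range(1, 7))
A, B = {2, 4, 6}, {4, 5, 6}

p = lambda E: len(E) / len(sample_space)
print(p(A & B))                                   # 1/3
print(p(A) * (len(A & B) / len(A)))               # P(A)·P(B|A) = 1/3
print(p(B) * (len(A & B) / len(B)))               # P(B)·P(A|B) = 1/3
```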
Prior and Posterior Probabilities
⚫ P(A) and P(B) are called prior probabilities.
⚫ P(A|B) and P(B|A) are called posterior probabilities.
Example 8.6: Prior versus Posterior Probabilities
    X     Y
    x1    A
    x2    A
    x3    B
    x3    A
    x2    B
    x1    A
    x1    B
    x3    B
    x2    B
    x2    A
⚫ The table shows that the event Y has two outcomes, namely A and B, which
depend on another event X with outcomes x1, x2 and x3.
⚫ Case 1: Suppose we have no information about the event X. Then, from the
given sample space, we can calculate P(Y = A) = 5/10 = 0.5.
⚫ Case 2: Now, suppose we want to calculate P(X = x2 | Y = A) = 2/5 = 0.4.
The latter is the conditional or posterior probability, whereas the former is
the prior probability.
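The two probabilities of the example can be recomputed from the table rows; the sketch below simply counts, as one would by hand.

```python
# Prior P(Y = A) and posterior P(X = x2 | Y = A) from the ten example rows.
data = [("x1", "A"), ("x2", "A"), ("x3", "B"), ("x3", "A"), ("x2", "B"),
        ("x1", "A"), ("x1", "B"), ("x3", "B"), ("x2", "B"), ("x2", "A")]

n_A = sum(1 for _, y in data if y == "A")
print(n_A / len(data))                                          # P(Y = A) = 5/10 = 0.5
print(sum(1 for x, y in data if x == "x2" and y == "A") / n_A)  # P(X = x2 | Y = A) = 2/5 = 0.4
```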
Naïve Bayesian Classifier
⚫ Suppose Y is a class variable and X = {X1, X2, …, Xn} is a set of attributes;
each training tuple pairs an instance of X with a value of Y.
    INPUT (X)              CLASS (Y)
    …                      …
    (x1, x2, …, xn)        yi
    …                      …
⚫ The classification problem can then be expressed as the class-conditional
probability
P(Y = yi | X1 = x1 AND X2 = x2 AND … AND Xn = xn)
Naïve Bayesian Classifier …
⚫ The naïve Bayesian classifier calculates this posterior probability using Bayes’
theorem, as follows.
⚫ From Bayes’ theorem on conditional probability, we have
P(Y|X) = P(X|Y) · P(Y) / P(X)
       = P(X|Y) · P(Y) / [P(X|Y = y1) · P(Y = y1) + … + P(X|Y = yk) · P(Y = yk)]
where
P(X) = Σ_{i=1..k} P(X|Y = yi) · P(Y = yi)
Note:
P(X) is called the evidence (also the total probability), and it is a constant for a given X.
The probability P(Y|X) (also called the class-conditional probability) is therefore
proportional to P(X|Y) · P(Y).
Thus, P(Y|X) can be taken as a measure of Y given X:
P(Y|X) ∝ P(X|Y) · P(Y)
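A hedged sketch of this computation with made-up numbers: the evidence P(X) only rescales the numerators, so the posteriors stay proportional to P(X|Y)·P(Y).

```python
# Posteriors from Bayes' theorem for two classes; priors/likelihoods are assumed values.
priors      = {"y1": 0.6, "y2": 0.4}              # P(Y = yi)
likelihoods = {"y1": 0.2, "y2": 0.5}              # P(X | Y = yi) for the observed X

evidence = sum(likelihoods[y] * priors[y] for y in priors)      # P(X), a constant
posteriors = {y: likelihoods[y] * priors[y] / evidence for y in priors}
print(posteriors)                                 # {'y1': 0.375, 'y2': 0.625}
```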
Naïve Bayesian Classifier …
⚫ Suppose, for a given instance of X, say x = (X1 = x1 AND … AND Xn = xn),
we consider any two class-conditional probabilities, namely P(Y = yi | X = x)
and P(Y = yj | X = x).
⚫ If P(Y = yi | X = x) > P(Y = yj | X = x), then we say that yi is
stronger than yj for the instance X = x.
⚫ The strongest yi is the classification for the instance X = x.
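Because P(X) is common to all classes, the comparison can use the unnormalized products P(X|Y = yi) · P(Y = yi); a one-line version of the decision rule, reusing the assumed numbers above:

```python
# Pick the class with the largest unnormalized posterior P(X|Y = yi) · P(Y = yi).
scores = {"y1": 0.2 * 0.6, "y2": 0.5 * 0.4}
print(max(scores, key=scores.get))                # 'y2'
```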
Naïve Bayesian Classifier …
Naïve Bayes Algorithm (for discrete input attributes) has two phases:
– 1. Learning Phase: Given a training set S,
     For each target value ci (ci = c1, …, cL):
         P̂(C = ci) ← estimate P(C = ci) with examples in S;
         For every attribute value xjk of each attribute Xj (j = 1, …, n; k = 1, …, Nj):
             P̂(Xj = xjk | C = ci) ← estimate P(Xj = xjk | C = ci) with examples in S;
     Output: conditional probability tables; for each Xj, Nj × L elements.
     (Learning is easy: just create the probability tables.)
– 2. Test Phase: Given an unknown instance X′ = (a′1, …, a′n),
     look up the tables to assign the label c* to X′ if
         [P̂(a′1|c*) ⋯ P̂(a′n|c*)] · P̂(c*) > [P̂(a′1|c) ⋯ P̂(a′n|c)] · P̂(c),
         for all c ≠ c*, c = c1, …, cL.
     (Classification is easy: just multiply the probabilities.)
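A self-contained sketch of both phases for discrete attributes. The function names and the toy weather-style training set are illustrative assumptions, not part of the slides; unseen attribute values are handled naively (probability 0).

```python
from collections import Counter, defaultdict

def train_naive_bayes(examples):
    """Learning phase: estimate P(C = ci) and P(Xj = xjk | C = ci) from S."""
    n = len(examples)
    class_counts = Counter(c for _, c in examples)
    priors = {c: class_counts[c] / n for c in class_counts}

    value_counts = defaultdict(Counter)            # (j, ci) -> counts of values xjk
    for x, c in examples:
        for j, v in enumerate(x):
            value_counts[(j, c)][v] += 1
    cond = {key: {v: cnt / sum(counter.values()) for v, cnt in counter.items()}
            for key, counter in value_counts.items()}
    return priors, cond                            # the probability tables

def classify(x, priors, cond):
    """Test phase: return the class maximizing P(c) * prod_j P(xj | c)."""
    best, best_score = None, -1.0
    for c, p_c in priors.items():
        score = p_c
        for j, v in enumerate(x):
            score *= cond[(j, c)].get(v, 0.0)      # unseen value -> 0 in this sketch
        if score > best_score:
            best, best_score = c, score
    return best

# Toy training set S: (attribute tuple, class label) -- assumed for illustration.
S = [(("sunny", "hot"), "no"), (("sunny", "mild"), "no"),
     (("rainy", "mild"), "yes"), (("rainy", "hot"), "yes"),
     (("rainy", "mild"), "yes")]
priors, cond = train_naive_bayes(S)
print(classify(("rainy", "hot"), priors, cond))    # 'yes'
```

In practice one would add Laplace smoothing so that an unseen attribute value does not zero out the whole product.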
Naïve Bayesian Classifier …
• Naïve Bayes is based on the independence assumption
– Training is very easy and fast: it just requires considering each attribute in
each class separately
– Testing is straightforward: just look up the tables, or calculate conditional
probabilities with normal distributions (see the sketch after this list)
• Naïve Bayes is a popular generative classifier model
1. The performance of naïve Bayes is competitive with most state-of-the-art
classifiers, even when the independence assumption is violated
2. It has many successful applications, e.g., spam mail filtering
3. It is a good candidate for a base learner in ensemble learning
4. Apart from classification, naïve Bayes can do more…
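For continuous attributes, the per-class conditionals are typically modelled with one normal distribution per attribute and class, as mentioned above. A usage sketch, assuming scikit-learn is available (the toy data are made up):

```python
from sklearn.naive_bayes import GaussianNB

X = [[1.0, 2.1], [0.9, 1.8], [3.2, 4.0], [3.0, 4.3]]   # toy continuous attributes
y = ["a", "a", "b", "b"]

clf = GaussianNB().fit(X, y)                            # fits a normal per attribute per class
print(clf.predict([[1.1, 2.0], [3.1, 4.1]]))            # ['a' 'b']
```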
Any Questions ?