07 Naive Bayes

Bayesian Classifiers

Rubén Sánchez Corcuera


ruben.sanchez@deusto.es
■ Bayesian classifiers are statistical classifiers based on Bayes’ theorem
● They can predict class membership probabilities such as the probability
that a given tuple belongs to a particular class
■ Class conditional independence

Naïve
● Naïve Bayes classifiers assume that the effect of an attribute value on a
given class is independent of the values of the other attributes.
● This assumption is made to simplify the computations involved, which is why the classifier is considered naïve.

Bayes
■ Studies have found that a simple Bayesian classifier (Naïve Bayes) can compete
with decision trees and some neural networks
■ Bayesian classifiers have also exhibited high accuracy and speed when applied
to large databases.

Bayes Theorem

■ Let B be a data tuple. In Bayesian terms, B is considered “evidence.”


● B is described by measurements on n attributes
■ Let A be some hypothesis:
● The data tuple B belongs to a specified class C.
■ For classification problems, we want to determine P(A|B), the probability
that the hypothesis A holds given the “evidence” or observed data tuple B.
■ In other words, we are looking for the probability that tuple B belongs to
class C, given that we know the attribute description of B.

Bayes Theorem - Posterior probability

■ P(A|B) is the posterior probability, or a posteriori probability, of A conditioned on B.
● For example, suppose our world of data tuples is confined to patients described by the attributes age and blood pressure level, and that B is a 50-year-old patient with high blood pressure. Suppose that A is the hypothesis that our patient will develop heart disease. Then P(A|B) reflects the probability that patient B will develop heart disease given that we know the patient's age and blood pressure level.

Bayes Theorem - Prior probability

■ P(A) is the prior probability, or a priori probability, of A.
● For our example, this is the probability that any given patient develops heart disease, without considering specific factors such as age or blood pressure.
● The posterior probability, P(A|B), is based on more information (e.g., patient information) than the prior probability, P(A), which is independent of B.


Bayes Theorem

■ Similarly, P(B|A) is the posterior probability of B conditioned on A.
● That is, it is the probability that a patient, B, is 50 years old and has high blood pressure, given that we know the patient has heart disease.
■ P(B) is the prior probability of B.
● The probability that a person from our set of patients is 50 years old and has high blood pressure.
■ “How are these probabilities estimated?” P(A), P(B|A), and P(B) may be estimated from the given data, as we shall see next.
■ Bayes’ theorem is useful in that it provides a way of calculating the posterior probability, P(A|B), from P(A), P(B|A), and P(B):

P(A|B) = P(B|A) P(A) / P(B)

Steps for Naïve Bayes

Naïve Bayes: Step 1

■ Let D be a training set of tuples and their associated class labels.
● As usual, each tuple is represented by an n-dimensional attribute vector, X = (x1, x2, . . . , xn), depicting n measurements made on the tuple from n attributes, respectively, A1, A2, . . . , An.

Naïve Bayes: Step 2

■ Suppose that there are m classes, C1, C2, . . . , Cm. Given a tuple, X, the classifier will predict that X belongs to the class having the highest posterior probability, conditioned on X. That is, the naïve Bayesian classifier predicts that tuple X belongs to the class Ci if and only if

P(Ci|X) > P(Cj|X) for 1 ≤ j ≤ m, j ≠ i

■ Thus, we maximize P(Ci|X). The class Ci for which P(Ci|X) is maximized is called the maximum posteriori hypothesis. By Bayes’ theorem,

P(Ci|X) = P(X|Ci) P(Ci) / P(X)


Naïve Bayes: Step 3

■ As P(X) is constant for all classes, only P(X|Ci)P(Ci) needs to be maximized.
● If the class prior probabilities are not known, then it is commonly assumed that the classes are equally likely, that is, P(C1) = P(C2) = … = P(Cm), and we would therefore maximize P(X|Ci). Otherwise, we maximize P(X|Ci)P(Ci).
■ Note that the class prior probabilities may be estimated by P(Ci) = |Ci,D| / |D|, where |Ci,D| is the number of training tuples of class Ci in D and |D| is the total number of training tuples.

Naïve Bayes: Step 4

■ Given datasets with many attributes, it would be extremely computationally expensive to compute P(X|Ci).
■ To reduce computation in evaluating P(X|Ci), the naïve assumption of class-conditional independence is made. This presumes that the attributes’ values are conditionally independent of one another, given the class label of the tuple (i.e., that there are no dependence relationships among the attributes). Thus,

P(X|Ci) = ∏(k=1..n) P(xk|Ci) = P(x1|Ci) × P(x2|Ci) × … × P(xn|Ci)
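As a small illustration of the prior estimate above, the following sketch (my own, not from the slides; the label values are invented) computes P(Ci) = |Ci,D| / |D| from a list of training labels.

    from collections import Counter

    # Hypothetical class labels of the training tuples in D
    labels = ["yes", "yes", "no", "yes", "no"]

    # P(Ci) = |Ci,D| / |D|: fraction of training tuples belonging to class Ci
    priors = {ci: n / len(labels) for ci, n in Counter(labels).items()}
    print(priors)  # {'yes': 0.6, 'no': 0.4}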

Naïve Bayes: Step 4

■ We can easily estimate the probabilities P(x1|Ci), P(x2|Ci), …, P(xn|Ci) from the training tuples.
● Recall that here xk refers to the value of attribute Ak for tuple X. For each attribute, we look at whether the attribute is categorical or continuous-valued.

Naïve Bayes: Step 4 - Categorical

■ If Ak is categorical, then P(xk|Ci) is the number of tuples of class Ci in D having the value xk for Ak, divided by |Ci,D|, the number of tuples of class Ci in D.
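A minimal sketch of this count-based estimate (not part of the slides; the attribute values and labels are invented for illustration):

    # Hypothetical training data: one categorical attribute Ak and the class label
    ak_values = ["high", "low", "high", "medium", "high"]
    labels    = ["yes",  "no",  "yes",  "yes",    "no"]

    def p_xk_given_ci(xk, ci):
        # Number of class-ci tuples with Ak = xk, divided by |Ci,D|
        in_class = [a for a, c in zip(ak_values, labels) if c == ci]
        return in_class.count(xk) / len(in_class)

    print(p_xk_given_ci("high", "yes"))  # 2/3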


Naïve Bayes: Step 4 - Continuous

■ If Ak is continuous-valued, then we need to do a bit more work, but the calculation is fairly straightforward. A continuous-valued attribute is typically assumed to have a Gaussian distribution with mean μ and standard deviation σ, defined by

g(x, μ, σ) = (1 / (√(2π) σ)) exp(−(x − μ)² / (2σ²)),  so that  P(xk|Ci) = g(xk, μCi, σCi)

■ For this formula we only need to compute μCi and σCi, which are the mean and standard deviation, respectively, of the values of attribute Ak for training tuples of class Ci. We then plug these two quantities, together with xk, into the equation to estimate P(xk|Ci).

Naïve Bayes: Step 5

■ To predict the class label of X, P(X|Ci)P(Ci) is evaluated for each class Ci. The classifier predicts that the class label of tuple X is the class Ci if and only if

P(X|Ci)P(Ci) > P(X|Cj)P(Cj) for 1 ≤ j ≤ m, j ≠ i

■ In other words, the predicted class label is the class Ci for which P(X|Ci)P(Ci) is the maximum.
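To make Steps 4 and 5 concrete, here is a short sketch (my own, not from the slides; the per-class statistics and priors are invented) that evaluates the Gaussian P(xk|Ci) for one continuous attribute and then picks the class maximizing P(X|Ci)P(Ci).

    import math

    def gaussian(x, mu, sigma):
        # g(x, mu, sigma) = 1 / (sqrt(2*pi)*sigma) * exp(-(x - mu)^2 / (2*sigma^2))
        return math.exp(-(x - mu) ** 2 / (2 * sigma ** 2)) / (math.sqrt(2 * math.pi) * sigma)

    # Hypothetical per-class statistics for one continuous attribute (e.g., age)
    stats  = {"yes": (52.0, 6.0), "no": (40.0, 8.0)}   # class -> (mu_Ci, sigma_Ci)
    priors = {"yes": 0.4, "no": 0.6}                    # P(Ci)

    x_k = 50.0
    scores = {ci: gaussian(x_k, mu, sigma) * priors[ci] for ci, (mu, sigma) in stats.items()}
    print(max(scores, key=scores.get))  # the class Ci maximizing P(X|Ci) P(Ci)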
Naïve Bayes: Final Thoughts

■ In theory, Bayesian classifiers have the minimum error rate in comparison to all other classifiers.
■ In practice this is not always the case, owing to inaccuracies in the assumptions made for its use, such as class-conditional independence, and the lack of available probability data.
■ Bayesian classifiers are also useful in that they provide a theoretical justification for other classifiers that do not explicitly use Bayes’ theorem.
○ For example, under certain assumptions, it can be shown that many neural network and curve-fitting algorithms output the maximum posteriori hypothesis, as does the naïve Bayesian classifier.

Further reading

■ Section 8.3 in [Han and Kamber, 2006]
■ Extra material: https://scikit-learn.org/stable/modules/naive_bayes.html


Exercise 1

■ Write a script that does the following (a possible solution sketch is given after Exercise 2):
1. loads the iris dataset using sklearn (sklearn.datasets.load_iris)
2. splits the data into training and testing parts using the train_test_split function so that the training set size is 80% of the whole data (also pass the random_state=0 argument to make the result deterministic)
3. uses Gaussian naive Bayes to fit the training data (sklearn.naive_bayes.GaussianNB)
4. predicts the labels of the test data
5. the function should return the accuracy score of the prediction performance (sklearn.metrics.accuracy_score)

Exercise 2

■ We are going to try to classify previously unseen words into their proper language. We are going to work with Spanish and Finnish.
■ Having only the words in each language, how would you do it? Remember, we are going to classify unseen words!
■ Let's prepare what we need:
○ Download the Spanish.txt and Suomi.txt datasets from ALUD.
○ Open the Naïve Bayes colab in ALUD.
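One possible solution sketch for Exercise 1 (my own, using only the sklearn functions named in the exercise):

    from sklearn.datasets import load_iris
    from sklearn.model_selection import train_test_split
    from sklearn.naive_bayes import GaussianNB
    from sklearn.metrics import accuracy_score

    def iris_naive_bayes():
        # 1. load the iris dataset
        X, y = load_iris(return_X_y=True)
        # 2. 80/20 train/test split, deterministic thanks to random_state=0
        X_train, X_test, y_train, y_test = train_test_split(
            X, y, train_size=0.8, random_state=0)
        # 3. fit Gaussian naive Bayes on the training data
        model = GaussianNB().fit(X_train, y_train)
        # 4. predict labels of the test data
        y_pred = model.predict(X_test)
        # 5. return the accuracy of the prediction
        return accuracy_score(y_test, y_pred)

    print(iris_naive_bayes())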
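One possible approach for Exercise 2 (a hedged sketch, not the official solution): represent each word by its character n-grams and train a multinomial naïve Bayes model on them. The code assumes Spanish.txt and Suomi.txt contain one word per line, which may not match the actual file format.

    from sklearn.feature_extraction.text import CountVectorizer
    from sklearn.naive_bayes import MultinomialNB
    from sklearn.pipeline import make_pipeline

    # Assumption: one word per line in each file
    def load_words(path):
        with open(path, encoding="utf-8") as f:
            return [line.strip() for line in f if line.strip()]

    spanish = load_words("Spanish.txt")
    finnish = load_words("Suomi.txt")
    words  = spanish + finnish
    labels = ["spanish"] * len(spanish) + ["finnish"] * len(finnish)

    # Character 1-2-grams capture letter patterns typical of each language
    model = make_pipeline(
        CountVectorizer(analyzer="char", ngram_range=(1, 2)),
        MultinomialNB())
    model.fit(words, labels)

    print(model.predict(["perro", "järvi"]))  # classify unseen words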
Do you have any questions?
ruben.sanchez@deusto.es

Thanks!
