
Naïve Bayes

Lương Thái Lê
Course Outline

1. Introduction to probability

2. Naïve Bayes Theorem

3. Naïve Bayes Algorithm

4. Naïve Bayes Examples

5. Naïve Bayes Problems

6. Naïve Bayes Conclusion


Basic concepts of probability
• Suppose we have an experiment (e.g. rolling a die) whose outcome is random
• Space of possibilities S: the set of all possible outcomes
• Eg: S = {1,2,3,4,5,6} for the die-rolling experiment
• An event E: a subset of S
• Eg: E = {1,3,5}
• Event space W: the set of all possible events, i.e. the power set P(S)
• Random variable A: a variable that represents an event; there is a degree of
probability that this event will occur
• Eg: A = “The number of dots is odd when the die is rolled”
Probability representation
• P(A) = the fraction of the event space (W) in which A is true
Some Properties
• 0 ≤ P(A) ≤ 1
• P(not A) = 1 − P(A)
• P(A ∨ B) = P(A) + P(B) − P(A ∧ B)
• Suppose a random variable A can take one of k (> 2) values
{v1, v2, …, vk}, then:
• P(A = vi ∧ A = vj) = 0 if i ≠ j
• P(A = v1 ∨ A = v2 ∨ … ∨ A = vk) = Σ_{i=1}^{k} P(A = vi) = 1
• P(B ∧ [A = v1 ∨ A = v2 ∨ … ∨ A = vm]) = Σ_{i=1}^{m} P(B ∧ A = vi)
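A minimal Python sketch (not part of the slides) that checks these properties on the die-rolling example above; the events A and B below are illustrative choices:

# Checking basic probability properties for a fair die, S = {1,...,6}.
from fractions import Fraction

S = {1, 2, 3, 4, 5, 6}

def prob(event):
    """P(E) = |E| / |S| for equally likely outcomes."""
    return Fraction(len(event & S), len(S))

A = {1, 3, 5}   # "the number of dots is odd"
B = {4, 5, 6}   # "the number of dots is greater than 3" (illustrative event)

assert prob(S - A) == 1 - prob(A)                      # P(not A) = 1 - P(A)
assert prob(A | B) == prob(A) + prob(B) - prob(A & B)  # P(A or B) = P(A)+P(B)-P(A and B)
assert sum(prob({v}) for v in S) == 1                  # probabilities of all values sum to 1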
Conditional Probability
• P(A|B) is the part of the space W in which A is true, provided
that B is true
• Eg:
A: “I will play football tomorrow”
B: “It won’t rain tomorrow”
=> P(A|B) is the probability that I will play football tomorrow if it
won’t rain
• Let P(A ∧ B) = P(A,B), then
P(A|B) = P(A,B) / P(B)
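A small Python sketch (not from the slides) of the definition P(A|B) = P(A,B)/P(B), reusing the die example; the events are illustrative:

from fractions import Fraction

S = {1, 2, 3, 4, 5, 6}
A = {1, 3, 5}   # "odd number of dots"
B = {4, 5, 6}   # "more than 3 dots"

def prob(event):
    return Fraction(len(event & S), len(S))

p_A_given_B = prob(A & B) / prob(B)   # P(A,B) / P(B)
print(p_A_given_B)                    # 1/3: only the outcome 5 is both odd and > 3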
Probability independent variables (1)
• Two events (random variables) A and B are said to be probabilistically
independent if the probability of event A is the same whether:
• B happens
• B does not happen
• Eg:
• A: I will go swimming tomorrow
• B: Long will go swimming tomorrow

P(A|B) = P(A)
Probability independent variables (2)
Conditional independence (more than two variables)
• P(A|B,C) is the probability of A when B and C are known
• Two variables A and C are said to be conditionally independent
given variable B if the probability of A given B is equal to
the probability of A given both B and C:
P(A|B,C) = P(A|B)
• Eg:
A: I will play football tomorrow
B: The football match will take place indoors
C: It will rain tomorrow
Important Rules of Probability
• Chain rule:
• P(A,B) = P(A|B) P(B) = P(B|A) P(A)
• P(A|B) = P(A,B)/P(B) = P(B|A).P(A)/P(B)
• P(A,B|C) = P(A,B,C)/P(C) = P(A|B,C).P(B,C)/P(C)
= P(A|B,C).P(B|C)
• Probabilistic independence and conditional independence
• P(A|B) = P(A); if A and B are probabilistically independent
• P(A,B|C) = P(A|C).P(B|C); if A and B are conditionally
independent given C
• P(A1,…,An|C) = P(A1|C)…P(An|C); if the Ai are conditionally
independent given C
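As an illustration (not from the slides), the chain rule can be checked numerically on a small, made-up joint distribution over two binary variables:

from fractions import Fraction

# Made-up joint probabilities P(A=a, B=b) for two binary variables.
joint = {
    (True, True):   Fraction(3, 10),
    (True, False):  Fraction(2, 10),
    (False, True):  Fraction(1, 10),
    (False, False): Fraction(4, 10),
}

def p_B(b):
    return sum(p for (_, bb), p in joint.items() if bb == b)

def p_A_given_B(a, b):
    return joint[(a, b)] / p_B(b)

# Chain rule: P(A,B) = P(A|B) P(B) for every combination of values.
for (a, b), p_ab in joint.items():
    assert p_ab == p_A_given_B(a, b) * p_B(b)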

Bayes Theorem

P(h|D) = P(D|h) · P(h) / P(D)

• P(h): the prior probability of the (categorical) hypothesis h
• P(D): the prior probability of observing the data D
• P(D|h): the conditional probability of observing the data D, given that the
hypothesis (category) h is true
• P(h|D): the conditional probability of the (categorical) hypothesis h being
true, given that the data D is observed
=> Probabilistic classification methods use this conditional probability
(called the posterior probability), as in the numeric sketch below
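A minimal numeric sketch of Bayes' theorem; the prior and likelihood values are illustrative assumptions, not taken from the slides:

# P(h|D) = P(D|h) P(h) / P(D), with P(D) = P(D|h) P(h) + P(D|not h) P(not h).
p_h = 0.6               # assumed prior P(h), e.g. "he plays tennis"
p_D_given_h = 0.2       # assumed likelihood P(D|h)
p_D_given_not_h = 0.05  # assumed likelihood P(D|not h)

p_D = p_D_given_h * p_h + p_D_given_not_h * (1 - p_h)  # total probability of the data
p_h_given_D = p_D_given_h * p_h / p_D                  # posterior
print(round(p_h_given_D, 3))                           # 0.857 for these assumed values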
Bayes Theorem – Example (1)
• Suppose we have this data set
Bayes Theorem – Example (2)
• Data D: Outlook is Sunny and Wind is Strong
• Categorical hypothesis h: He plays tennis
• The prior probability P(h): probability that he plays tennis (regardless of
Outlook and Wind)
• The prior probability P(D): probability that Outlook is Sunny and
Wind is Strong
• P(D|h): probability that Outlook is Sunny and Wind is Strong, given that he
plays tennis
• P(h|D): probability that he plays tennis, given that Outlook is Sunny and Wind
is Strong
Maximum a Posteriori – MAP
• Given a set of possible hypotheses (target classes) H, the learning
system will find the most probable hypothesis h(∈H) for the observed
data D
• This hypothesis h is called the maximum a posteriori (MAP) hypothesis:

hMAP = argmax_{h∈H} P(h|D)
     = argmax_{h∈H} P(D|h) · P(h) / P(D)   (Bayes theorem)
     = argmax_{h∈H} P(D|h) · P(h)          (P(D) is the same for all h)
MAP - Example
• The set H includes 2 hypotheses:
• h1: He plays tennis
• h2: He does not play tennis
• Calculate the 2 conditional probabilities: P(h1|D), P(h2|D)
• If P(h1|D) > P(h2|D) then hMAP = h1,
else hMAP = h2
• Because P(D) = P(D,h1) + P(D,h2) is the same for both hypotheses h1
and h2, P(D) can be ignored.
• So if P(D|h1).P(h1) > P(D|h2).P(h2) then he plays tennis, else he
doesn't
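A short sketch of this MAP decision rule; the prior and likelihood numbers are illustrative assumptions, not values from the slide's data set:

# h_MAP = argmax_h P(D|h) * P(h); P(D) is ignored because it is the same for all h.
priors = {"plays": 9/14, "does_not_play": 5/14}       # assumed P(h)
likelihoods = {"plays": 0.05, "does_not_play": 0.10}  # assumed P(D|h)

h_map = max(priors, key=lambda h: likelihoods[h] * priors[h])
print(h_map)   # "does_not_play" for these assumed numbers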
Maximum Likelihood Estimation (MLE)
• Suppose that all hypotheses have the same prior probability:
P(hi) = P(hj), ∀ hi, hj ∈ H
• The MLE method finds the hypothesis that maximizes the value
P(D|h), where P(D|h) is called the likelihood of the data D for h
• The maximum likelihood hypothesis:
hMLE = argmax_{h∈H} P(D|h)
MLE - Example
• The set H includes 2 hypotheses:
• h1: He plays tennis
• h2: He does not play tennis
• D: the data (days) in which the Outlook attribute is Sunny and the Wind is
Strong
• Find P(D|h1) and P(D|h2):
• P(Outlook=Sunny, Wind=Strong|h1) = 1/8
• P(Outlook=Sunny, Wind=Strong|h2) = 1/4
=> hMLE = h2 => He does not play tennis
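The same comparison in code, using only the likelihoods quoted above (MLE ignores the priors):

likelihoods = {"plays": 1/8, "does_not_play": 1/4}   # P(D|h) as quoted on the slide
h_mle = max(likelihoods, key=likelihoods.get)
print(h_mle)   # "does_not_play"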

Naïve Bayes Classification – Idea (1)
• Classification problem:
• Input:
• A training data set D = {(x(i), c(j))}; i = 1,…,m; j = 1,…,k, where:
• x: a training example, represented by an n-dimensional vector (x1, x2, …, xn)
• c: a class label from the target class set C = {c1, c2, …, ck}
• A test example z that does not belong to D
• Output:
• A classification model F
• The class that F determines z belongs to
• Motivation: find cMAP, the most suitable class for z:
cMAP = argmax_{ci∈C} P(ci|z1, z2, …, zn)
Naïve Bayes Classification – Idea (2)
• Because P(z1, z2, …, zn) is the same for every class ci, we instead find:
cMAP = argmax_{ci∈C} P(z1, z2, …, zn|ci) P(ci)
• Assumption of the Naïve Bayes classifier: the attributes are conditionally
independent given the class:
P(z1, z2, …, zn|ci) = ∏_{j=1}^{n} P(zj|ci)
• Naïve Bayes finds the most likely class for z:
cNB = argmax_{ci∈C} P(ci) ∏_{j=1}^{n} P(zj|ci)
Naïve Bayes Classification – Alg
• Training phase: from the training examples x = (x1, x2, …, xn)
• Calculate P(ci) for each class ci ∈ C
• Calculate P(xj|ci) for each attribute value xj of vector x
• Classification phase: for a new example z = (z1, z2, …, zn)
• Calculate P(ci) ∏_{j=1}^{n} P(zj|ci) for each class ci
• Find the most suitable class c* for z (see the code sketch below):
c* = argmax_{ci∈C} P(ci) ∏_{j=1}^{n} P(zj|ci)
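A minimal Python sketch of this training/classification procedure for categorical attributes; the function names and the tiny weather data set are made up for illustration and are not part of the slides:

from collections import Counter, defaultdict

def train(examples):
    """examples: list of (attribute_dict, class_label). Returns P(ci) and the counts for P(xj|ci)."""
    class_counts = Counter(label for _, label in examples)
    priors = {c: n / len(examples) for c, n in class_counts.items()}
    cond_counts = defaultdict(Counter)   # cond_counts[(class, attribute)][value] = count
    for attrs, label in examples:
        for attr, value in attrs.items():
            cond_counts[(label, attr)][value] += 1
    return priors, cond_counts, class_counts

def classify(z, priors, cond_counts, class_counts):
    """Return c* = argmax_ci P(ci) * prod_j P(zj|ci)."""
    scores = {}
    for c in priors:
        score = priors[c]
        for attr, value in z.items():
            score *= cond_counts[(c, attr)][value] / class_counts[c]   # P(zj|ci)
        scores[c] = score
    return max(scores, key=scores.get)

# Usage with a made-up weather data set:
data = [({"Outlook": "Sunny", "Wind": "Strong"}, "No"),
        ({"Outlook": "Sunny", "Wind": "Weak"},   "Yes"),
        ({"Outlook": "Rain",  "Wind": "Weak"},   "Yes"),
        ({"Outlook": "Rain",  "Wind": "Strong"}, "No")]
priors, cond_counts, class_counts = train(data)
print(classify({"Outlook": "Sunny", "Wind": "Weak"}, priors, cond_counts, class_counts))  # "Yes"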

Naïve Bayes Classification – Example (1)
Would a young student with a medium income and a fair credit rating buy a computer?
Naïve Bayes Classification – Example (2)
• Problem modeling:
• z = (Age=Young, Income=Medium, Student=Yes, Credit_Rating=Fair)
• There are 2 classes: c1 (Buys computer); c2 (Does not buy computer)
• Calculate the prior probabilities
• P(c1) = 9/14
• P(c2) = 5/14
• Calculate the probability of each attribute value for each class
Naïve Bayes Classification – Example (3)
• Calculate the probability (likelihood) of the example z for each class ci:
• P(z|c1) = 0.044
• P(z|c2) = 0.019
• Determine the most probable class:
• P(c1) P(z|c1) = (9/14) × 0.044 = 0.028
• P(c2) P(z|c2) = (5/14) × 0.019 = 0.007

=> Conclusion: He (z) will buy a computer
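The final step can be reproduced in a few lines; the priors and the likelihood values 0.044 and 0.019 are those quoted on the slide (the underlying data table is not reproduced here):

priors = {"buys": 9/14, "does_not_buy": 5/14}
likelihood_of_z = {"buys": 0.044, "does_not_buy": 0.019}   # P(z|ci) from the slide

scores = {c: priors[c] * likelihood_of_z[c] for c in priors}
print({c: round(s, 3) for c, s in scores.items()})   # {'buys': 0.028, 'does_not_buy': 0.007}
print(max(scores, key=scores.get))                    # "buys" -> he will buy a computer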



Naïve Bayes Classification – Problems (1)
• If no training example associated with class ci has the attribute value zj, then:
P(zj|ci) = 0 ⇒ P(ci) ∏_{j=1}^{n} P(zj|ci) = 0
• Solution: use a Bayesian (m-estimate) approach to estimate P(zj|ci), as sketched below:
P(zj|ci) = (n(ci, zj) + m·p) / (n(ci) + m)
• n(ci) = number of training examples associated with ci
• n(ci, zj) = number of training examples associated with ci that have attribute value zj
• p: a prior estimate of the probability value P(zj|ci)
=> p = 1/k if the attribute has k possible values
• m: a chosen weight
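A small sketch of this m-estimate; the function name and the example counts are illustrative, not from the slides:

def m_estimate(n_ci_zj, n_ci, k, m=1.0):
    """Smoothed estimate of P(zj|ci) for an attribute with k possible values (p = 1/k)."""
    p = 1.0 / k
    return (n_ci_zj + m * p) / (n_ci + m)

# Even when the value zj never occurs with class ci, the estimate stays above zero:
print(m_estimate(n_ci_zj=0, n_ci=9, k=3, m=3))   # 0.0833... instead of 0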
Naïve Bayes Classification – Problems (2)
• Limited precision of computer arithmetic (floating-point underflow)
• Each P(zj|ci) < 1, so if the number of attributes n is large then:
lim_{n→∞} ∏_{j=1}^{n} P(zj|ci) = 0
• Solution: use the logarithm of the probability values, as sketched below
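A short sketch (not from the slides) of the log-space trick: log(P(ci) ∏_j P(zj|ci)) = log P(ci) + Σ_j log P(zj|ci), and the argmax is unchanged. The numbers below are illustrative:

import math

def log_score(prior, cond_probs):
    """prior = P(ci); cond_probs = [P(z1|ci), ..., P(zn|ci)], assumed already smoothed (> 0)."""
    return math.log(prior) + sum(math.log(p) for p in cond_probs)

# With many attributes the plain product underflows to 0.0, but the log score does not:
many_small_probs = [0.01] * 200
print(math.prod(many_small_probs))        # 0.0 (underflow)
print(log_score(0.5, many_small_probs))   # about -921.7, still usable for the argmax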

Naïve Bayes Classification – Conclusion
• One of the most commonly used machine learning methods in practice
• Although it assumes that the attributes are conditionally independent given the
class, the Naïve Bayes classifier still achieves good classification results
in many practical applications
• When to use?
• The training data set has a medium or large size
• Examples are represented by a large number of attributes
• Attributes are conditionally independent given the target classes
