Lecture 11

Bayesian Decision Theory

Primary source of reference: Pattern Classification by Duda and Hart
An Example
• "Sorting incoming fish on a conveyor according to species using optical sensing"
• Species: sea bass (Class 1) or salmon (Class 2)
• Problem Analysis
  – Set up a camera and take some sample images to extract features such as:
    • Length of the fish
    • Lightness (based on the gray level)
    • Width of the fish
This is a linear classifier, like the Perceptron.
Introduction
• The sea bass/salmon example (a two-class problem)
  – For example, suppose we randomly catch 100 fish and 75 of them are sea bass and 25 are salmon.
  – Let the rule in this case be: for any fish, say its class is sea bass.
  – What is the error rate of this rule? (Here it is 0.25, since every salmon is misclassified.)
  – This information, which is independent of feature values, is called a priori knowledge.
• Let the two classes be ω1 and ω2
  – P(ω1) + P(ω2) = 1
  – The state of nature (class) is a random variable
  – If P(ω1) = P(ω2), we say the priors are uniform
    • The catch of salmon and sea bass is equally probable
• Decision rule with only the prior information
  – Decide ω1 if P(ω1) > P(ω2); otherwise decide ω2
• This is not a good classifier.
• We should take feature values into account!
• If x is the pattern we want to classify, then use the rule:

  If P(ω1 | x) > P(ω2 | x), then assign class ω1;
  else assign class ω2.

• P(ω1 | x) is called the posterior probability of class ω1 given that the pattern is x.
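As a small illustration of the two rules above (the prior-only rule versus the posterior-based rule), here is a minimal Python sketch. The function names are mine, and the posterior values are simply assumed to be available; how to obtain them via Bayes rule is the subject of the next slides.

```python
# Minimal sketch of the two decision rules described above.
# The posterior values are assumed to be given here (hypothetical arguments).

def decide_with_priors(p_w1, p_w2):
    # Prior-only rule: always pick the class with the larger prior.
    return "w1" if p_w1 > p_w2 else "w2"

def decide_with_posteriors(p_w1_given_x, p_w2_given_x):
    # Posterior-based rule: pick the class with the larger posterior P(class | x).
    return "w1" if p_w1_given_x > p_w2_given_x else "w2"

print(decide_with_priors(0.75, 0.25))       # 'w1' for every pattern
print(decide_with_posteriors(0.4, 0.6))     # 'w2' for this particular x
```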
Bayes rule
• From data it might be possible for us to estimate p(x | ωj), where j = 1 or 2. These are called class-conditional distributions.
• It is also easy to find the a priori probabilities P(ω1) and P(ω2). How can this be done? (For example, from the fraction of each class in a random sample.)
• Bayes rule combines the a priori probabilities with the class-conditional distributions to find the posterior probabilities.
Bayes Rule

             P(A, B)     P(A|B) * P(B)
  P(B|A) = ----------- = ---------------
              P(A)            P(A)

This is Bayes Rule.

Bayes, Thomas (1763). An essay towards solving a problem in the doctrine of chances. Philosophical Transactions of the Royal Society of London, 53:370-418.
               p(x | ωj) . P(ωj)
  P(ωj | x) = -------------------
                     p(x)

  – where, in the case of two categories,

    p(x) = Σ (j = 1 to 2) p(x | ωj) P(ωj)

                  Likelihood . Prior
  – Posterior = ----------------------
                       Evidence
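As a minimal sketch of the formula above (the function and variable names are mine, not from the lecture), the two-class posterior can be computed directly from the likelihood and prior values:

```python
# Minimal sketch: P(w_j | x) = p(x | w_j) P(w_j) / p(x) for two categories,
# where p(x) = sum_j p(x | w_j) P(w_j) is the evidence.

def posteriors(likelihoods, priors):
    """likelihoods[j] = p(x | w_j) at the observed x; priors[j] = P(w_j)."""
    evidence = sum(likelihoods[j] * priors[j] for j in likelihoods)  # p(x)
    return {j: likelihoods[j] * priors[j] / evidence for j in likelihoods}

# Illustrative numbers only (not from the lecture):
print(posteriors({"w1": 0.6, "w2": 0.2}, {"w1": 0.75, "w2": 0.25}))
# -> {'w1': 0.9, 'w2': 0.1}; the posteriors always sum to 1.
```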
• Decision given the posterior probabilities

  x is an observation for which:
    if P(ω1 | x) > P(ω2 | x), decide that the true state of nature is ω1
    if P(ω1 | x) < P(ω2 | x), decide that the true state of nature is ω2

  Therefore, whenever we observe a particular x, the probability of error is:
    P(error | x) = P(ω1 | x) if we decide ω2
    P(error | x) = P(ω2 | x) if we decide ω1
• Minimizing the probability of error
• Decide ω1 if P(ω1 | x) > P(ω2 | x); otherwise decide ω2

  Therefore:
    P(error | x) = min [P(ω1 | x), P(ω2 | x)]
    (the error of the Bayes decision)
Average error rate
The average probability of error, P(error), is:

  P(error) = ∫ P(error | x) p(x) dx

This is the expected value of P(error | x) w.r.t. x, i.e., E_x[P(error | x)].
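The integral above can be approximated numerically. The sketch below does this for two assumed Gaussian class-conditional densities; all numeric values (means, priors, integration range) are illustrative assumptions, not numbers from the lecture. It uses P(error | x) = min[P(ω1 | x), P(ω2 | x)] from the previous slide.

```python
# Numerical sketch of P(error) = ∫ min[P(w1|x), P(w2|x)] p(x) dx.
# The unit-variance Gaussian class-conditionals below are assumed for illustration.
from math import exp, pi, sqrt

priors = {"w1": 0.75, "w2": 0.25}
means  = {"w1": 4.0,  "w2": 6.0}

def p_x_given(label, x):
    # Assumed class-conditional density p(x | w_label): Gaussian, unit variance.
    return exp(-0.5 * (x - means[label]) ** 2) / sqrt(2.0 * pi)

def bayes_error(lo=-5.0, hi=15.0, n=20000):
    dx = (hi - lo) / n
    total = 0.0
    for i in range(n):
        x = lo + (i + 0.5) * dx
        joint = {c: p_x_given(c, x) * priors[c] for c in priors}  # p(x|w_c) P(w_c)
        # min posterior * evidence == min joint, so the evidence cancels here.
        total += min(joint.values()) * dx
    return total

print(bayes_error())   # the Bayes error: a lower bound for this assumed model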
• Consider a one-dimensional two-class problem. The feature used is the color of the fish; the color can be either white or dark. P(ω1) = 0.75, P(ω2) = 0.25.

• But what is the error if we use only the a priori probabilities?

• Same error? Where is the advantage?!

• But P(error) based on the a priori probabilities only is 0.5.
• The error of the Bayes classifier is the lower bound.
  – Any classifier's error is greater than or equal to this, since for every x the conditional error of any decision rule is at least min [P(ω1 | x), P(ω2 | x)].
• One can prove this!
• Consider a one-dimensional two-class problem. The feature used is the color of the fish; the color can be either white or dark. P(ω1) = 0.75, P(ω2) = 0.25.
• Can you solve this?
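A possible way to work this out is sketched below. The priors P(ω1) = 0.75 and P(ω2) = 0.25 come from the problem statement, but the class-conditional probabilities of the color feature are not preserved in this text, so the values used here are assumed purely for illustration; the numbers worked out on the original slides may differ.

```python
# Sketch of the one-dimensional white/dark example. Priors are from the slide;
# the class-conditional probabilities P(color | class) are assumed values.

priors = {"w1": 0.75, "w2": 0.25}              # sea bass, salmon
likelihood = {                                  # assumed, for illustration only
    "w1": {"white": 0.8, "dark": 0.2},
    "w2": {"white": 0.3, "dark": 0.7},
}

def posteriors(color):
    evidence = sum(likelihood[c][color] * priors[c] for c in priors)  # P(color)
    return {c: likelihood[c][color] * priors[c] / evidence for c in priors}, evidence

bayes_error = 0.0
for color in ("white", "dark"):
    post, evidence = posteriors(color)
    decision = max(post, key=post.get)
    bayes_error += min(post.values()) * evidence
    print(color, "-> decide", decision, post)

print("Bayes error:", bayes_error)   # compare with the prior-only error of 0.25
```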
A priori probabilities play an important role.
This is knowledge about the domain.
Example
• Given the height of a person, we wish to classify whether he/she is from India or Nepal.
• We assume that there are no other classes. (Each and every person belongs either to the class "India" or to the class "Nepal".)
• For the time being, assume that we have only height. (Only one feature.)
Example: continued …
• Let h be the height and c be the class of a person.
• Let the height be discretized as 2.0, 2.5, 3.0, 3.5, 4.0, 4.5, 5.0, 5.5, 6.0, 6.5, 7.0, 7.5, 8.0.
• If the height is 5.6, we round it to 5.5.
• We randomly took 100 people who are all Nepalis. For each height value, we counted how many people there are.
Example: continued
• If we randomly take 100 Nepalis, their heights are distributed as below.
• From the counts we found probabilities (these are approximate probability values; a short sketch of this estimation follows the table).
• These probabilities are called class-conditional probabilities, i.e., P(h | Nepal).
• For example, P(h = 3.5 | class = Nepal) = 0.1

Height       2    2.5   3     3.5   4     4.5   5     5.5   6     6.5   7     7.5   8
Count        0    1     5     10    10    25    25    10    10    4     0     0     0
Probability  0    0.01  0.05  0.1   0.1   0.25  0.25  0.1   0.1   0.04  0     0     0
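The estimation described above can be sketched in a few lines: count how many of the 100 sampled Nepalis fall at each discretized height and divide by the sample size. The code below uses the counts from the table; the variable names are mine.

```python
# Minimal sketch: estimating the class-conditional probabilities P(h | Nepal)
# from the counts in the table above (100 randomly sampled Nepalis).

heights = [2.0, 2.5, 3.0, 3.5, 4.0, 4.5, 5.0, 5.5, 6.0, 6.5, 7.0, 7.5, 8.0]
counts  = [0,   1,   5,   10,  10,  25,  25,  10,  10,  4,   0,   0,   0]

total = sum(counts)                                   # 100 people in the sample
p_h_given_nepal = {h: c / total for h, c in zip(heights, counts)}

print(p_h_given_nepal[3.5])                           # 0.1, i.e. P(h = 3.5 | Nepal)
```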
Class-conditional Distribution
• Class-conditional distribution for Nepalis
[Bar chart: probability (0 to 0.25) versus height (2 to 8), plotting the values in the table above]
Example: continued …
• Similarly, we randomly took 100 persons who are Indians and found their respective class-conditional probabilities.
[Bar chart: class-conditional distribution for the class "India", probability versus height (2 to 8)]
Example: continued …
• So you took these probabilities to IIIT Sri City.
• You are asked to classify a student whose height is 4.5.
• You searched the tables and found that P(4.5 | "Nepal") = 0.25 and P(4.5 | "India") = 0.1.
• So, you declared that the person is a Nepali.
• … Somewhere … something is wrong …!
Example: continued …
• The security person at the gate who is watching you says, in a surprised tone, "Sir, don't you know that in our college we have only Indians and there are no Nepalis?"
• This is what is called prior knowledge.
• If you randomly take 100 people and 50 of them are Indians and 50 of them are Nepalis, then the rule you applied is correct.
  – In IIITS, if you randomly take 100 students, all of them will be Indians… so this rule is incorrect!
Example: continued …
• Actually, you need to find out P(Nepal | height = 4.5) and P(India | height = 4.5), and classify accordingly.
• These are called posterior probabilities.
Posterior Probability: Bayes Rule
• P(class = Nepal | height = 4.5)

       P(height = 4.5 | class = Nepal) P(Nepal)
    = ------------------------------------------
                  P(height = 4.5)

• Here, P(Nepal) is the prior probability.
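A minimal sketch of this computation follows. The likelihoods P(4.5 | Nepal) = 0.25 and P(4.5 | India) = 0.1 are from the tables above; the priors are assumed for illustration (the story only tells us that almost everyone at IIIT Sri City is Indian), so the exact numbers are hypothetical.

```python
# Sketch: posterior P(class | height = 4.5) via Bayes rule.
# Likelihoods are from the slides; the priors below are assumed for illustration.

likelihood_45 = {"Nepal": 0.25, "India": 0.10}   # P(height = 4.5 | class)
priors        = {"Nepal": 0.01, "India": 0.99}   # assumed, not from the slides

evidence = sum(likelihood_45[c] * priors[c] for c in priors)   # P(height = 4.5)
posterior = {c: likelihood_45[c] * priors[c] / evidence for c in priors}

print(posterior)                           # roughly {'Nepal': 0.025, 'India': 0.975}
print(max(posterior, key=posterior.get))   # 'India': the prior reverses the decision
```

With equal priors the likelihoods alone would favour Nepal; a strong prior for India flips the decision, which is exactly the point of the security person's remark.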


RELATIONSHIP BETWEEN K-NNC AND THE BAYES CLASSIFIER

