MODULE 4
CHAPTER 8
BAYESIAN LEARNING
 8.1 INTRODUCTION TO PROBABILITY-BASED
 LEARNING
• Probability-based learning is one of the most important practical learning methods; it combines prior knowledge (prior probabilities) with observed data.
• Probabilistic learning uses the concept of probability theory that
  describes how to model randomness, uncertainty, and noise to predict
  future events.
• It is a tool for modelling large datasets and uses Bayes rule to infer
  unknown quantities, predict and learn from data.
• In a probabilistic model, randomness plays a major role, so the solution is a probability distribution over possible outcomes; in a deterministic model there is no randomness, so the same initial conditions produce the same single outcome every time the model is run.
• Bayesian learning differs from probabilistic learning in that it uses subjective probabilities (i.e., probabilities based on an individual’s belief or interpretation about the outcome of an event, which can change over time) to infer the parameters of a model.
• Two practical learning algorithms called Naïve Bayes learning and
  Bayesian Belief Network (BBN) form the major part of Bayesian
  learning. These algorithms use prior probabilities and apply Bayes rule
  to infer useful information.
• Bayesian Learning is a learning method that describes and represents knowledge in an uncertain domain and provides a way to reason about this knowledge using probability measures.
• It uses Bayes theorem to infer the unknown parameters of a model.
• Bayesian inference is useful in many applications that involve reasoning and diagnosis, such as game theory, medicine, etc. Bayesian inference is particularly powerful for handling missing data and for estimating the uncertainty in predictions.
For Understanding
 • The prior probability is the probability assigned to an event before
   the arrival of some information that makes it necessary to revise
   the assigned probability.
 • The revision of the prior is carried out using Bayes' rule. The new
   probability assigned to the event after the revision is called posterior
   probability.
 What is prior probability in Naive Bayes?
 • The probability of each class before any features are observed is known as the prior probability in the Naive Bayes method.
 • Posterior probability = prior probability + new data, i.e., the prior revised in light of the newly observed evidence.
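 • As a small illustration of this revision, the sketch below applies Bayes' rule with hypothetical numbers (the values are made up purely for illustration, not taken from the slides).

```python
# Hypothetical numbers, chosen only to illustrate how a prior is revised.
prior = 0.3        # P(h): probability assigned to the event before new information
likelihood = 0.8   # P(E | h): probability of observing the evidence if h holds
evidence = 0.5     # P(E): overall probability of the evidence

posterior = likelihood * prior / evidence   # Bayes' rule
print(posterior)   # 0.48 -- the revised (posterior) probability of h
```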
For Understanding
 What is likelihood probability in Machine Learning with example?
 • In simple words, as the name suggests, the likelihood is a function that tells us how likely a specific data point is under the assumed data distribution.
 For example,
 • Suppose there are two data points in the dataset. If the likelihood of the first data point is greater than that of the second, the first point fits the assumed distribution better (see the sketch below).
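 • A minimal sketch, assuming a hypothetical normal distribution N(5, 1) and two made-up data points, showing that a point near the mean has a higher likelihood than one far from it:

```python
import math

def gaussian_pdf(x, mu, sigma):
    """Density of a normal distribution N(mu, sigma^2) evaluated at x."""
    return math.exp(-((x - mu) ** 2) / (2 * sigma ** 2)) / (sigma * math.sqrt(2 * math.pi))

# Assumed (hypothetical) data distribution: N(5, 1)
mu, sigma = 5.0, 1.0

print(gaussian_pdf(5.2, mu, sigma))   # ~0.391  -- near the mean, higher likelihood
print(gaussian_pdf(8.0, mu, sigma))   # ~0.0044 -- far from the mean, lower likelihood
```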
For Understanding
 Examples of Probability and Likelihood
 Example 1 – Coin Toss
 • In the context of coin tosses, likelihood and probability represent
   different aspects of the same experiment.
 • The likelihood refers to the probability of observing a specific
   outcome given a particular model or hypothesis.
 • On the other hand, probability represents the long-term frequency of
   an event occurring over multiple trials.
For Understanding
 • To recap: probability is generally something we consider when
   we have a model with a fixed set of parameters and we are
   interested in the types of data that might be generated.
 • Conversely, likelihood comes into play when we have already
   observed data and we want to examine how likely certain model
   parameters are.
 • The distinction between probability and likelihood is
   fundamentally important: Probability attaches to possible
   results; likelihood attaches to hypotheses.
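 • The contrast can be made concrete with a small sketch; the counts used here (7 heads in 10 tosses) are illustrative assumptions:

```python
from math import comb

# Probability: the model is fixed (a fair coin, p = 0.5); we ask about possible data.
p = 0.5
print(comb(10, 7) * p**7 * (1 - p)**3)   # ~0.117: P(7 heads in 10 tosses | p = 0.5)

# Likelihood: the data are fixed (7 heads observed in 10 tosses); we ask about parameters.
def likelihood(p, heads=7, tosses=10):
    return comb(tosses, heads) * p**heads * (1 - p)**(tosses - heads)

print(likelihood(0.5))   # ~0.117
print(likelihood(0.7))   # ~0.267 -- the hypothesis p = 0.7 explains this data better
```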
For Understanding
  What is Probability?
  Probability is a measure of the likelihood that an event will actually
  occur based on information or assumptions that are currently known.
  The probability of the event is commonly stated as a number between 0
  and 1, where 0 indicates impossibility and 1 indicates inevitability.
   To determine probability, use the following formula:
   Probability = Number of favorable outcomes / Total number of outcomes
  For instance, the probability of getting heads when flipping a fair coin is
  0.5 because there are two possible outcomes (heads or tails), and each
  outcome has an equal likelihood of occurring.
  Probability is used to describe the likelihood of events based on
  assumptions or to make predictions about the future.
For Understanding
  • Probability is used to make predictions about future
    events, whereas likelihood is used to estimate unknown
    parameters based on seen evidence.
8.2 FUNDAMENTALS OF BAYES THEOREM
• Naïve Bayes Model relies on Bayes theorem that works on the principle of three
  kinds of probabilities called prior probability, likelihood probability, and posterior
  probability.
• Prior Probability: It is the general probability of an uncertain event before an observation is seen or any evidence is collected. It is the initial probability that is believed before any new information arrives.
• Likelihood Probability: It is the relative probability of the observation occurring for each class, i.e., the sampling density of the evidence given the hypothesis. It is stated as P (Evidence | Hypothesis), which denotes how likely the evidence is given the hypothesis.
• Posterior Probability: It is the updated or revised probability of an event taking into account the observations from the training data. P (Hypothesis | Evidence) is the posterior distribution representing the belief about the hypothesis, given the evidence from the training data. Therefore,
• Posterior probability = prior probability + new evidence
8.3 CLASSIFICATION USING BAYES MODEL
  • Naïve Bayes Classification models work on the principle of Bayes
    theorem.
  • Bayes’ rule is a mathematical formula used to determine the
    posterior probability, given prior probabilities of events.
   • Generally, Bayes theorem is used to select the most probable hypothesis from the data, combining prior knowledge about the hypothesis with the observed evidence. It is based on the calculation of the posterior probability and is stated as:
             P (Hypothesis h | Evidence E)
  • where, Hypothesis h is the target class to be classified and Evidence E
    is the given test instance.
• P (Hypothesis h| Evidence E) is calculated from the prior probability P
  (Hypothesis h), the likelihood probability P (Evidence E |Hypothesis h)
  and the marginal probability P (Evidence E).
• It can be written as Eq. (8.1):
     P (Hypothesis h | Evidence E) = [P (Evidence E | Hypothesis h) × P (Hypothesis h)] / P (Evidence E)      (8.1)
• where, P (Hypothesis h) is the prior probability of the hypothesis h
  without observing the training data or considering any evidence.
• It denotes the prior belief or the initial probability that the hypothesis h
  is correct. P (Evidence E) is the prior probability of the evidence E from
  the training dataset without any knowledge of which hypothesis holds.
  It is also called the marginal probability.
• P (Evidence E | Hypothesis h) is the conditional probability of Evidence E given Hypothesis h.
• It is the likelihood of observing the Evidence E in the training data when the hypothesis h is correct.
• P (Hypothesis h | Evidence E) is the posterior probability of
  Hypothesis h given Evidence E.
• It is the probability that the hypothesis h is correct after observing the training data, i.e., the evidence E.
• In other words, from the Bayes equation Eq. (8.1), one can observe that:
     Posterior Probability ∝ Prior Probability × Likelihood Probability
• Bayes theorem helps in calculating the posterior probability for a
  number of hypotheses, from which the hypothesis with the highest
  probability can be selected.
• This selection of the most probable hypothesis from a set of
  hypotheses is formally defined as Maximum A Posteriori (MAP)
  Hypothesis
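• In symbols, the MAP hypothesis is the one with the highest posterior probability; because P (Evidence E) is the same for every hypothesis, it can be dropped from the comparison:
     h_MAP = argmax over h in H of P (Evidence E | Hypothesis h) × P (Hypothesis h)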
• What is Naive Bayes Classifier?
• Naive Bayes classifier is a probabilistic machine learning model based
  on Bayes’ theorem. It assumes independence between features and
  calculates the probability of a given input belonging to a particular
  class. It’s widely used in text classification, spam filtering, and
  recommendation systems.
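• As an illustration of the text-classification use case, a minimal sketch with scikit-learn is given below; the messages, labels, and the test sentence are hypothetical.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

# Tiny hypothetical spam-filtering example; the messages and labels are made up.
messages = ["win a free prize now", "meeting at 10 am",
            "free offer click now", "lunch tomorrow at noon"]
labels = ["spam", "ham", "spam", "ham"]

vectorizer = CountVectorizer()
X = vectorizer.fit_transform(messages)   # bag-of-words count features

model = MultinomialNB()                  # Naive Bayes for count features
model.fit(X, labels)

test = vectorizer.transform(["free prize offer"])
print(model.predict(test))               # expected to print ['spam']
```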
8.3.1 NAÏVE BAYES ALGORITHM
 • It is a supervised binary-class or multi-class classification algorithm that works on the principle of Bayes theorem.
 • There is a family of Naïve Bayes classifiers based on a common principle.
 • These algorithms assume that the features of the dataset are independent and that each feature carries equal weight.
 • It works particularly well for large datasets and is very fast. It is one of the most effective and simple classification algorithms.
 • The algorithm treats all features as independent of each other, even though each of them individually depends on the class of the object being classified.
 • Each feature contributes a probability value independently during classification, and hence the algorithm is called 'naïve' (the scoring rule is summarized below). Some important applications of these algorithms are text classification, recommendation systems and face recognition.
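 • Concretely, for a test instance with feature values x1, x2, …, xn, each class c is scored as:
      P (c | x1, x2, …, xn) ∝ P (c) × P (x1 | c) × P (x2 | c) × … × P (xn | c)
   and the class with the highest score is predicted.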
• Solution: The training dataset T consists of 10 data instances with
  attributes such as ‘CGPA’, ‘Interactiveness’, ‘Practical Knowledge’
  and ‘Communication Skills’ as shown in Table 8.1.
• The target variable is Job Offer which is classified as Yes or No for a
  candidate student.
• Step 1: Compute the prior probability for the target feature ‘Job
  Offer’. The target feature ‘Job Offer’ has two classes, ‘Yes’ and ‘No’.
• It is a binary classification problem.
• Given a student instance, we need to classify whether ‘Job Offer =
  Yes’ or ‘Job Offer = No’.
• From the training dataset, we observe that the frequency or the
  number of instances with ‘Job Offer = Yes’ is 7 and ‘Job Offer = No’ is
  3.
• The prior probability for the target feature is calculated by dividing
  the number of instances belonging to a particular target class by the
  total number of instances.
• Hence, the prior probability for ‘Job Offer = Yes’ is 7/10 and ‘Job Offer
  = No’ is 3/10 as shown in Table 8.2.
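• A minimal sketch of this step, using only the class counts quoted above (7 Yes and 3 No out of 10 instances):

```python
# Class counts quoted in Step 1: 10 instances, 7 with Job Offer = Yes, 3 with No.
n_total, n_yes, n_no = 10, 7, 3

prior_yes = n_yes / n_total   # P(Job Offer = Yes) = 0.7
prior_no = n_no / n_total     # P(Job Offer = No)  = 0.3
print(prior_yes, prior_no)
```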
• Step 2: Compute the frequency matrix and the likelihood probability for each feature.
• Step 2(a): Feature – CGPA. Table 8.3 shows the frequency matrix for the feature CGPA.
• Table 8.4 shows how the likelihood probability is calculated for CGPA
  using conditional probability.
• As explained earlier, the likelihood probability is the sampling density of the evidence given the hypothesis.
• It is denoted as P (Evidence | Hypothesis), which says how likely the occurrence of the evidence is given the hypothesis.
• It is calculated as the number of instances with a particular attribute value and a given class value, divided by the total number of instances with that class value.
• For example P (CGPA ≥9 | Job Offer = Yes) denotes the number of
  instances with ‘CGPA ≥9’ and ‘Job Offer = Yes’ divided by the total number
  of instances with ‘Job Offer = Yes’.
• From the Table 8.3 Frequency Matrix of CGPA, number of instances with
  ‘CGPA ≥9’ and ‘Job Offer = Yes’ is 3. The total number of instances with
  ‘Job Offer = Yes’ is 7. Hence, P (CGPA ≥9 | Job Offer = Yes) = 3/7.
• Similarly, the Likelihood probability is calculated for all attribute values of
  feature CGPA.
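• The same calculation as a minimal sketch, using only the counts quoted above (3 instances with CGPA ≥ 9 and Job Offer = Yes, out of 7 Yes instances):

```python
# Counts quoted from Table 8.3: 3 instances have CGPA >= 9 together with
# Job Offer = Yes, out of 7 instances with Job Offer = Yes.
n_yes = 7
n_cgpa_ge9_and_yes = 3

p_cgpa_ge9_given_yes = n_cgpa_ge9_and_yes / n_yes
print(p_cgpa_ge9_given_yes)   # 3/7 ≈ 0.4286
```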
• Step 2(b): Feature – Interactiveness Table 8.5 shows the frequency
  matrix for the feature Interactiveness.
8.3.4 Gibbs Algorithm
  • The main drawback of Bayes optimal classifier is that it computes the
    posterior probability for all hypotheses in the hypothesis space and
    then combines the predictions to classify a new instance.
  • Gibbs algorithm is a sampling technique which randomly selects a
    hypothesis from the hypothesis space according to the posterior
    probability distribution and classifies a new instance.
  • It has been shown that the expected prediction error of the Gibbs algorithm is at most twice that of the Bayes optimal classifier.
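  • A minimal sketch of the Gibbs classification step, assuming a hypothetical hypothesis space of three simple rules and an already-computed posterior distribution over them (all values are made up for illustration):

```python
import random

# Hypothetical hypothesis space: three simple rules that map a CGPA value to a class.
hypotheses = [
    lambda cgpa: "Yes",                          # h1: always predict Yes
    lambda cgpa: "Yes" if cgpa >= 8 else "No",   # h2: a threshold rule
    lambda cgpa: "No",                           # h3: always predict No
]
posterior = [0.5, 0.3, 0.2]   # assumed P(h | training data) for h1, h2, h3

# Gibbs algorithm: draw ONE hypothesis according to the posterior distribution
# and use it alone to classify the new instance (no averaging over all hypotheses).
h = random.choices(hypotheses, weights=posterior, k=1)[0]
print(h(9.1))   # classification of a new instance with CGPA = 9.1
```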
8.4 NAÏVE BAYES ALGORITHM FOR
CONTINUOUS ATTRIBUTES
• There are two ways to apply the Naive Bayes algorithm to continuous attributes:
• 1. Discretize the continuous feature into a discrete feature.
• 2. Apply a Normal (Gaussian) distribution for the continuous feature.
Gaussian Naive Bayes Algorithm
In Gaussian Naive Bayes, the values of continuous features are assumed to be sampled from a Gaussian distribution.
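A minimal sketch of the Gaussian variant under assumed class-conditional means and standard deviations; the Gaussian parameters below are hypothetical, not estimated from Table 8.1, while the priors 0.7 and 0.3 come from the worked example:

```python
import math

def gaussian_likelihood(x, mu, sigma):
    """P(x | class) when the feature is assumed to follow N(mu, sigma^2) within the class."""
    return math.exp(-((x - mu) ** 2) / (2 * sigma ** 2)) / (sigma * math.sqrt(2 * math.pi))

# Assumed class-conditional statistics (mean, std) of CGPA within each class.
stats = {"Yes": (8.9, 0.5), "No": (7.0, 0.6)}
priors = {"Yes": 0.7, "No": 0.3}

x = 8.5   # CGPA of a new (continuous-valued) test instance
scores = {c: priors[c] * gaussian_likelihood(x, mu, sigma)
          for c, (mu, sigma) in stats.items()}
print(max(scores, key=scores.get))   # class with the highest posterior score
```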
Thank You