Maximum Likelihood Estimation
Santosh K. Vipparthi
School of AI & DE, IIT Ropar
Jan - May
Definition
Maximum Likelihood Estimation (MLE) is a statistical method used to estimate the parameters of a probability distribution by maximizing the likelihood function.
AIM
In statistics, Maximum Likelihood Estimation (MLE) is a method used to estimate the parameters of an assumed probability distribution based on observed data.
This is done by maximizing the likelihood function, so that, under the assumed statistical model, the observed data are most probable.
Assumptions
Let (Ω, P) be a probability space, and let X be a random variable. Suppose f_X is the probability density function (pdf) and F_X is the (probability) distribution function of X.
Example
Suppose we have a bag that contains 3 balls, and we pick one ball at a time. Each ball is either red or blue, but we have no information beyond this. Let X1 indicate whether the first chosen ball is blue.
Step 1: Probability Setup
A bag contains 3 balls; each ball is either red or blue.
We do not know the number of blue balls, so we define an unknown parameter:
θ = number of blue balls in the bag.
Possible values of θ: θ ∈ {0, 1, 2, 3}.
Define X1 as:
X1 = 1 if the first drawn ball is blue, and X1 = 0 if the first drawn ball is red.
Step 2: Finding Probabilities
The probability of selecting a blue ball depends on θ:
If θ = 3, all balls are blue: P(X1 = 1) = 1, P(X1 = 0) = 0.
If θ = 2, two blue and one red ball: P(X1 = 1) = 2/3, P(X1 = 0) = 1/3.
If θ = 1, one blue and two red balls: P(X1 = 1) = 1/3, P(X1 = 0) = 2/3.
If θ = 0, no blue balls: P(X1 = 1) = 0, P(X1 = 0) = 1.
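These case-by-case probabilities are easy to tabulate in code. Below is a minimal sketch (plain Python, using fractions for exact arithmetic; the helper name pmf_blue is ours, not from the slides) that returns P(X1 = 1|θ) for each admissible θ.

```python
from fractions import Fraction

def pmf_blue(theta: int) -> Fraction:
    """P(X1 = 1 | theta): probability the first draw is blue
    when the bag of 3 balls contains `theta` blue balls."""
    assert theta in {0, 1, 2, 3}
    return Fraction(theta, 3)

for theta in range(4):
    print(f"theta = {theta}: P(X1=1) = {pmf_blue(theta)}, "
          f"P(X1=0) = {1 - pmf_blue(theta)}")
```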
Step 3: Probability Mass Function (PMF)
The probability mass function (PMF) describes how X1 is distributed based on θ:
f(X1 = 1|θ) = 1 if θ = 3; 2/3 if θ = 2; 1/3 if θ = 1; 0 if θ = 0.
Equivalently, f(X1 = 1|θ) = θ/3.
Key point: the PMF depends on θ, which is unknown.
Step 4: Likelihood Function
Likelihood function definition:
L(θ) = f(X1|θ)
What does this mean?
Instead of asking: "What is the probability of getting X1 given θ?" (PMF)
We now ask: "Given that we observed X1, which θ is most likely?" (Likelihood)
PMF: f(X1|θ) → given θ, what is the probability of drawing a blue ball?
P(X1 = 1|θ) = f(X1 = 1|θ)
Likelihood: L(θ|X1) → given that we drew a blue ball, which θ is most likely?
L(θ) = f(X1|θ)
Main difference:
In the PMF, θ is fixed and X1 varies.
In the likelihood, X1 is fixed (we observed it) and θ varies.
The likelihood measures how well a model explains the data.
MLE chooses the parameters that maximize this likelihood.
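The "same formula, different variable" point is easiest to see numerically. The sketch below (our own illustration, not from the slides) fixes the observation X1 = 1 and scans θ, then picks the θ with the largest likelihood: a one-observation preview of MLE.

```python
from fractions import Fraction

def likelihood(theta: int, x1: int) -> Fraction:
    """L(theta | x1) = f(x1 | theta) for the 3-ball bag."""
    p_blue = Fraction(theta, 3)
    return p_blue if x1 == 1 else 1 - p_blue

x1_observed = 1  # we drew a blue ball
for theta in range(4):
    print(theta, likelihood(theta, x1_observed))  # 0, 1/3, 2/3, 1

best = max(range(4), key=lambda t: likelihood(t, x1_observed))
print("MLE:", best)  # MLE: 3 (with a single blue draw, theta = 3 is most likely)
```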
Let (Ω, P) be a probability space, and let X1, X2, ..., Xn be random variables. Suppose fk is the probability density function (pdf) of Xk, for k = 1, 2, ..., n.
Suppose θ is an unknown parameter and fk(Xk|θ) is the probability density function at the true value of θ.
Definition
If X1, X2, ..., Xn are independent, the function f defined by
f(x1, x2, ..., xn|θ) = f1(x1|θ) · f2(x2|θ) ··· fn(xn|θ) = ∏_{k=1}^{n} fk(xk|θ)
is known as the joint pdf of X = (X1, X2, ..., Xn).
Definition
The function θ ↦ L(θ) = L(θ|X) defined by
L(θ|X = (X1, X2, ..., Xn)) = f(x1, x2, ..., xn|θ)
is the likelihood function of X = (X1, X2, ..., Xn).
Example. Suppose we have a bag that contains 3 balls, and we pick one ball at a time (with replacement), 4 times. Each ball is either red or blue, but we have no further information. Let Xi indicate whether the i-th chosen ball is blue, for i = 1, 2, 3, 4, that is
Xi = 1 if the i-th chosen ball is blue, and Xi = 0 if it is red.
Assume that θ is the number of blue balls in the bag. Then
P(Xi = xi|θ) = θ/3 if xi = 1, and 1 − θ/3 if xi = 0.
Since the draws are independent, the joint pmf gives
P(X1 = x1, X2 = x2, X3 = x3, X4 = x4|θ) = ∏_{i=1}^{4} P(Xi = xi|θ).
In particular, for (x1, x2, x3, x4) = (1, 0, 1, 1) we get
L(θ) = (θ/3)³ (1 − θ/3), so L(0) = 0, L(1) = 2/81, L(2) = 8/81, L(3) = 0.
The likelihood is largest at θ = 2, so the maximum likelihood estimate is θ̂ = 2.
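A minimal sketch of this computation (plain Python, exact arithmetic via fractions; helper names are ours): evaluate L(θ) for the observed draws (1, 0, 1, 1) and take the argmax.

```python
from fractions import Fraction

def draw_prob(x: int, theta: int) -> Fraction:
    """P(Xi = x | theta) for one draw from the 3-ball bag."""
    p_blue = Fraction(theta, 3)
    return p_blue if x == 1 else 1 - p_blue

observed = (1, 0, 1, 1)

def likelihood(theta: int) -> Fraction:
    L = Fraction(1)
    for x in observed:  # independence: multiply per-draw probabilities
        L *= draw_prob(x, theta)
    return L

for theta in range(4):
    print(theta, likelihood(theta))   # 0 0, 1 2/81, 2 8/81, 3 0
print("MLE:", max(range(4), key=likelihood))  # MLE: 2
```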
Likelihood Function
Let (Ω, P) be a probability space, and let X = (X1, X2, ..., Xn) be independent random variables with probability density functions (pdfs) fk for Xk, where k = 1, 2, ..., n.
Given an unknown parameter θ, the conditional pdf of Xk is fk(Xk|θ).
The joint pdf of X is
f(X|θ) = ∏_{k=1}^{n} fk(Xk|θ),
which defines the likelihood function:
L(θ|X) = f(X|θ).
Maximum Likelihood Estimate (MLE)
The Maximum Likelihood Estimate (MLE), denoted by θ̂, is the value of θ that maximizes the likelihood function:
θ̂ = arg max_θ L(θ|X).
Finding the Maximum Likelihood Estimate
Let (Ω, P) be a probability space, and let X = (X1, X2, ..., Xn) be independent random variables. Suppose fk is the pdf of Xk, for k = 1, 2, ..., n.
Suppose θ is an unknown parameter, fk(Xk|θ) is the pdf at the true value of θ, and f(X|θ) is the joint pdf of X. Let L(θ|X) be the likelihood function of X = (X1, X2, ..., Xn).
Using calculus: test for finding the maximum value. Suppose the likelihood function θ ↦ L(θ|X) is smooth (twice differentiable).
Step 1:
Find all θ such that dL/dθ = 0 (these are the critical points of L).
Step 2:
Among the θ's obtained above, keep those for which d²L/dθ² < 0 (the local maxima).
Step 3:
If more than one θ remains, choose the one at which L(θ) is largest.
Using the log-likelihood function. Sometimes it is difficult to differentiate the likelihood function L directly. In that case we work with its logarithm, known as the log-likelihood function, denoted by ℓ(θ) = log L(θ). Since log is strictly increasing, ℓ and L are maximized at the same θ, so we apply the same procedure to ℓ(θ).
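This three-step recipe can be carried out symbolically. Below is a minimal sketch using sympy (our tool choice; the slides do not prescribe one), applied to the Bernoulli log-likelihood ℓ(θ) = s log θ + (n − s) log(1 − θ), where s = Σ xi is the number of successes. It finds the critical point (Step 1) and checks the second-derivative condition (Step 2).

```python
import sympy as sp

theta, n, s = sp.symbols("theta n s", positive=True)

# Log-likelihood of n iid Bernoulli(theta) trials with s successes
ell = s * sp.log(theta) + (n - s) * sp.log(1 - theta)

# Step 1: solve d(ell)/d(theta) = 0 for the critical points
critical = sp.solve(sp.Eq(sp.diff(ell, theta), 0), theta)
print(critical)  # [s/n]

# Step 2: second-derivative test at the critical point
d2 = sp.diff(ell, theta, 2).subs(theta, critical[0])
print(sp.simplify(d2))  # -n**3/(s*(n - s)): negative, so theta = s/n is a maximum
```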
MLE for more than one unknown parameter. Let (Ω, P) be a probability space, and let X = (X1, X2, ..., Xn) be independent random variables. Suppose fk is the pdf of Xk, for k = 1, 2, ..., n.
Suppose θ = (θ1, θ2, ..., θm), with m ≥ 2, is a vector of unknown parameters, fk(Xk|θ) is the pdf at the true value of θ, and f(X|θ) is the joint pdf of X. Let ℓ(θ|X) be the log-likelihood function of X = (X1, X2, ..., Xn).
Using calculus: test for finding the maximum value. Suppose the log-likelihood ℓ(θ|X) is smooth (twice differentiable).
Step 1:
Find all θ = (θ1, ..., θm) such that ∂ℓ/∂θi = 0 for every i (these are the critical points of ℓ).
Step 2:
At each critical point, pick one of the θi's, say θ1, and check that ∂²ℓ/∂θ1² < 0.
Step 3:
Substituting this θ1, check next that ∂²ℓ/∂θ2² < 0.
Step 4:
Similarly check all the remaining θi's. (Strictly, the full second-order condition is that the Hessian matrix of ℓ be negative definite at the critical point.)
Using this we get the required MLEs, θ̂i.
Let X have a Bernoulli(θ) distribution and let x ∈ {0, 1} be an observed trial of X.
Probability mass function (PMF): P(X = x|θ) = θ^x (1 − θ)^(1−x)
Log-likelihood: ℓ(θ) = x log θ + (1 − x) log(1 − θ)
MLE method: find θ̂ such that dℓ/dθ = 0.
dℓ/dθ = x/θ − (1 − x)/(1 − θ) = 0
⇒ [x(1 − θ) − θ(1 − x)] / [θ(1 − θ)] = 0 ⇒ x(1 − θ) − θ(1 − x) = 0
⇒ x − xθ − θ + xθ = 0 ⇒ x − θ = 0 ⇒ θ = x
MLE: θ̂ = x
MLE for Bernoulli Trials
Let X = (X1, X2, ..., Xn) be iid Bernoulli(θ) random variables and x = (x1, x2, ..., xn) the observed trials (each xi ∈ {0, 1}).
Probability mass function (PMF):
P(X = x|θ) = ∏_{i=1}^{n} θ^(xi) (1 − θ)^(1−xi) = θ^(Σ xi) (1 − θ)^(n − Σ xi)
Log-likelihood: ℓ(θ) = Σ_{i=1}^{n} [xi log θ + (1 − xi) log(1 − θ)]
MLE method: find θ̂ such that dℓ/dθ = 0
⇒ Σ_{i=1}^{n} [xi/θ − (1 − xi)/(1 − θ)] = 0
MLE: θ̂ = (1/n) Σ_{i=1}^{n} xi
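A quick numerical check of θ̂ = (1/n) Σ xi, as a sketch (numpy assumed available; the grid search is only for illustration): simulate Bernoulli data and compare the closed-form estimate with a brute-force maximization of ℓ(θ).

```python
import numpy as np

rng = np.random.default_rng(0)
theta_true = 0.3
x = rng.binomial(1, theta_true, size=1000)   # 1000 Bernoulli(0.3) trials

# Closed-form MLE: the sample mean
theta_hat = x.mean()

# Brute-force check: maximize the log-likelihood on a grid
grid = np.linspace(0.001, 0.999, 999)
loglik = x.sum() * np.log(grid) + (len(x) - x.sum()) * np.log(1 - grid)
theta_grid = grid[np.argmax(loglik)]

print(theta_hat, theta_grid)   # both close to 0.3, and to each other
```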
MLE for a Binomial Trial
Let X ∼ Bin(n, θ) be a Binomial random variable and let x be an observed trial of X.
Probability mass function (PMF): P(X = x|θ) = C(n, x) θ^x (1 − θ)^(n−x).
Log-likelihood: ℓ(θ) = log C(n, x) + x log θ + (n − x) log(1 − θ)
(the term log C(n, x) does not depend on θ, so it does not affect the maximization).
MLE method: find θ̂ such that dℓ/dθ = 0.
dℓ/dθ = x/θ − (n − x)/(1 − θ) = 0
⇒ [x(1 − θ) − θ(n − x)] / [θ(1 − θ)] = 0 ⇒ x(1 − θ) − θ(n − x) = 0
⇒ x − xθ − nθ + xθ = 0 ⇒ x − nθ = 0 ⇒ θ = x/n
MLE: θ̂ = x/n
MLE for Binomial Trials
Let X = (X1, X2, ..., Xn) be iid Binomial random variables with Xi ∼ Bin(m, θ) for each i (writing m for the number of trials per observation, to avoid clashing with the sample size n), and let x = (x1, x2, ..., xn) be the observed values.
Probability mass function (PMF):
P(X = x|θ) = ∏_{i=1}^{n} C(m, xi) θ^(xi) (1 − θ)^(m−xi)
Log-likelihood: ℓ(θ) = Σ_{i=1}^{n} [log C(m, xi) + xi log θ + (m − xi) log(1 − θ)]
MLE method: find θ̂ such that dℓ/dθ = 0
⇒ Σ_{i=1}^{n} [xi/θ − (m − xi)/(1 − θ)] = 0
MLE: θ̂ = (1/(nm)) Σ_{i=1}^{n} xi
(total successes over total trials; with m = 1 this reduces to the Bernoulli result θ̂ = (1/n) Σ xi).
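As a sanity check (our own sketch, numpy assumed): simulate n binomial observations with m trials each and compare θ̂ = Σ xi / (nm) against a grid maximization of ℓ(θ).

```python
import numpy as np

rng = np.random.default_rng(1)
m, n, theta_true = 10, 500, 0.7
x = rng.binomial(m, theta_true, size=n)   # n observations of Bin(m, theta)

# Closed-form MLE: total successes over total trials
theta_hat = x.sum() / (n * m)

# Grid check of the log-likelihood (the log C(m, xi) term is constant in theta)
grid = np.linspace(0.001, 0.999, 999)
loglik = x.sum() * np.log(grid) + (n * m - x.sum()) * np.log(1 - grid)

print(theta_hat, grid[np.argmax(loglik)])  # both near 0.7
```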
MLE for a Normal Distribution
Let X = (X1, X2, ..., Xn) be iid random variables with Xi ∼ N(µ, σ²) for each i, and let x = (x1, x2, ..., xn) be the observed values.
Probability density function (PDF): f(x|µ, σ²) = (1/√(2πσ²)) exp(−(x − µ)²/(2σ²))
Log-likelihood:
ℓ(µ, σ²) = −(n/2) log(2π) − (n/2) log(σ²) − (1/(2σ²)) Σ_{i=1}^{n} (xi − µ)²
MLE method: find µ̂ and σ̂² such that ∂ℓ/∂µ = 0 = ∂ℓ/∂σ², with the second-order conditions holding (e.g. ∂²ℓ/∂µ² < 0).
Setting ∂ℓ/∂µ = (1/σ²) Σ (xi − µ) = 0 and ∂ℓ/∂σ² = −n/(2σ²) + (1/(2σ⁴)) Σ (xi − µ)² = 0 gives:
MLE:
µ̂ = (1/n) Σ_{i=1}^{n} xi
σ̂² = (1/n) Σ_{i=1}^{n} (xi − µ̂)²
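A closing sketch (numpy assumed; scipy.optimize is our tool choice, as the slides do not name one): the closed-form µ̂ and σ̂² match a direct numerical maximization of the normal log-likelihood.

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(2)
x = rng.normal(loc=5.0, scale=2.0, size=1000)   # sample from N(mu=5, sigma^2=4)

# Closed-form MLEs
mu_hat = x.mean()
sigma2_hat = ((x - mu_hat) ** 2).mean()          # note: 1/n, not 1/(n-1)

# Numerical check: minimize the negative log-likelihood over (mu, sigma2)
def nll(params):
    mu, sigma2 = params
    n = len(x)
    return 0.5 * n * np.log(2 * np.pi * sigma2) + ((x - mu) ** 2).sum() / (2 * sigma2)

res = minimize(nll, x0=[0.0, 1.0], bounds=[(None, None), (1e-9, None)])
print(mu_hat, sigma2_hat)   # approximately 5.0 and 4.0
print(res.x)                # numerically the same maximizer
```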