ITU Computer and Informatics Faculty
BLG 454E Learning From Data, Spring 2018
Homework #1
Due March 07, 2018 10pm
HOMEWORK #1 SOLUTIONS
Important Note: If you see ANY mistakes please inform me via kivrakh@itu.edu.tr
1. (10 pts.) In general, the probability that it rains on Saturday is 25%.
Weekend rain has the following relationships:
• If it rains on Saturday, the probability that it rains on Sunday is 50%.
• If it does not rain on Saturday, the probability that it rains on Sunday is 25%.
Given that it rained on Sunday, what is the probability that it rained on Saturday?
Correct answer: 40%
Let T and U be the events of rain on Saturday and Sunday, respectively, and denote the event of no rain on Saturday as T̄. The problem gives us:

P(T) = 25%, P(U|T) = 50%, P(U|T̄) = 25%.

Via Bayes' Theorem:

P(T|U) = P(T) · P(U|T) / P(U)
       = P(T) · P(U|T) / [P(T) · P(U|T) + P(T̄) · P(U|T̄)]
       = (25% · 50%) / (25% · 50% + 75% · 25%)
       = 40%.
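The arithmetic above can be re-checked numerically; the sketch below simply re-evaluates the Bayes formula (the variable names are my own):

```python
# Sanity check of the Bayes computation above.
p_sat = 0.25            # P(T): rain on Saturday
p_sun_given_sat = 0.50  # P(U | T)
p_sun_given_dry = 0.25  # P(U | not T)

# Total probability of rain on Sunday.
p_sun = p_sat * p_sun_given_sat + (1 - p_sat) * p_sun_given_dry

# Bayes' theorem: P(T | U).
p_sat_given_sun = p_sat * p_sun_given_sat / p_sun
print(p_sat_given_sun)  # 0.4
```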
2. (20 pts.) A bug stands on a random point of the lattice below. Each point is equally likely to be the
starting point.
Every minute, the bug selects an adjacent point at random and moves to it. Each adjacent point is
equally likely to be chosen. For example, if the bug is on point B, then each probability to move to the
points A, C, or G is 1/3.
What is the probability that the bug reaches point A in 2 moves or less? Each point is equally likely to
be the bug’s starting point. Also, assume starting at A will “reach” the point in 0 moves.
Correct answer: 22/63
Let A2 be the event that the bug reaches point A in two moves or less.
Let A, B, C, D, E, F, and G be the events that the bug starts on the corresponding point. The probability of each of these events is 1/7.
If the bug starts on A, then it doesn’t have to move:

P(A2 | A) = 1.
If the bug starts on B, then it can get to A directly, or through G:

P(A2 | B) = 1/3 + (1/3) × (1/6) = 7/18.
Due to the symmetry of the lattice, the probability to get to A is the same starting from point F :
P(A2 | F) = P(A2 | B) = 7/18.
If the bug starts on C, then it can get to A through B or through G :
P(A2 | C) = (1/3) × (1/3) + (1/3) × (1/6) = 1/6.
Due to the symmetry of the lattice, the probability to get to A is the same starting from point E :
P(A2 | E) = P(A2 | C) = 1/6.
If the bug starts at D, then it can only go through point G :
P(A2 | D) = (1/3) × (1/6) = 1/18.
If the bug starts at G, it can go to A directly, or it can go through B or F :
P(A2 | G) = 1/6 + 2 × (1/6) × (1/3) = 5/18.
P(A2) is the sum of all these probabilities multiplied by 1/7:

P(A2) = (1/7) × [1 + 2 × (7/18) + 2 × (1/6) + 1/18 + 5/18]
      = 22/63.
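The case analysis above can be verified exactly by enumerating all one- and two-step paths. The sketch below assumes the lattice is a hexagon with vertices A–F around a center point G, which is consistent with the adjacencies used in the solution; the function name is my own:

```python
from fractions import Fraction

# Assumed lattice: hexagon vertices A-F, each adjacent to its two
# neighbours and to the centre G; G is adjacent to all six vertices.
adj = {
    'A': ['B', 'F', 'G'], 'B': ['A', 'C', 'G'], 'C': ['B', 'D', 'G'],
    'D': ['C', 'E', 'G'], 'E': ['D', 'F', 'G'], 'F': ['E', 'A', 'G'],
    'G': ['A', 'B', 'C', 'D', 'E', 'F'],
}

def p_reach_a_within_2(start):
    """Exact probability of visiting A in at most two random moves."""
    if start == 'A':
        return Fraction(1)
    p = Fraction(0)
    for first in adj[start]:
        step = Fraction(1, len(adj[start]))
        if first == 'A':
            p += step  # reached A on the first move
        else:
            # probability that the second move lands on A
            p += step * Fraction(adj[first].count('A'), len(adj[first]))
    return p

# Average over the uniformly random starting point.
total = sum(p_reach_a_within_2(s) for s in adj) / 7
print(total)  # 22/63
```

Using exact `Fraction` arithmetic avoids any floating-point rounding, so the result matches the hand computation exactly.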
3. (40 pts.) The idea for the maximum likelihood estimate (MLE) is to find the value of the pa-
rameter(s) for which the data has the highest probability. You are going to do this with the densities.
Suppose the 1-dimensional data points x1, x2, ..., xn given in the ”data.txt” file are drawn from a normal (Gaussian)
N(µ, σ²) distribution, where µ and σ are unknown.
• (20 pts.) Formulate the likelihood function and derive the equation to find the maximum likelihood
estimate for the pair (µ, σ 2 ).
• (20 pts.) Implement (write the code for) MLE in Matlab or Python and provide a plot
similar to Figure 1. You are not allowed to use any built-in functions except histogram
functions, which give you a quick view of the distribution of the data.
(a) Gaussian probability density function:

P(x_i) = (1/√(2πσ²)) · exp(−(x_i − µ)²/(2σ²)), −∞ < x_i < ∞

X ∼ N(µ, σ²)
Given N data samples, the likelihood is
L(µ, σ) = P(x_1:N | µ, σ) = ∏_{i=1}^N P(x_i | µ, σ) = ∏_{i=1}^N (1/√(2πσ²)) · exp(−(x_i − µ)²/(2σ²))
and the log-likelihood is
log L(µ, σ) = ∑_{i=1}^N [ log(1/√(2πσ²)) − (x_i − µ)²/(2σ²) ]
In order to find the estimate µ̂, we need to take the derivative of log L(µ, σ) with respect to µ and equate
it to zero.
∂/∂µ log L(µ, σ) = ∂/∂µ ∑_{i=1}^N log(1/√(2πσ²)) + ∂/∂µ ∑_{i=1}^N −(x_i − µ)²/(2σ²) = 0

⇒ (1/σ²) ∑_{i=1}^N (x_i − µ) = 0

⇒ ∑_{i=1}^N x_i − ∑_{i=1}^N µ = 0
Therefore, the estimate µ̂ is the sample mean of the given data:

µ̂ = (1/N) ∑_{i=1}^N x_i
For the estimate σ̂², we need to take the derivative of log L(µ, σ) with respect to σ² and equate it to
zero. In order to avoid possible mistakes, θ will be used instead of σ² throughout the equations.
∂/∂θ log L(µ, θ) = ∂/∂θ ∑_{i=1}^N log(1/√(2πθ)) + ∂/∂θ ∑_{i=1}^N −(x_i − µ)²/(2θ) = 0

⇒ −N/(2θ) + (1/(2θ²)) ∑_{i=1}^N (x_i − µ)² = 0

⇒ −Nθ + ∑_{i=1}^N (x_i − µ)² = 0
Thus, the estimate of θ, or σ², is

θ̂ = σ̂² = (1/N) ∑_{i=1}^N (x_i − µ̂)²
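As a quick numerical check, the closed-form estimates are exactly the biased sample moments. The sketch below uses synthetic data rather than ”data.txt” (not reproduced here), so the seed and distribution parameters are my own choices:

```python
import numpy as np

# Synthetic stand-in for data.txt: any sample would do for this check.
rng = np.random.default_rng(0)
x = rng.normal(loc=10.0, scale=2.5, size=10_000)

# Closed-form MLE: sample mean and biased sample variance.
mu_hat = np.sum(x) / len(x)
var_hat = np.sum((x - mu_hat) ** 2) / len(x)

# They agree with NumPy's built-in moments (np.var uses ddof=0, i.e. the
# biased divisor N, by default).
assert np.isclose(mu_hat, np.mean(x))
assert np.isclose(var_hat, np.var(x))
```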
(b)
import numpy as np
import matplotlib.pyplot as plt

data = np.loadtxt("data.txt")

# MLE estimates: sample mean and biased sample variance.
mu = np.sum(data) / len(data)
variance = np.sum((data - mu) ** 2) / len(data)

# Plot the histogram (density=True normalizes it to a probability density).
plt.hist(data, bins=25, density=True, alpha=0.6, color='g', label="data")

# Plot the fitted Gaussian PDF over the visible x-range.
xmin, xmax = plt.xlim()
x = np.linspace(xmin, xmax, 885)
p = np.exp(-(x - mu) ** 2 / (2 * variance)) / np.sqrt(2 * np.pi * variance)
plt.plot(x, p, 'k', linewidth=2, label="MLE fitted distribution")

plt.title("MLE results: mu = %.2f, std = %.2f" % (mu, np.sqrt(variance)))
plt.legend(bbox_to_anchor=(0.65, 0.8), loc=2, borderaxespad=0.)
plt.show()
[Plot: histogram of the data with the fitted Gaussian PDF overlaid; MLE results: mu = 10.06, std = 2.57]

Figure 1: Data and fitted Gaussian distribution with MLE
4. (30 pts.) In Table 1 below, x1, x2, x3 ∈ {0, 1} are binary features (x_i denotes the i-th feature)
and y ∈ {+, −} is the class label.
(a) (15 pts.) Construct the Naive Bayes classifier for the given training dataset in Table 1.
Hint: Estimate the class-conditional probabilities for each feature x1, x2, x3.
(b) (5 pts.) Predict the class label for (x1 = 1, x2 = 1, x3 = 1) data using trained Naive Bayes approach
in part (a)
(c) (10 pts.) Calculate the probabilities of P (x1 = 1), P (x2 = 1), and P (x1 = 1, x2 = 1). Decide
whether x1 and x2 are independent or not.
(a) P(x1 = 1|+) = 3/5 = 0.6, P(x2 = 1|+) = 2/5 = 0.4, P(x3 = 1|+) = 4/5 = 0.8,
P(x1 = 0|+) = 2/5 = 0.4, P(x2 = 0|+) = 3/5 = 0.6, P(x3 = 0|+) = 1/5 = 0.2,
P(x1 = 1|−) = 2/5 = 0.4, P(x2 = 1|−) = 2/5 = 0.4, P(x3 = 1|−) = 1/5 = 0.2,
P(x1 = 0|−) = 3/5 = 0.6, P(x2 = 0|−) = 3/5 = 0.6, P(x3 = 0|−) = 4/5 = 0.8.
(b) Let R : (x1 = 1, x2 = 1, x3 = 1) be the test instance. To determine its class, we need to compute
P(+|R) and P(−|R). Using Bayes’ theorem:

P(+|R) = P(R|+)P(+) / P(R)  and  P(−|R) = P(R|−)P(−) / P(R).

Since P(+) = P(−) = 5/10 = 0.5 and P(R) is the same for both classes, R can be classified by
comparing P(R|+) and P(R|−).
Naive Bayes assumes the features are conditionally independent given the class, so we can write:

P(R|+) = P(x1 = 1|+) × P(x2 = 1|+) × P(x3 = 1|+) = 0.6 × 0.4 × 0.8 = 0.192
P(R|−) = P(x1 = 1|−) × P(x2 = 1|−) × P(x3 = 1|−) = 0.4 × 0.4 × 0.2 = 0.032

Since P(R|+) is larger, the record is assigned to the (+) class.
(c) From Table 1, P(x1 = 1) = 5/10 = 0.5, P(x2 = 1) = 4/10 = 0.4, and P(x1 = 1, x2 = 1) = 2/10 = 0.2.
Since P(x1 = 1) × P(x2 = 1) = 0.5 × 0.4 = 0.2 = P(x1 = 1, x2 = 1), x1 and x2 are independent.
Table 1: Training set for question 4
Instance x1 x2 x3 y
1 0 0 1 -
2 1 0 1 +
3 0 1 0 -
4 1 0 0 -
5 1 0 1 +
6 0 0 1 +
7 1 1 0 -
8 0 0 0 -
9 0 1 0 +
10 1 1 1 +
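Parts (a)–(c) can be reproduced directly from Table 1. A minimal sketch (the helper names `cond_prob` and `nb_score` are my own):

```python
# Training set from Table 1: rows are (x1, x2, x3, y).
data = [
    (0, 0, 1, '-'), (1, 0, 1, '+'), (0, 1, 0, '-'), (1, 0, 0, '-'),
    (1, 0, 1, '+'), (0, 0, 1, '+'), (1, 1, 0, '-'), (0, 0, 0, '-'),
    (0, 1, 0, '+'), (1, 1, 1, '+'),
]

def cond_prob(i, value, label):
    """Class-conditional probability P(x_{i+1} = value | y = label)."""
    rows = [r for r in data if r[3] == label]
    return sum(r[i] == value for r in rows) / len(rows)

def nb_score(label, x):
    """Prior times the product of class-conditional feature probabilities."""
    p = sum(r[3] == label for r in data) / len(data)  # P(label)
    for i, v in enumerate(x):
        p *= cond_prob(i, v, label)
    return p

x = (1, 1, 1)
# nb_score('+', x) ~ 0.096, nb_score('-', x) ~ 0.016 -> predict '+'
print(nb_score('+', x), nb_score('-', x))

# Part (c): joint vs product of marginals for x1 = 1 and x2 = 1.
p_x1 = sum(r[0] == 1 for r in data) / len(data)                   # 0.5
p_x2 = sum(r[1] == 1 for r in data) / len(data)                   # 0.4
p_joint = sum(r[0] == 1 and r[1] == 1 for r in data) / len(data)  # 0.2
print(p_joint == p_x1 * p_x2)
```

The scores equal P(R|±)P(±), i.e. 0.5 × 0.192 and 0.5 × 0.032; dividing both by P(R) would give the posteriors, but the comparison is unchanged.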
End of homework.