Announcements:
• Quiz 1 on January 30, 2025 at 3 pm. It will cover material
  taught up to January 23.
• Please fill out the Google form.
Simple Linear Regression model continued
          “Desirable” properties of estimators
  • Unbiasedness
  • Efficiency (minimum variance) [we will cover this later]
These are finite-sample properties.
An estimator θ̂ of θ is said to be unbiased if E(θ̂) = θ.
In the SLR case, we want β̂0 and β̂1 to be unbiased, i.e., that
E[β̂1] = β1 and E[β̂0] = β0.
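As a simple worked example (not from the slides), the sample mean of
a random sample is an unbiased estimator of the population mean μ:

   E(x̄) = E[(1/n) Σᵢ xi] = (1/n) Σᵢ E(xi) = (1/n) · nμ = μ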
             Assumptions in the SLR model
Given the regression model y = β0 + β1 x + u, we assume
 1. The model is linear in parameters.
 2. There is a random sample of n observations on y and x.
 3. Not all the x have the same value.
 4. E(u|x) = 0. By the Law of Iterated Expectations, this ⇒ E(u) = 0.
 5. Var(u|x) = σ². This assumption is called homoskedasticity.
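A minimal simulation sketch of a data-generating process satisfying
these assumptions; the parameter values β0 = 1, β1 = 2, and σ = 0.5
are illustrative choices, not from the lecture:

import numpy as np

rng = np.random.default_rng(0)
n = 100
beta0, beta1, sigma = 1.0, 2.0, 0.5   # illustrative population parameters

x = rng.uniform(0, 10, size=n)        # random sample; not all x equal (assumptions 2, 3)
u = rng.normal(0, sigma, size=n)      # E(u|x) = 0, Var(u|x) = σ² (assumptions 4 and 5)
y = beta0 + beta1 * x + u             # linear in parameters (assumption 1)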
                      Assumption 4
• We needed only the first three assumptions to derive the OLS
  estimator.
• We need assumption 4 to demonstrate unbiasedness of the
  OLS estimator.
• Another implication of (4) is that E(y|x) = β0 + β1x (see the
  derivation below). This is referred to as the population
  regression function. It emphasizes how y changes on average
  with changes in x.
• This is a key assumption and we will refer to it several times
  in this course.
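Taking conditional expectations of the model and applying
assumption 4:

   E(y|x) = E(β0 + β1x + u | x) = β0 + β1x + E(u|x) = β0 + β1x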
                 Unbiasedness of β̂1 and β̂0
Before proving unbiasedness, it is useful to recognize the following
identity (all sums below run over i = 1, …, n):

   Σᵢ (xi − x̄)(yi − ȳ) = Σᵢ (xi − x̄)yi

It holds because Σᵢ (xi − x̄)(yi − ȳ) = Σᵢ (xi − x̄)yi − ȳ Σᵢ (xi − x̄),
and Σᵢ (xi − x̄) = 0.
We use this to rewrite the expression for β̂1 as follows:
   β̂1 = Σᵢ (xi − x̄)(yi − ȳ) / Σᵢ (xi − x̄)² = Σᵢ (xi − x̄)yi / Σᵢ (xi − x̄)²
Substitute for yi from the population model
   ⇒ β̂1 = Σᵢ (xi − x̄)(β0 + β1xi + ui) / Σᵢ (xi − x̄)²
                  Unbiasedness of β̂1 and β̂0
   = β0 Σᵢ (xi − x̄) / Σᵢ (xi − x̄)² + β1 Σᵢ (xi − x̄)xi / Σᵢ (xi − x̄)²
     + Σᵢ (xi − x̄)ui / Σᵢ (xi − x̄)²
Now use Σᵢ (xi − x̄) = 0 and Σᵢ (xi − x̄)xi = Σᵢ (xi − x̄)²:

   ⇒ β̂1 = 0 + β1 Σᵢ (xi − x̄)² / Σᵢ (xi − x̄)² + Σᵢ (xi − x̄)ui / Σᵢ (xi − x̄)²
         = β1 + Σᵢ (xi − x̄)ui / Σᵢ (xi − x̄)²
This is a key relationship and has many uses.
This means that, conditional on x, E(β̂1|x) = β1 + 0 = β1, hence β̂1 is
unbiased. By the Law of Iterated Expectations, unbiasedness then holds
unconditionally as well: E(β̂1) = E[E(β̂1|x)] = β1.
Similarly, β̂0 is also unbiased. Proof left as an exercise.
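A minimal Monte Carlo sketch of this result, with illustrative
parameter values (not from the lecture): averaging β̂1 across many
simulated samples should recover β1.

import numpy as np

rng = np.random.default_rng(0)
n, reps = 50, 10_000
beta0, beta1, sigma = 1.0, 2.0, 1.0   # illustrative population parameters

x = rng.uniform(0, 10, size=n)        # condition on a fixed set of x's
sxx = np.sum((x - x.mean()) ** 2)
slopes = np.empty(reps)
for r in range(reps):
    u = rng.normal(0, sigma, size=n)  # fresh errors each replication
    y = beta0 + beta1 * x + u
    # OLS slope: Σ (xi − x̄)yi / Σ (xi − x̄)²
    slopes[r] = np.sum((x - x.mean()) * y) / sxx

print(slopes.mean())                  # ≈ 2.0, consistent with E(β̂1) = β1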
                      Assumption 5
• Assumption 5 states that Var(u|x) = σ², i.e., the errors are
  homoskedastic. This in turn implies that Var(y|x) = σ².
• σ² is a scalar, and does not vary with observation i. When
  Var(u|x) is not a constant, we term this a case of
  heteroskedasticity. We will consider this case later.
• It is important to reiterate that both assumptions (4) and (5)
  pertain to the unobserved error u and not to the observed
  residual û.
[Figure: Homoskedasticity (from Wooldridge, Chapter 2)]
[Figure: Heteroskedasticity (from Wooldridge, Chapter 2)]
Assumption 5 rules out heteroskedasticity (for now)
                          Variance of β̂1
Assumption 5 is needed to derive the variance of the estimated
coefficients. We start with the slope coefficient β̂1. But first, recall
that the variance of any estimator θ̂ is given by:

   Var(θ̂) = E[(θ̂ − E(θ̂))²]
   ⇒ Var(β̂1) = E[(β̂1 − β1)²] = E[(Σᵢ (xi − x̄)ui / Σᵢ (xi − x̄)²)²]
where all expectations are conditional on the sample x's (recall from
the unbiasedness derivation that β̂1 − β1 = Σᵢ (xi − x̄)ui / Σᵢ (xi − x̄)²).
   = E[(Σᵢ (xi − x̄)ui)²] / (Σᵢ (xi − x̄)²)²
     = [1 / (Σᵢ (xi − x̄)²)²] Σᵢ (xi − x̄)² E(ui²)

We were able to do this because, conditional on x, the cross terms
vanish: random sampling implies E(ui uj) = 0 for i ≠ j. Note further
that, conditional on x, E(ui²) = σ², since E(ui|x) = 0 gives
Var(ui|x) = E(ui²|x) = σ², a constant. After cancellation,
   ⇒ Var(β̂1) = σ² / Σᵢ (xi − x̄)²
We will show later that this is the lowest variance among all linear
unbiased estimators (the Gauss-Markov theorem).
An analogous expression can be derived for β̂0 . Proof left as an
exercise.
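A minimal simulation check of this formula, continuing the earlier
illustrative setup: the variance of β̂1 across replications should match
σ² / Σᵢ (xi − x̄)².

import numpy as np

rng = np.random.default_rng(0)
n, reps = 50, 20_000
beta0, beta1, sigma = 1.0, 2.0, 1.0   # illustrative population parameters

x = rng.uniform(0, 10, size=n)        # hold the x's fixed across replications
sxx = np.sum((x - x.mean()) ** 2)
slopes = np.empty(reps)
for r in range(reps):
    u = rng.normal(0, sigma, size=n)
    y = beta0 + beta1 * x + u
    slopes[r] = np.sum((x - x.mean()) * y) / sxx

print(slopes.var())                   # empirical Var(β̂1)
print(sigma**2 / sxx)                 # theoretical σ² / Σ (xi − x̄)²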
                  Estimated variance of β̂1
We are not done yet: σ² is a population parameter and is not observed,
so Var(β̂1) cannot be computed directly. To calculate it, we need an
estimator σ̂² of σ².
The estimator of σ² is given by

   σ̂² = Σᵢ ûi² / (n − 2)
It turns out that E(σ̂²) = σ². We will show this formally in the general
K-variable case.
Dividing the SSR by n instead of n − 2 would yield a biased estimator
of σ²; dividing by n − 2 is known as a degrees of freedom correction.
Intuitively, this is because there are two restrictions that the
residuals must satisfy (illustrated in the sketch below):
   • Σᵢ ûi = 0, and
   • Σᵢ xi ûi = 0
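A minimal sketch, using the same illustrative data-generating values
as before, that computes σ̂² with the n − 2 correction and checks the
two residual restrictions:

import numpy as np

rng = np.random.default_rng(0)
n = 50
x = rng.uniform(0, 10, size=n)
y = 1.0 + 2.0 * x + rng.normal(0, 1.0, size=n)   # illustrative model

b1 = np.sum((x - x.mean()) * y) / np.sum((x - x.mean()) ** 2)
b0 = y.mean() - b1 * x.mean()
uhat = y - (b0 + b1 * x)                         # OLS residuals

print(np.sum(uhat))                   # ≈ 0 (first restriction)
print(np.sum(x * uhat))               # ≈ 0 (second restriction)
print(np.sum(uhat**2) / (n - 2))      # σ̂², the unbiased estimator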
                  Estimated variance of β̂1
Therefore the estimated variance of β̂1, denoted V̂ar(β̂1), is

   V̂ar(β̂1) = σ̂² / Σᵢ (xi − x̄)²
The variance of β̂0 can be derived analogously. This is left as an
exercise.
The distinction between Var(β̂1) and V̂ar(β̂1) is important: the former
depends on the unknown σ², while the latter is a statistic computed
from the sample.
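A minimal sketch computing V̂ar(β̂1) by hand on the same illustrative
data, and comparing its square root to the standard error reported by
statsmodels (an outside library, used here only as a check):

import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
n = 50
x = rng.uniform(0, 10, size=n)
y = 1.0 + 2.0 * x + rng.normal(0, 1.0, size=n)   # illustrative model

b1 = np.sum((x - x.mean()) * y) / np.sum((x - x.mean()) ** 2)
b0 = y.mean() - b1 * x.mean()
uhat = y - (b0 + b1 * x)
sigma2_hat = np.sum(uhat**2) / (n - 2)
var_b1_hat = sigma2_hat / np.sum((x - x.mean()) ** 2)

print(np.sqrt(var_b1_hat))                         # manual se(β̂1)
print(sm.OLS(y, sm.add_constant(x)).fit().bse[1])  # statsmodels se for the slope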
                       Dummy variables
When x is a binary variable, taking only the values 0 or 1, it is
called a dummy variable.
Consider a regression of adult heights y (in cms) on gender x, which
takes the value 1 if male and 0 if female.
E[y|x] = β0 + β1x ⇒ E[y|x = 0] = β0 and E[y|x = 1] = β0 + β1
Thus, β0 is the average height of women in cms, while the ‘slope’
coefficient β1 represents the average difference in heights between
men and women.
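A minimal sketch with hypothetical heights data (not the lecture's
dataset) showing that OLS on a dummy reproduces the two group means:

import numpy as np

rng = np.random.default_rng(0)
female = rng.normal(152, 2, size=30)   # hypothetical female heights (cm)
male = rng.normal(164, 2, size=30)     # hypothetical male heights (cm)
y = np.concatenate([female, male])
x = np.concatenate([np.zeros(30), np.ones(30)])   # gender dummy

b1 = np.sum((x - x.mean()) * y) / np.sum((x - x.mean()) ** 2)
b0 = y.mean() - b1 * x.mean()

print(b0, female.mean())               # β̂0 equals the female mean
print(b0 + b1, male.mean())            # β̂0 + β̂1 equals the male mean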
                                     OLS results
Based on state-level data on heights, the results are:
      Source |       SS           df       MS      Number of obs   =       58
-------------+----------------------------------   F(1, 56)        =   496.08
       Model | 2171.62041          1 2171.62041    Prob > F        =   0.0000
    Residual | 245.141349         56 4.37752409    R-squared       =   0.8986
-------------+----------------------------------   Adj R-squared   =   0.8968
       Total | 2416.76176         57 42.3993292    Root MSE        =   2.0923
------------------------------------------------------------------------------
      height | Coefficient Std. err.       t    P>|t|     [95% conf. interval]
-------------+----------------------------------------------------------------
      gender |   12.23793   .5494526    22.27   0.000     11.13724    13.33862
       _cons |   152.2379   .3885217   391.84   0.000     151.4596    153.0162
------------------------------------------------------------------------------
                            Comparison of means
These are identical to results from a simple comparison of means
. by gender: summ height
--------------------------------------------------------------------------------------------------
-> gender = 0
    Variable |        Obs        Mean    Std. dev.       Min        Max
-------------+---------------------------------------------------------
      height |         29    152.2379    1.537255      149.3      154.8
--------------------------------------------------------------------------------------------------
-> gender = 1
    Variable |        Obs        Mean    Std. dev.       Min        Max
-------------+---------------------------------------------------------
      height |         29    164.4759     2.52822      157.5      168.4
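Note how the two sets of results line up: β̂0 = 152.2379 is exactly the
mean height for gender = 0, and β̂0 + β̂1 = 152.2379 + 12.2379 ≈ 164.4759
is the mean for gender = 1.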