Introductory Econometrics
ECON2206/ECON3209
Slides02
Lecturer: Minxian Yang
ie_Slides02
my, School of Economics, UNSW
2. Simple Regression Model (Ch2)
2. Simple Regression Model
 Lecture plan
Motivation and definitions
ZCM assumption
Estimation method: OLS
Units of measurement
Nonlinear relationships
Underlying assumptions of simple regression model
Expected values and variances of OLS estimators
Regression with STATA
 Motivation
 Example 1. Ceteris paribus effect of fertiliser on soybean yield
yield = β0 + β1 ferti + u .
 Example 2. Ceteris paribus effect of education on wage
wage = β0 + β1 educ + u .
 In general,
y = β0 + β1 x + u ,
where u represents factors other than x that affect y.
 We are interested in
 explaining y in terms of x,
 how y responds to changes in x,
holding other factors fixed.
 Simple regression model
 Definition
y = β0 + β1 x + u ,
y : dependent variable (observable)
x : independent variable (observable)
β1 : slope parameter, partial effect (to be estimated)
β0 : intercept parameter (to be estimated)
u : error term or disturbance (unobservable)
 The disturbance u represents all factors other than x.
 With the intercept β0, the population average of u can
always be normalised to zero (without losing anything):
E(u) = 0 ,
since y = [β0 + E(u)] + β1 x + [u − E(u)].
 Zero conditional mean assumption
y = β0 + β1 x + u
y + Δy = β0 + β1(x + Δx) + (u + Δu)
 If other factors in u are held fixed (Δu = 0), the ceteris
paribus effect of x on y is β1 :
Δy = β1 Δx .   (Δ = change)
 But under what condition can u be held fixed while x
changes?
 As x and u are treated as random variables,
u is fixed while x varies is described as:
the mean of u for any given x is the same (zero).
 The required condition is
E(u | x) = E(u) = 0 for every value of x,
known as the zero-conditional-mean (ZCM) assumption.
 Zero conditional mean assumption
 Example 2. wage = β0 + β1 educ + u
Suppose u represents ability.
Then ZCM assumption amounts to
E(ability | educ) = 0 ,
ie, the average ability is the same irrespective of the
years of education.
This is not true
 if people choose the education level to suit their ability;
 or if more ability is associated with less (or more)
education.
In practice, we do not know if ZCM holds and have to
deal with this issue.
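The consequence of a ZCM violation can be illustrated with a small simulation (a Python sketch rather than the course's STATA; the population, coefficients, and variable constructions below are made up for illustration). When unobserved ability raises both education and the error term, the OLS slope overstates the true return to education:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100_000

# Hypothetical population: true return to education is 0.5,
# but unobserved ability raises both educ and u, so E(u | educ) != 0.
ability = rng.normal(size=n)
educ = 12 + 2 * ability + rng.normal(size=n)   # abler people get more schooling
u = 1.0 * ability + rng.normal(size=n)         # ability also sits in the error
wage = 1.0 + 0.5 * educ + u

# OLS slope: sample cov(x, y) / sample var(x)
b1 = np.cov(educ, wage)[0, 1] / np.var(educ, ddof=1)
print(round(b1, 2))   # well above the true 0.5: ZCM fails, OLS is biased
```

Here the algebra predicts a bias of cov(educ, u)/var(educ) = 2/5 = 0.4, so the fitted slope settles near 0.9 rather than 0.5.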
 Zero conditional mean assumption
 Taking the conditional expectation of
y = β0 + β1 x + u
for given x, ZCM implies
E(y | x) = β0 + β1 x ,
known as the population regression function
(PRF), which is a linear function of x.
 The distribution of y is centred about E(y | x).
Systematic part of y : E(y | x).
Unsystematic part of y : u.
 Simple regression model
yi = β0 + β1 xi + ui
[Figure: for each of x1, x2, x3, the conditional distribution of y is centred at the population regression line E(y | x) = β0 + β1 x.]
 Observations on (x, y)
 A random sample is a set of independent
observations on (x, y), ie, {(xi , yi), i = 1,2,...,n}.
 At observation level, the model may be written as
yi = 0 + 1xi + ui , i = 1, 2, ..., n
where i is the observation index.
 Collectively, the n equations stack into matrix notation
Y = X B + U ,
where Y = (y1, ..., yn)′, U = (u1, ..., un)′, B = (β0, β1)′,
and X is the n×2 matrix whose i-th row is (1, xi).
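The stacked form suggests computing the estimates directly from the matrices, as B̂ = (X′X)⁻¹X′Y. A minimal numpy sketch (simulated data; the coefficients 2.0 and 0.7 are made up):

```python
import numpy as np

rng = np.random.default_rng(1)
n = 200
x = rng.uniform(0, 10, size=n)
y = 2.0 + 0.7 * x + rng.normal(size=n)

# Stack the n equations: Y = X B + U, where X has rows (1, x_i)
X = np.column_stack([np.ones(n), x])

# (X'X)^{-1} X'Y, solved without forming the explicit inverse
B_hat = np.linalg.solve(X.T @ X, X.T @ y)
print(B_hat)   # approximately [2.0, 0.7]
```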
 Estimate simple regression
 The model:
yi = β0 + β1 xi + ui ,
i = 1, 2, ..., n
 Let (β̂0, β̂1) be the estimates of (β0, β1).
 The corresponding residual is
ûi = yi − β̂0 − β̂1 xi , i = 1, 2, ..., n.
 The sum of squared residuals (SSR),
SSR = Σᵢ₌₁ⁿ ûi² = Σᵢ₌₁ⁿ (yi − β̂0 − β̂1 xi)² ,
indicates the goodness of the estimates.
 Good estimates should make SSR small.
 Ordinary least squares (OLS)
 The OLS estimates (β̂0, β̂1) minimise the SSR:
(β̂0, β̂1) = minimiser of SSR.
 Choosing (β̂0, β̂1) to minimise SSR, the first-order
conditions lead to
Σᵢ₌₁ⁿ (yi − β̂0 − β̂1 xi) = 0 ,   (mean residual = 0)
Σᵢ₌₁ⁿ (yi − β̂0 − β̂1 xi) xi = 0 .   (covariance of residual and x = 0)
 Ordinary least squares (OLS)
 Solving the two equations in two unknowns gives
β̂1 = Σᵢ₌₁ⁿ (xi − x̄)(yi − ȳ) / Σᵢ₌₁ⁿ (xi − x̄)² ,
β̂0 = ȳ − β̂1 x̄ ,
where
ȳ = (1/n) Σᵢ₌₁ⁿ yi ,  x̄ = (1/n) Σᵢ₌₁ⁿ xi .
 OLS requires the condition
Σᵢ₌₁ⁿ (xi − x̄)² ≠ 0.
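The closed-form solution can be checked numerically. A sketch on simulated data (the true coefficients 1.5 and −0.8 are invented), cross-checked against numpy's own least-squares fit:

```python
import numpy as np

rng = np.random.default_rng(2)
x = rng.normal(5, 2, size=500)
y = 1.5 - 0.8 * x + rng.normal(size=500)

# Textbook formulas: b1 = sum (xi - xbar)(yi - ybar) / sum (xi - xbar)^2,
#                    b0 = ybar - b1 * xbar
xbar, ybar = x.mean(), y.mean()
b1 = ((x - xbar) * (y - ybar)).sum() / ((x - xbar) ** 2).sum()
b0 = ybar - b1 * xbar

# Cross-check against numpy's least-squares line fit
b1_np, b0_np = np.polyfit(x, y, 1)
assert np.isclose(b1, b1_np) and np.isclose(b0, b0_np)
```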
 OLS regression line or SRF
 For any set of data {(xi , yi), i = 1,2,...,n} with n > 2,
OLS can always be carried out as long as
Σᵢ₌₁ⁿ (xi − x̄)² ≠ 0.
 Once OLS estimates are obtained, ŷi = β̂0 + β̂1 xi
is known as the fitted value of y when x = xi.
 By OLS regression line or sample regression
function (SRF), we refer to
ŷ = β̂0 + β̂1 x ,
which is an estimate of the PRF E(y | x) = β0 + β1 x.
 Interpretation of OLS estimate
 In the SRF ŷ = β̂0 + β̂1 x,
the slope estimate β̂1 is the change in ŷ when x
increases by one unit: β̂1 = Δŷ / Δx ,
which is of primary interest in practice.
 The dependent variable y may be decomposed either
as the sum of the SRF and the residual,
y = ŷ + û ,
or as the sum of the PRF and the disturbance,
y = E(y | x) + u .
 PRF versus SRF
 Hope: the SRF equals the PRF on average, or approaches it as n goes to infinity.
[Figure: scatter of data points (xi, yi) from yi = β0 + β1 xi + ui, with the sample regression line β̂0 + β̂1 x and the population regression line β0 + β1 x; the residual ûi is the vertical distance from (xi, yi) to the sample regression line.]
 OLS example
 Example 2. (regress wage educ)
Population: workforce in 1976
y = wage : hourly earnings (in $)
x = educ : years of education
OLS SRF (n = 526):
wâge = −0.90 + 0.54 educ .
[Figure: scatter of wage against educ with the fitted line.]
 Interpretation
 Slope 0.54 : each additional year of schooling increases
the wage by $0.54.
 Intercept −0.90 : the fitted wage of a person with educ = 0?
The SRF does poorly at low levels of education.
 Predicted wage for a person with educ = 10?
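The last question is simple arithmetic with the fitted SRF. A tiny sketch (the function name wage_hat is mine, not STATA's):

```python
# Fitted SRF from the slide: wage-hat = -0.90 + 0.54 * educ  (wage in dollars)
def wage_hat(educ):
    return -0.90 + 0.54 * educ

print(round(wage_hat(10), 2))  # 4.5: about $4.50 per hour at educ = 10
print(round(wage_hat(0), 2))   # -0.9: nonsensical, the SRF does poorly at educ = 0
```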
 Properties of OLS
 The first-order conditions,
Σᵢ₌₁ⁿ (yi − β̂0 − β̂1 xi) = 0 ,
Σᵢ₌₁ⁿ (yi − β̂0 − β̂1 xi) xi = 0 ,
imply that
 the sum of residuals is zero.
 the sample covariance of x and the residual is zero.
 the mean point (x̄, ȳ) is always on the SRF (or OLS
regression line).
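These three algebraic properties can be confirmed on any data set. A Python sketch on simulated data (all numbers invented):

```python
import numpy as np

rng = np.random.default_rng(3)
x = rng.uniform(0, 10, size=300)
y = 1.0 + 2.0 * x + rng.normal(size=300)

b1, b0 = np.polyfit(x, y, 1)      # OLS slope and intercept
resid = y - (b0 + b1 * x)         # residuals

print(resid.sum())                # ~0: residuals sum to zero
print((resid * x).sum())          # ~0: sample covariance of x and residual is zero
print(np.isclose(y.mean(), b0 + b1 * x.mean()))  # True: mean point lies on the SRF
```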
 Sums of squares
 Each yi may be decomposed into yi = ŷi + ûi .
 Measure variations from ȳ :
 Total sum of squares (total variation in yi):
SST = Σᵢ₌₁ⁿ (yi − ȳ)² ,
 Explained sum of squares (variation in ŷi):
SSE = Σᵢ₌₁ⁿ (ŷi − ȳ)² ,
 Sum of squared residuals (variation in ûi):
SSR = Σᵢ₌₁ⁿ ûi² .
 It can be shown that
SST = SSE + SSR .
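The decomposition, and the R-squared built from it, can be verified numerically. A sketch on simulated data (coefficients invented):

```python
import numpy as np

rng = np.random.default_rng(4)
x = rng.normal(size=400)
y = 0.5 + 1.2 * x + rng.normal(size=400)

b1, b0 = np.polyfit(x, y, 1)
yhat = b0 + b1 * x
uhat = y - yhat

SST = ((y - y.mean()) ** 2).sum()     # total variation in y
SSE = ((yhat - y.mean()) ** 2).sum()  # variation in fitted values
SSR = (uhat ** 2).sum()               # variation in residuals

assert np.isclose(SST, SSE + SSR)     # the decomposition holds for OLS
R2 = 1 - SSR / SST                    # equals SSE / SST
print(round(R2, 3))
```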
 R-squared: a goodness-of-fit measure
 How well does x explain y?
or how well does the OLS regression line fit data?
 We may measure this by the fraction of the variation in y
that is explained by x (or by the SRF).
 R-squared (coefficient of determination):
R² = SSE / SST = 1 − SSR / SST .
 larger R² ⇒ better fit;
 0 ≤ R² ≤ 1.
(It is not advisable to put too much weight on R² when
evaluating regression models.)
eg. R² = 0.165 for wâge = −0.90 + 0.54 educ :
16.5% of the variation in wage is explained by educ.
 Effects of changing units of measurement
 If y is multiplied by a constant c, then the OLS
intercept and slope estimates are also multiplied by c.
 If x is multiplied by a constant c, then the OLS
intercept estimate is unchanged but the slope
estimate is multiplied by 1/c.
 The R2 does not change when varying the units of
measurement.
eg. When wage is in dollars, wâge = −0.90 + 0.54 educ .
If wage is in cents, wâge = −90 + 54 educ .
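The rescaling rules can be verified on simulated data (a Python sketch; the data below are made up, not the 1976 sample):

```python
import numpy as np

rng = np.random.default_rng(5)
educ = rng.uniform(8, 18, size=300)
wage = -0.9 + 0.54 * educ + rng.normal(size=300)

b1, b0 = np.polyfit(educ, wage, 1)
b1_c, b0_c = np.polyfit(educ, 100 * wage, 1)   # wage in cents: both estimates x100
b1_m, b0_m = np.polyfit(12 * educ, wage, 1)    # educ in months: slope x(1/12),
                                               # intercept unchanged

assert np.isclose(b1_c, 100 * b1) and np.isclose(b0_c, 100 * b0)
assert np.isclose(b1_m, b1 / 12) and np.isclose(b0_m, b0)
```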
 Nonlinear relationships between x and y
 The OLS only requires the regression model
y = β0 + β1 x + u
to be linear in parameters.
 Nonlinear relationships between y and x can be easily
accommodated.
eg. Suppose a better description is that each year of
education increases wage by a fixed percentage. This
leads to
log(wage) = β0 + β1 educ + u ,
with %Δwage = (100 β1) Δeduc when Δu = 0.
OLS: lwâge = 0.584 + 0.083 educ ,  R² = 0.186.
[Figure: scatter of lwage against educ with the fitted line.]
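The semi-elasticity reading of the log-level slope can be made concrete with a line of arithmetic (a sketch; 100·β1 is the usual approximation, exp(β1) − 1 the exact implied percentage change):

```python
import numpy as np

# Slide's fitted log-level SRF: lwage-hat = 0.584 + 0.083 * educ
b1 = 0.083

# Approximation: each year of schooling raises wage by about 100*b1 percent
approx_pct = 100 * b1
# Exact percentage change implied by the log model
exact_pct = 100 * (np.exp(b1) - 1)

print(round(approx_pct, 1))   # 8.3
print(round(exact_pct, 2))    # slightly larger, about 8.65
```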
 Nonlinear relationships between x and y
 Linear models are linear in parameters.
 OLS applies to linear models no matter how x and y
are defined.
 But be careful about the interpretation of β1.
 OLS estimators
 A random sample consists of independent draws
from the same population and is itself random.
 A data set is a realisation of the random sample.
 The OLS estimates (β̂0, β̂1) computed from a random
sample are themselves random; viewed as such, they
are called the OLS estimators.
 To make inference about the population parameters
(0, 1), we need to understand the statistical
properties of the OLS estimators.
 In particular, we would like to know the means and
variances of the OLS estimators.
 We find these under a set of assumptions about the
simple regression model.
 Assumptions about simple regression model
(SLR1 to SLR4)
1. (linear in parameters) In the population model, y is
related to x by y = β0 + β1 x + u, where (β0, β1) are
population parameters and u is the disturbance.
2. (random sample) {(xi , yi), i = 1,2,...,n} with n > 2 is a
random sample drawn from the population model.
3. (sample variation) The sample outcomes on x are
not of the same value.
4. (zero conditional mean) The disturbance u satisfies
E(u | x) = 0 for any given value of x. For the random
sample, E(ui | xi) = 0 for i = 1,2,...,n.
 Property 1 of OLS estimators
Theorem 2.1
Under SLR1 to SLR4, the OLS estimators are
unbiased: E(β̂1) = β1 , E(β̂0) = β0 .
Unbiased estimators (β̂0, β̂1)
 are centred around (β0, β1);
 correctly estimate (β0, β1) on average.
It is useful to note that (yi − ȳ) = β1(xi − x̄) + (ui − ū),
so that
β̂1 = β1 + Σᵢ₌₁ⁿ (ui − ū)(xi − x̄) / Σᵢ₌₁ⁿ (xi − x̄)² .
The estimation error is entirely driven by a linear
combination of the ui with weights dependent on x.
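Unbiasedness is a statement about repeated sampling, which a Monte Carlo experiment makes visible (a Python sketch; the true coefficients, sample size, and replication count are invented):

```python
import numpy as np

rng = np.random.default_rng(6)
b0_true, b1_true, n, reps = 1.0, 0.5, 50, 2000

slopes = np.empty(reps)
for r in range(reps):
    x = rng.uniform(0, 10, size=n)
    u = rng.normal(size=n)            # ZCM holds: u drawn independently of x
    y = b0_true + b1_true * x + u
    slopes[r] = np.cov(x, y)[0, 1] / np.var(x, ddof=1)   # OLS slope

print(round(slopes.mean(), 3))        # close to the true 0.5: beta1-hat is unbiased
```

Individual draws of β̂1 scatter around 0.5, but their average over many samples sits very close to the truth.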
 Property 2 of OLS estimators
5. (SLR5, homoskedasticity)
Var(ui | xi) = σ² for i = 1,2,...,n. (It implies Var(ui) = σ².)
Strictly, Theorem 2.2 is about the variances
of OLS estimators, conditional on given x.
Theorem 2.2
Under SLR1 to SLR5, the variances of (β̂0, β̂1) are:
Var(β̂1) = σ² / Σᵢ₌₁ⁿ (xi − x̄)² ,
Var(β̂0) = σ² (n⁻¹ Σᵢ₌₁ⁿ xi²) / Σᵢ₌₁ⁿ (xi − x̄)² .
 the larger is σ², the greater are the variances.
 the larger the variation in x, the smaller the variances.
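The two formulas are easy to evaluate for a given x and σ². A deterministic sketch (the x values and σ² = 4 are invented), which also confirms that spreading x out shrinks the slope variance:

```python
import numpy as np

sigma2 = 4.0                                  # assumed error variance
x = np.array([2.0, 4.0, 6.0, 8.0, 10.0])
sst_x = ((x - x.mean()) ** 2).sum()           # sum of squared deviations = 40

var_b1 = sigma2 / sst_x                       # 4 / 40  = 0.1
var_b0 = sigma2 * (x ** 2).mean() / sst_x     # 4 * 44 / 40 = 4.4

# Doubling the spread of x quadruples sst_x and shrinks Var(beta1-hat)
x_wide = 2 * x
sst_wide = ((x_wide - x_wide.mean()) ** 2).sum()
assert sigma2 / sst_wide < var_b1
print(var_b1, var_b0)
```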
 Homoskedasticity and heteroskedasticity
[Figure: conditional distributions of y given x with constant error variance (homoskedasticity) versus error variance that changes with x (heteroskedasticity).]
2. Simple Regression Model (Ch2)
 Estimation of σ²
 As the residual approximates u, the estimator of σ² is
σ̂² = Σᵢ₌₁ⁿ ûi² / (n − 2) = SSR / (n − 2) ,
where 2 is the number of estimated coefficients.
 σ̂ = √σ̂² is known as the standard error of the
regression, useful in forming the standard errors of
(β̂0, β̂1).
Theorem 2.3 (unbiased estimator of σ²)
Under SLR1 to SLR5, E(σ̂²) = σ² .
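The estimator σ̂² and the resulting standard errors can be sketched on simulated data (the true σ = 2 and all other numbers are invented):

```python
import numpy as np

rng = np.random.default_rng(7)
n = 5000
x = rng.uniform(0, 10, size=n)
y = 1.0 + 0.5 * x + rng.normal(scale=2.0, size=n)   # true sigma = 2

b1, b0 = np.polyfit(x, y, 1)
resid = y - (b0 + b1 * x)

sigma2_hat = (resid ** 2).sum() / (n - 2)   # SSR / (n - 2)
ser = np.sqrt(sigma2_hat)                   # standard error of the regression
# standard error of beta1-hat: sigma-hat / sqrt(sum (xi - xbar)^2)
se_b1 = ser / np.sqrt(((x - x.mean()) ** 2).sum())

print(round(ser, 2))    # close to the true sigma = 2
```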
 OLS in STATA
[Figure: STATA output for the wage regression, with the standard error of the regression and the SSR indicated.]
 Summary
 What is a simple regression model?
 What is the ZCM assumption? Why is it crucial for
model interpretation and OLS being unbiased?
 What is the OLS estimation principle?
 What are PRF, SRF, error term and residual?
 How is R-squared related to SSR?
 Can we describe, in a simple linear regression model,
the nonlinear relationship between x and y?
 What are Assumptions SLR1 to SLR5? Why do we
need to understand them?
 What are the statistical properties of OLS estimators?
 How do you run OLS in STATA? regress y x