0% found this document useful (0 votes)

55 views29 pages

Instrumental Variable: Rus'an Nasrudin

The document discusses instrumental variable analysis, which is a technique used to establish causality from observational data when non-observable factors drive treatment assignment. It introduces instrumental variables (Z) which are correlated with the treatment (S) but affect the outcome (Y) only through S. This allows estimating the average causal effect on the subgroup whose treatment status is affected by the instrument, known as the local average treatment effect (LATE). The two-stage least squares (2SLS) estimator implements instrumental variables regression in samples by first predicting S from Z and then regressing Y on the predicted S values.

Uploaded by

mfajrinurachman

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

55 views29 pages

Instrumental Variable: Rus'an Nasrudin

Uploaded by

mfajrinurachman

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 29

Instrumental Variable

Rus’an Nasrudin

November 16, 2021

Rus’an Nasrudin Instrumental Variable November 16, 2021 1 / 29

Theory

Table of Contents

1 Theory

2 Implementation: Two-stage least squares

3 Asymptotic 2SLS inference

Rus’an Nasrudin Instrumental Variable November 16, 2021 2 / 29

Theory

Introduction

When non-observable factors significantly drive the nonrandom assignment to treatment,

recovering consistent estimations of average treatment effects relying only on
observables is no longer possible.
Conditioning strategy is ineffective, in this case.
Ideally, we relies on Randomised Control Trial (RCT) data to elicit the true causal effect.
There is a technique to establish the causality using observational data called
“instrumental-variable” method, largely can be attributed to the contribution of the LATE
analysis by recent 2021 Sveriges Riksbank Prize in Economic Sciences in Memory of
Alfred Nobel.
Historical origin: simultaneous equation model (SEM) of Wright (1928) or bias in
measurement error in regression of Wald (1940) or Durbin (1954).

Rus’an Nasrudin Instrumental Variable November 16, 2021 3 / 29

Theory

Selection-on-unobservables

Recall the selection-on-observables that we learned earlier. The long regression takes form of:

Yi = α + ρSi + A0i γ + εi

and its short regression was:

Yi = α + ρSi + µi

Rus’an Nasrudin Instrumental Variable November 16, 2021 4 / 29

Theory

Selection-on-unobservables

The problem that we want to tackle initially was how to estimate ρ if Ai is unobserved.
Selection-on-observables approach reduces bias by exploiting within variation of included
control that explain the selection mechanism toward treatment variable.
There is another path of road be taken, we can use only part of the treatment variable that is
exogenously induced by some ‘instrumental’ (called Z ) variable to make ρ has causal
interpretation.
We call this approach ‘selection-on-unobservable’. That is when we do not know Ai we use Z
to purge out non-random component of S to make its prediction to Y be causal.

Rus’an Nasrudin Instrumental Variable November 16, 2021 5 / 29

Theory

DAG of IV

Suppose we want to establish causality of D on Y. The unobserved confounder U is present.

With an instrumental variable Z, the causality is identified.

Z U

D Y

The open back door D ← U → Y is unblocked, yet Z is a valid IV to make causality between D
and Y.

Rus’an Nasrudin Instrumental Variable November 16, 2021 6 / 29

Theory

What is Z ?

Z is correlated with the causal variable of interest Si but not correlated with any other
determinants of Yi or
Z is not correlated both with Ai and εi or with µi .
In other words, the only reason Z affect Yi is through Si .

Rus’an Nasrudin Instrumental Variable November 16, 2021 7 / 29

Theory

Exclusion restriction

By exclusion restriction, let’s define1 :

cov(Yi , Zi ) cov(Yi , Zi )/V(Zi )

ρ= = (1)
cov(Si , Zi ) cov(Si , Zi )/V(Zi )
The second equality expresses the covariance ratio as the regression coefficient using
variance of Z .

1
Denote that cov(Zi , Yi ) = cov(Zi , α + ρSi + µi ) = α · cov(Zi , 1) + ρ · cov(Zi , Si ) + cov(Zi , µi ). Since cov(Zi , 1) and
i ,Yi )
cov(Zi , µi ) are equal to zero we find that ρ = cov(Z
cov(Zi ,Si )
.
Rus’an Nasrudin Instrumental Variable November 16, 2021 8 / 29
Theory

IV estimand

Definition 1.1
The coefficient of interest ρ is the ratio of the population regression of Yi on Zi (the
reduced form) to the population regression of Si on Zi (the first-stage).

Rus’an Nasrudin Instrumental Variable November 16, 2021 9 / 29

Theory

Terminologies

First-stage: the regression of endogenous variable on the instrument.

Si = π10 + π11 Zi + ξ1i
Reduced form: the regression of outcome variable on the instrument.
Yi = π20 + π21 Zi + ξ2i
Structural equation: the regression of outcome variable on the endogenous variable.
Yi = α + ρSi + µi

Rus’an Nasrudin Instrumental Variable November 16, 2021 10 / 29

Theory

IV estimator

Definition 1.2
The IV estimator is the sample analog for population estimand of IV.

Rus’an Nasrudin Instrumental Variable November 16, 2021 11 / 29

Theory

Three assumptions

Exogenous instrument : The instrument is as good as randomly assigned.

Exclusion restriction: the instrument has no effect on outcome other than through the
first-stage channel.
Relevance: The instrument must have a clear effect on the endogenous variable.

Rus’an Nasrudin Instrumental Variable November 16, 2021 12 / 29

Theory

LATE Interpretation

The understanding is that if an instrument is as good as randomly assigned, affects the

outcome through a single known channel, has a first-stage, and affects the causal channel of
interest only in one direction, can be used to estimate the average causal effect on the
affected group.
This is known as LATE (Local Average Treatment Effect).