Quantitative Social Science Methods, I,
Lecture Notes: Research Designs for Causal Inference

Gary King (GaryKing.org)
Institute for Quantitative Social Science
Harvard University

August 17, 2020

Components of Causal Estimation Error

Research Designs

Issues in Ideal Designs

Components of Causal Estimation Error 2 / 25 .

Reference

• Kosuke Imai, Gary King, and Elizabeth Stuart. Misunderstandings among Experimentalists and Observationalists: Balance Test Fallacies in Causal Inference. Journal of the Royal Statistical Society, Series A, 171, Part 2 (2008): 1–22.
• http://j.mp/MisExpObs


Notation

• Sample 𝑛 units from a finite population of size 𝑁 (typically 𝑁 ≫ 𝑛)
• Observed outcome variable: 𝑌𝑖
• Sample selection: 𝐼𝑖 = 1 if selected, 0 otherwise
• Treatment assignment: 𝑇𝑖 = 1 if treated group, 0 if control
• (Assume: treated and control groups are each of size 𝑛/2)
• Potential outcomes: 𝑌𝑖(1) and 𝑌𝑖(0), the values 𝑌𝑖 takes when 𝑇𝑖 is 1 or 0
• Fundamental problem of causal inference: only one potential outcome is ever observed.
    If 𝑇𝑖 = 0:  𝑌𝑖(0) = 𝑌𝑖,  𝑌𝑖(1) = ?
    If 𝑇𝑖 = 1:  𝑌𝑖(0) = ?,  𝑌𝑖(1) = 𝑌𝑖
• (𝐼𝑖 , 𝑇𝑖 , 𝑌𝑖 ) are random; 𝑌𝑖 (1) and 𝑌𝑖 (0) are fixed.
• Quiz: How can 𝑌𝑖 be random when 𝑌𝑖 (0) and 𝑌𝑖 (1) are fixed?


Quantities of Interest

• Treatment Effect (for unit 𝑖):

TE𝑖 ≡ 𝑌𝑖 (1) − 𝑌𝑖 (0)

• Population Average Treatment Effect

$$\text{PATE} \equiv \frac{1}{N} \sum_{i=1}^{N} \text{TE}_i$$

• Sample Average Treatment Effect

$$\text{SATE} \equiv \frac{1}{n} \sum_{i \in \{I_i = 1\}} \text{TE}_i$$
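
The quantities above can be computed directly in a simulation, where both potential outcomes are known for every unit. The following is a minimal sketch, not part of the original slides: the population size, the outcome distributions, and the helper names `simulate_population` and `average_effect` are all hypothetical choices made for illustration.

```python
import random

def simulate_population(N, seed=0):
    """Return fixed potential outcomes (y0, y1) for N hypothetical units."""
    rng = random.Random(seed)
    y0 = [rng.gauss(50, 10) for _ in range(N)]
    # Heterogeneous unit-level effects; the mean of 5 is an arbitrary choice.
    y1 = [a + rng.gauss(5, 2) for a in y0]
    return y0, y1

def average_effect(y0, y1, units):
    """Average of TE_i = Y_i(1) - Y_i(0) over the given unit indices."""
    units = list(units)
    return sum(y1[i] - y0[i] for i in units) / len(units)

N, n = 10_000, 200
y0, y1 = simulate_population(N)

rng = random.Random(1)
sample = rng.sample(range(N), n)          # units with I_i = 1
pate = average_effect(y0, y1, range(N))
sate = average_effect(y0, y1, sample)

# Random assignment: exactly n/2 treated and n/2 control within the sample.
treated = set(rng.sample(sample, n // 2))

# Observed outcome: only one potential outcome is revealed per unit.
y_obs = {i: (y1[i] if i in treated else y0[i]) for i in sample}

print(f"PATE = {pate:.3f}, SATE = {sate:.3f}")
```

In this sketch the potential outcomes never change between runs with the same population seed; only `sample` and `treated` (that is, 𝐼𝑖 and 𝑇𝑖) vary with the second seed, and `y_obs` varies with them.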


Decomposition of Causal Effect Estimation Error

• Difference in means estimator

$$D \equiv \bar{Y}_1 - \bar{Y}_0 = \left( \frac{1}{n/2} \sum_{i \in \{I_i = 1,\, T_i = 1\}} Y_i \right) - \left( \frac{1}{n/2} \sum_{i \in \{I_i = 1,\, T_i = 0\}} Y_i \right)$$

• Pretreatment confounders: observed 𝑋 ; unobserved 𝑈

• Decomposition

$$\Delta \equiv \text{PATE} - D \quad \text{(estimation error)}$$

$$\Delta = \Delta_S + \Delta_T = (\Delta_{S_X} + \Delta_{S_U}) + (\Delta_{T_X} + \Delta_{T_U})$$

Error due to: Δ𝑆 (sample selection), Δ𝑇 (treatment imbalance), and each due to observed (𝑋𝑖) and unobserved (𝑈𝑖) covariates
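
A small simulation can make the first level of this decomposition concrete. The sketch below is an illustration under stated assumptions rather than anything from the slides: it takes Δ𝑆 = PATE − SATE (the definition given under "Decomposing Selection Error") and treats Δ𝑇 as the remainder, SATE − 𝐷, which is what Δ = Δ𝑆 + Δ𝑇 implies. The population and its outcome distributions are hypothetical. The further split into 𝑋 and 𝑈 components is not attempted here.

```python
import random

rng = random.Random(42)
N, n = 5_000, 100

# Fixed potential outcomes for a hypothetical population.
y0 = [rng.gauss(0, 1) for _ in range(N)]
y1 = [a + 2 + rng.gauss(0, 1) for a in y0]

def mean(values):
    values = list(values)
    return sum(values) / len(values)

sample = rng.sample(range(N), n)              # I_i = 1
treated = set(rng.sample(sample, n // 2))     # T_i = 1 for half the sample
control = [i for i in sample if i not in treated]

pate = mean(y1[i] - y0[i] for i in range(N))
sate = mean(y1[i] - y0[i] for i in sample)

# The difference in means uses only the observed outcomes.
d = mean(y1[i] for i in treated) - mean(y0[i] for i in control)

delta = pate - d          # total estimation error
delta_s = pate - sate     # error due to sample selection
delta_t = sate - d        # error due to treatment imbalance (the remainder)

print(f"Delta = {delta:.4f} = {delta_s:.4f} + {delta_t:.4f}")
```

The two pieces add up to Δ by construction; the value of the exercise is seeing how large each piece is for a given sampling and assignment draw.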


Decomposing Selection Error
Δ = Δ𝑆 + Δ𝑇 = (Δ𝑆𝑋 + Δ𝑆𝑈 ) + Δ𝑇
• Definition

$$\Delta_S \equiv \text{PATE} - \text{SATE} = \frac{N - n}{N} (\text{NATE} - \text{SATE}), \quad \text{NATE: nonsample ATE}$$

• Δ𝑆 vanishes if
  • The sample is a census (𝐼𝑖 = 1 for all observations and 𝑛 = 𝑁);
  • SATE = NATE (i.e., nothing to correct)
  • Switch quantity of interest from PATE to SATE (recommended!)
• Δ𝑆𝑋 = 0 when empirical distribution of (observed) 𝑋 is identical in population and sample:

$$\tilde{F}(X \mid I = 0) = \tilde{F}(X \mid I = 1)$$

• Δ𝑆𝑈 = 0 when empirical distribution of (unobserved) 𝑈 is identical in population and sample:

$$\tilde{F}(U \mid I = 0) = \tilde{F}(U \mid I = 1)$$

• Unverifiable: 𝑋 unobserved out of sample; 𝑈 unobserved
• Δ𝑆𝑋: vanishes if weighting on 𝑋 (and examples exist in sample)
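
The second equality in the definition of Δ𝑆 is stated without derivation. A short version, assuming NATE denotes the average of TE𝑖 over the 𝑁 − 𝑛 units not selected into the sample, writes PATE as a weighted average of the in-sample and out-of-sample effects:

```latex
\begin{align*}
\text{PATE} &= \frac{1}{N}\sum_{i=1}^{N}\text{TE}_i
  = \frac{n}{N}\,\text{SATE} + \frac{N-n}{N}\,\text{NATE}, \\
\Delta_S &= \text{PATE} - \text{SATE}
  = \frac{N-n}{N}\,\text{NATE} - \frac{N-n}{N}\,\text{SATE}
  = \frac{N-n}{N}\,(\text{NATE} - \text{SATE}).
\end{align*}
```

Both conditions listed for Δ𝑆 to vanish can be read off the last expression: the factor (𝑁 − 𝑛)/𝑁 is zero for a census, and the difference is zero when SATE = NATE.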
Decomposing Treatment Imbalance
Δ = Δ𝑆 + Δ𝑇 = Δ𝑆 + (Δ𝑇𝑋 + Δ𝑇𝑈 )

• Δ𝑇𝑋 = 0: when 𝑋 balanced between treateds and controls:

$$\tilde{F}(X \mid T = 1, I = 1) = \tilde{F}(X \mid T = 0, I = 1)$$

Verifiable; generated ex ante by blocking or ex post via matching or modeling
• Δ𝑇𝑈 = 0: when 𝑈 balanced between treateds and controls:

$$\tilde{F}(U \mid T = 1, I = 1) = \tilde{F}(U \mid T = 0, I = 1)$$

Unverifiable; achieved only by assumption or, on average, by random treatment assignment
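
The contrast between exact balance from blocking and balance only on average from randomization can be checked in a toy simulation. The sketch below is a hypothetical illustration: the binary covariate, the block sizes, and the function names are assumptions, not taken from the slides. For a binary 𝑋, equal means in the two groups imply identical empirical distributions, so the difference in means is used as the imbalance measure.

```python
import random

def complete_randomization(x, rng):
    """Assign exactly half of all units to treatment, ignoring x."""
    ids = list(range(len(x)))
    treated = set(rng.sample(ids, len(ids) // 2))
    return [1 if i in treated else 0 for i in ids]

def blocked_randomization(x, rng):
    """Randomize within each block, where a block is one value of x."""
    t = [0] * len(x)
    for value in set(x):
        block = [i for i, xi in enumerate(x) if xi == value]
        for i in rng.sample(block, len(block) // 2):
            t[i] = 1
    return t

def imbalance(x, t):
    """Difference in the mean of x between treated and control units."""
    x1 = [xi for xi, ti in zip(x, t) if ti == 1]
    x0 = [xi for xi, ti in zip(x, t) if ti == 0]
    return sum(x1) / len(x1) - sum(x0) / len(x0)

rng = random.Random(7)
# Hypothetical binary covariate with even-sized blocks (40 ones, 60 zeros).
x = [1] * 40 + [0] * 60

blocked = [imbalance(x, blocked_randomization(x, rng)) for _ in range(200)]
unblocked = [imbalance(x, complete_randomization(x, rng)) for _ in range(200)]

print("max |imbalance|, blocked:   ", max(abs(b) for b in blocked))
print("max |imbalance|, unblocked: ", max(abs(u) for u in unblocked))
print("mean imbalance, unblocked:  ", sum(unblocked) / len(unblocked))
```

Under these assumptions every blocked assignment is exactly balanced on 𝑋, while individual unblocked assignments are not, even though their imbalance averages out near zero across assignments. Neither procedure says anything verifiable about 𝑈.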


Alternative Quantities of Interest: For Matching
• Population average treatment effect on the treated

$$\text{PATT} \equiv \frac{1}{N^*} \sum_{i \in \{T_i = 1\}} \text{TE}_i$$

($N^* = \sum_{i=1}^{N} T_i$: number of treated units in population)
• Sample average treatment effect on the treated

$$\text{SATT} \equiv \frac{1}{n/2} \sum_{i \in \{I_i = 1,\, T_i = 1\}} \text{TE}_i$$

• Analogous estimation error decomposition holds:

$$\Delta' = \text{PATT} - D = (\Delta'_{S_X} + \Delta'_{S_U}) + (\Delta'_{T_X} + \Delta'_{T_U})$$

• Quiz: Why PATT and SATT rather than PATE and SATE for
matching?
• Quiz: How do they differ in randomized experiments?
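
To show how a treated-unit quantity such as SATT connects to matching, here is a minimal sketch of exact matching on a single discrete covariate. It is an illustration only: the function `exact_match_satt` and the toy data are hypothetical, and real applications would match on many covariates. Each treated unit's unobserved 𝑌𝑖(0) is imputed by the mean outcome of controls with the same covariate value.

```python
from collections import defaultdict

def exact_match_satt(rows):
    """Estimate SATT by exact matching on a discrete covariate x.

    rows: list of (x, t, y) tuples with t in {0, 1}.
    Treated units with no exact control match are pruned.
    Returns (estimate, number of matched treated units).
    """
    controls = defaultdict(list)
    for x, t, y in rows:
        if t == 0:
            controls[x].append(y)

    effects = []
    for x, t, y in rows:
        if t == 1 and x in controls:
            y0_hat = sum(controls[x]) / len(controls[x])   # imputed Y_i(0)
            effects.append(y - y0_hat)
    return sum(effects) / len(effects), len(effects)

# Hypothetical toy data: (x, t, y)
rows = [
    ("a", 1, 12.0), ("a", 0, 10.0), ("a", 0, 8.0),
    ("b", 1, 25.0), ("b", 0, 20.0),
    ("c", 1, 40.0),            # no control with x == "c", so this unit is pruned
]

estimate, n_matched = exact_match_satt(rows)
print(estimate, n_matched)     # 4.0 2
```

Note that pruning the unmatched treated unit changes the quantity being estimated: the result is an average over the matched treated units only, not over all treated units in the sample.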
Effects of Design Components on Estimation Error
Δ = Δ𝑆 + Δ𝑇 = (Δ𝑆𝑋 + Δ𝑆𝑈 ) + (Δ𝑇𝑋 + Δ𝑇𝑈 )

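| Design Choice | Δ𝑆𝑋 | Δ𝑆𝑈 | Δ𝑇𝑋 | Δ𝑇𝑈 |
|---|---|---|---|---|
| Random sampling | avg=0 | avg=0 | | |
| Complete stratified random sampling | =0 | avg=0 | | |
| Focus on SATE rather than PATE | =0 | =0 | | |
| Weighting for nonrandom sampling | =0 | =? | | |
| Large sample size | →? | →? | →? | →? |
| Random treatment assignment | | | avg=0 | avg=0 |
| Complete blocking | | | =0 | =? |
| Exact matching | | | =0 | =? |
| Assumption: | | | | |
| No selection bias | avg=0 | avg=0 | | |
| Ignorability | | | | avg=0 |
| No omitted variables | | | | =0 |

(avg=0: equal to zero on average, i.e., in expectation)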


Comparing Blocking (i.e., before) and Matching (i.e., after)
• Adding blocking (on pretreatment vars related to outcome) to
random assignment: as or more efficient, and never biased
• Blocking: like regression adjustment, where functional form
and the parameter values are known
• Matching is like blocking, except:
• to avoid selection error: change QOI from PATE to PATT/SATT
• random treatment assignment following matching:
impossible
• Exact matching, unlike blocking: dependent on good matches
in already-collected data
• Worst case scenario: matching on wrong vars (like regression
adjustment) can increase bias
• Adding matching to a parametric model: reduces model
dependence and bias, and sometimes variance too
• Quiz: Which is preferable: Matching or Blocking?

Research Designs

The Benefits of Major Research Designs: Overview
| Design | Δ𝑆𝑋 | Δ𝑆𝑈 | Δ𝑇𝑋 | Δ𝑇𝑈 |
|---|---|---|---|---|
| Ideal experiment | →0 | →0 | =0 | →0 |
| Randomized clinical trials (limited or no blocking) | ≠0 | ≠0 | avg=0 | avg=0 |
| Randomized clinical trials (full blocking) | ≠0 | ≠0 | =0 | avg=0 |
| Social science field experiment (limited or no blocking) | ≠0 | ≠0 | →0 | →0 |
| Survey experiment (limited or no blocking) | →0 | →0 | →0 | →0 |
| Observational study (representative data set, well-matched) | ≈0 | ≈0 | ≈0 | ≠0 |
| Observational study (unrepresentative but partially correctable data, well-matched) | ≈0 | ≠0 | ≈0 | ≠0 |
| Observational study (unrepresentative data set, well-matched) | ≠0 | ≠0 | ≈0 | ≠0 |

• →0: 𝐸(𝑄) = 0 and lim𝑛→∞ Var(𝑄) = 0
• avg=0: 𝐸(𝑄) = 0

The Ideal Experiment (according to the paper)

• Random selection from well-defined population

• large 𝑛
• blocking on all known confounders
• random treatment assignment within blocks
• 𝐸(Δ𝑆𝑋 ) = 0, lim𝑛→∞ 𝑉 (Δ𝑆𝑋 ) = 0
• 𝐸(Δ𝑆𝑈 ) = 0, lim𝑛→∞ 𝑉 (Δ𝑆𝑈 ) = 0
• Δ𝑇𝑋 = 0
• 𝐸(Δ𝑇𝑈 ) = 0, lim𝑛→∞ 𝑉 (Δ𝑇𝑈 ) = 0
• Quiz: Is there an even more ideal experiment?
• Hint: How can we make Δ𝑆𝑋 = 0?

An Even More Ideal Experiment (not in the paper)

• Begin with a well-defined population

• New feature: Define sampling strata based on
cross-classification of all known confounders
• Random sampling within strata
• (if strata sample ∝ population size, no weights needed)
• large 𝑛
• blocking on all known confounders
• random treatment assignment within blocks
• Δ𝑆𝑋 = 0
• 𝐸(Δ𝑆𝑈 ) = 0, lim𝑛→∞ 𝑉 (Δ𝑆𝑈 ) = 0
• Δ𝑇𝑋 = 0
• 𝐸(Δ𝑇𝑈 ) = 0, lim𝑛→∞ 𝑉 (Δ𝑇𝑈 ) = 0
• Wait, why wasn’t this in the paper?

Randomized Clinical Trials (Little or no Blocking)

• nonrandom selection
• small 𝑛
• little or no blocking
• random treatment assignment
• Δ𝑆𝑋 ≠ 0
• Δ𝑆𝑈 ≠ 0
• 𝐸(Δ𝑇𝑋 ) = 0
• 𝐸(Δ𝑇𝑈 ) = 0

Randomized Clinical Trials (Full Blocking)

• nonrandom selection
• small 𝑛
• Full blocking
• random treatment assignment
• Δ𝑆𝑋 ≠ 0
• Δ𝑆𝑈 ≠ 0
• Δ𝑇𝑋 = 0
• 𝐸(Δ𝑇𝑈 ) = 0

Social Science Field Experiment

• nonrandom selection
• large 𝑛
• limited or no blocking
• random treatment assignment
• Δ𝑆𝑋 ≠ 0 or change PATE to SATE and Δ𝑆𝑋 = 0
• Δ𝑆𝑈 ≠ 0 or change PATE to SATE and Δ𝑆𝑈 = 0
• 𝐸(Δ𝑇𝑋 ) = 0, lim𝑛→∞ 𝑉 (Δ𝑇𝑋 ) = 0
• 𝐸(Δ𝑇𝑈 ) = 0, lim𝑛→∞ 𝑉 (Δ𝑇𝑈 ) = 0

Survey Experiment

• random selection
• large 𝑛
• limited or no blocking
• random treatment assignment
• (only treatments: question wording changes)
• 𝐸(Δ𝑆𝑋 ) = 0, lim𝑛→∞ 𝑉 (Δ𝑆𝑋 ) = 0
• 𝐸(Δ𝑆𝑈 ) = 0, lim𝑛→∞ 𝑉 (Δ𝑆𝑈 ) = 0
• 𝐸(Δ𝑇𝑋 ) = 0, lim𝑛→∞ 𝑉 (Δ𝑇𝑋 ) = 0
• 𝐸(Δ𝑇𝑈 ) = 0, lim𝑛→∞ 𝑉 (Δ𝑇𝑈 ) = 0

Observational Study, well-matched

• no stratification, nonrandom selection

• large 𝑛
• no blocking, nonrandom treatment assignment
• Δ𝑆𝑋 ≈ 0 if representative, corrected by weighting, or for
estimating SATE; or ≠ 0 otherwise
• Δ𝑆𝑈 ≠ 0
• Δ𝑇𝑋 ≈ 0 (due to matching well)
• Δ𝑇𝑈 ≠ 0 except by assumption

Issues in Ideal Designs


What is the Best Design?

• Ideal design: rarely feasible

• Effort in experimental studies: random assignment
• Effort in observational studies: knowing, measuring, and
adjusting for 𝑋 (via matching or modeling)
• Achilles' heel of experiments: Δ𝑆, small 𝑛
• Achilles' heel of observational studies: Δ𝑇
• Each design: best suited to its own applications
• Quiz: Astronomers never randomize; is astronomy a science?


Fallacies in Experimental Research

• Failure to block on all available confounders

• incorrectly seen as requiring fewer assumptions (about what
to block on)
• In fact, blocking helps (except in strange situations)
• Blocking on relevant covariates is better, so choose carefully.
• “Block what you can and randomize what you cannot”
• t-test to check balance after random treatment assignment
• blocking vars: balance exactly after treatment assignment; if
you’re checking, you missed an opportunity to increase
efficiency
• if vars become available after treatment assignment: t-test
checks if randomization was done appropriately
• randomization balances on average: any one random assignment is not balanced exactly (which is why it's better to block)


The Balance Test Fallacy in Matching Research

[Figure: two panels, each plotted against the Number of Controls Randomly Dropped (0 to 400). Axis and legend labels recovered from the original: t-statistic, with a "statistical insignificance" region marked; Math test score, with curves for QQ Plot Mean Deviation and Difference in Means.]

Quiz: randomly dropping observations reduces imbalance??
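
The pattern behind this quiz can be reproduced in a small simulation. The sketch below is a hypothetical illustration, not the data behind the figure: the covariate distributions, group sizes, and the helper `drop_controls` are assumptions. It uses a Welch-style two-sample t-statistic computed by hand and averages over repeated random draws of which controls to drop.

```python
import random
import statistics as st

def t_statistic(a, b):
    """Two-sample (Welch) t-statistic for the difference in means."""
    se = (st.variance(a) / len(a) + st.variance(b) / len(b)) ** 0.5
    return (st.mean(a) - st.mean(b)) / se

def drop_controls(treated, controls, n_drop, rng, reps=100):
    """Average difference in means and average t-statistic after
    randomly dropping n_drop controls, over reps random draws."""
    diffs, ts = [], []
    for _ in range(reps):
        kept = rng.sample(controls, len(controls) - n_drop)
        diffs.append(st.mean(treated) - st.mean(kept))
        ts.append(t_statistic(treated, kept))
    return st.mean(diffs), st.mean(ts)

rng = random.Random(3)
# Hypothetical covariate with a true imbalance of 10 points.
treated = [rng.gauss(60, 15) for _ in range(100)]
controls = [rng.gauss(50, 15) for _ in range(500)]

results = {k: drop_controls(treated, controls, k, rng) for k in (0, 200, 400, 450)}
for k, (diff, t) in results.items():
    print(f"dropped={k:3d}  mean diff={diff:6.2f}  mean t={t:5.2f}")
```

Under these assumptions the average difference in means stays roughly where it started while the t-statistic shrinks, because dropping observations lowers power without improving balance. A falling t-statistic is therefore not evidence of reduced imbalance.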


The Balance Test Fallacy: Explanation

• Hypothesis tests: a function of both balance and power; we want only balance

• Balance is observed: no need for a superpopulation or inference
• Simple linear model (for intuition):
• Suppose 𝐸(𝑌 ∣ 𝑇, 𝑋) = 𝜃 + 𝑇𝛽 + 𝑋𝛾
• Bias in coefficient on 𝑇 from regressing 𝑌 on 𝑇 (without 𝑋): 𝐸(𝛽̂ − 𝛽 ∣ 𝑇, 𝑋) = 𝐺𝛾 (where 𝐺 are the coefficients from a regression of 𝑋 on a constant and 𝑇)
• Imbalance: 𝐺, Importance: 𝛾
• If 𝐺 = 0, bias=0
• If 𝐺 ≠ 0, bias can be any size (due to 𝛾 )
• To reduce bias: reduce 𝐺 without limit
• No threshold level is safe
• But prune too much, variance increases
• Quiz: Should we match on vars that do not influence 𝑌 ?
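
The bias expression 𝐺𝛾 from the simple linear model can be checked numerically. The sketch below is an illustration under explicit assumptions: a single covariate, a binary treatment, hypothetical parameter values, and a noise-free outcome so that the identity holds exactly in the sample rather than only in expectation. The helper names `slope` and `short_regression_bias` are made up for this example.

```python
import random

def slope(t, v):
    """OLS slope from regressing v on a constant and t."""
    n = len(t)
    t_bar, v_bar = sum(t) / n, sum(v) / n
    cov = sum((a - t_bar) * (b - v_bar) for a, b in zip(t, v))
    var = sum((a - t_bar) ** 2 for a in t)
    return cov / var

rng = random.Random(11)
n, theta, beta = 1_000, 1.0, 2.0          # hypothetical parameter values
t = [1] * (n // 2) + [0] * (n // 2)

def short_regression_bias(shift, gamma):
    """Return (G, bias) when treated units' X is shifted up by `shift`."""
    x = [rng.gauss(0, 1) + shift * ti for ti in t]
    # Noise-free outcome: Y = theta + T*beta + X*gamma exactly.
    y = [theta + beta * ti + gamma * xi for ti, xi in zip(t, x)]
    g = slope(t, x)                # imbalance: X regressed on a constant and T
    bias = slope(t, y) - beta      # error of the regression that omits X
    return g, bias

for shift in (0.0, 0.1, 0.5):
    for gamma in (0.5, 5.0):
        g, bias = short_regression_bias(shift, gamma)
        print(f"G={g:6.3f}  gamma={gamma:4.1f}  "
              f"bias={bias:7.3f}  G*gamma={g * gamma:7.3f}")
```

The printed columns `bias` and `G*gamma` agree in every row, and a small 𝐺 paired with a large 𝛾 can still produce a sizable bias, which is the sense in which no threshold level of imbalance is safe.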
