Optimal Income Taxation 3
Optimal Income Taxation 3
Stefanie Stantcheva
1 77
GOALS OF THESE LECTURES
1) Understand the core optimal income tax model: linear and nonlinear
taxes in the Saez (2001) framework.
2 77
OPTIMAL TAXATION: SIMPLE MODEL WITH NO BEHAVIORAL
RESPONSES
3 77
SIMPLE MODEL WITH NO BEHAVIORAL RESPONSES
Form lagrangian: L = [u (z − T (z )) + λ · T (z )] · h (z )
∂L
0= = [−u 0 (z − T (z )) + λ] · h(z ) ⇒ u 0 (z − T (z )) = λ
∂T (z )
4 77
Utilitarianism and Redistribution
utility
𝑐1 + 𝑐2
𝑢
2
𝑢(𝑐1 ) + 𝑢(𝑐2 )
2
0 𝑐1 𝑐1 + 𝑐2 𝑐2
consumption 𝑐
2
ISSUES WITH SIMPLE MODEL
The issue is the restricted nature of social preferences that can be captured
by most social welfare functions.
We will solve the Mirrleesian model later. For now, let’s look at the spirit of
optimal tax evolution.
Optimal income tax trades-off redistribution and efficiency (as tax based on
w only not feasible)
⇒ T (.) < 0 at bottom (transfer) and T (.) > 0 further up (tax) [full
integration of taxes/transfers]
2) Marginal tax rate T 0 (.) should be zero at the top (if skill distribution
bounded) [Sadka ’76-Seade ’77]
7 77
HISTORY: BEYOND MIRRLEES
Till late 1990s, Mirrlees results not closely connected to empirical tax
studies and little impact on tax policy recommendations
Since late 1990s, Diamond AER’98, Piketty ’97, Saez ReStud ’01 have
connected Mirrlees model to practical tax policy / empirical tax studies
8 77
INTENSIVE LABOR SUPPLY CONCEPTS
max u ( c , z ) subject to c = z · (1 − τ ) + R
c,z + −
(1 − τ ) ∂z
Uncompensated elasticity εu =
z ∂(1 − τ )
∂z
Income effects η = (1 − τ ) ≤0
∂R
9 77
INTENSIVE LABOR SUPPLY CONCEPTS (II)
(1 − τ ) ∂z c
Compensated elasticity εc = >0
z ∂(1 − τ )
∂z ∂z c ∂z
Slutsky equation = +z ⇒ εu = εc + η
∂(1 − τ ) ∂(1 − τ ) ∂R
10 77
Labor Supply Theory
c= Indifference
consumption Curves
c = (1-t)z+R
0 earnings supply z
Labor Supply Theory
𝑐= utility 𝑢
consumption
Slope=1-τ
0 earnings supply z
Labor Supply Income Effect
𝑐
𝜂=(1−t)𝜕𝑧/𝜕𝑅 ≤0
R+∆R
R
z(1-τ,R+ΔR) z(1-τ,R)
0 Earnings z
Labor Supply Substitution Effect
𝑐
utility 𝑢
slope= 1-τ+dτ
slope=1-τ
εc= (1-τ)/z ∂zc/ ∂ (1-τ)>0
zc(1-τ,u) zc(1-τ+dτ,u)
0 Earnings z
Uncompensated Labor Supply Effect
𝑐
Slutsky equation: εu = εc + η
slope=1-τ+dτ
income effect
𝜂≤0
slope=1-τ
R
substitution effect: εc>0
0 Earnings z
Labor Supply Effects of Taxes and Transfers
11 77
Effect of Tax on Labor Supply
𝑐= z-T(z)
T(z) < 0:
income effect z ↓
T’(z) > 0: slope=1-T’ (z)
substitution effect
z ↓
T(z) > 0: income effect z ↑
T’(z)>0: substitution effect z ↓
-T(0)
0 z↑∗
pre-tax income z
WELFARE EFFECT OF SMALL TAX REFORM
Welfare impact of a small tax reform is given by uc times the money metric
mechanical change in tax
12 77
WELFARE EFFECT OF SMALL TAX REFORM (II)
When is the welfare impact not just the mechanical change in disposable
income?
∂F ∗ ∂G
V 0 (θ ) = (x (θ ), θ ) − λ∗ (θ ) (x ∗ (θ ), θ )
∂θ ∂θ
13 77
SOCIAL WELFARE FUNCTIONS (SWF)
Any other social objective will lead to Pareto dominated outcomes in some
circumstances (Kaplow and Shavell JPE’01) Why?
14 77
SOCIAL MARGINAL WELFARE WEIGHTS
16 77
OPTIMAL LINEAR TAX RATES: PLAN
1) The optimal linear tax formula (on all income z ∈ [0, ∞)).
17 77
OPTIMAL LINEAR TAX RATE: FORMULA
19 77
Laffer Curve
Tax
Revenue R = 𝜏 ∙ 𝑍(1 − 𝜏)
R 1 1−𝜏 𝑑𝑍
𝜏∗= with 𝑒 = ∙
1+𝑒 𝑍 𝑑(1−𝜏)
0 𝜏∗ 1 𝜏: Tax Rate
OPTIMAL TOP INCOME TAX RATE (SAEZ ’01)
! Careful, what is e?
Note that e is a mix of income and substitution effects (see Saez ’01)
21 77
Optimal Top Income Tax Rate (Mirrlees ’71 model)
Disposable
Income
c=z-T(z) Top bracket:
Slope 1-τ
z*-T(z*) Reform:
Slope 1-τ−dτ
0 z* Market
income z
Source: Diamond and Saez JEP'11
Optimal Top Income Tax Rate (Mirrlees ’71 model)
Disposable
Income Mechanical tax increase:
c=z-T(z) dτ[z-z*]
z*-T(z*)
Behavioral Response tax loss:
τ dz = - dτ e z τ/(1-τ)
0 z* z Market
income z
Source: Diamond and Saez JEP'11
OPTIMAL TOP INCOME TAX RATE
2) Welfare effect:
dW = −ḡ dM = −ḡ [z − z ∗ ]dτ
where ḡ is the social marginal welfare weight for top earners
τ (1 − ḡ )[z − z ∗ ]
=
1−τ e ·z
1 − ḡ z
τ= with a=
1 − ḡ + a · e z − z∗
23 77
OPTIMAL LINEAR RATES: RECAP
1) The optimal linear tax formula (on all income z ∈ [0, ∞)):
R
∗ 1 − ḡ gi · zi
τ = with ḡ = R , gi = G 0 (u i )uci
1 − ḡ + e Z · gi
24 77
SUFFICIENT STATS FORMULA
25 77
ZERO TOP RATE RESULT
When z ∗ → z T ⇒ z → z T
τ
dM = dτ [z − z ∗ ] << dB = dτ · e · z when z∗ → zT
1−τ
1 − ḡ
τ=
1 − ḡ + a · e
Only difficult parameter to estimate is e
27 77
3
ym /(ym − y ∗ ) with ym = E(y|y > y ∗ )
αY = y ∗ hY (y ∗ )/(1 − HY (y ∗ ))
2.5
Empirical Pareto coefficient
1.5
0.5
0
$200,000 $400,000 $600,000 $800,000 $1,000,000
Total Income
3
rkm /(rkm − rk ∗ ) with rkm = E(rk|rk > rk ∗ )
αK = rk ∗ hK (rk ∗ )/(1 − HK (rk ∗ ))
2.5
Empirical Pareto coefficient
1.5
0.5
0
$200,000 $400,000 $600,000 $800,000 $1,000,000
Capital Income
TOP TAX REVENUE MAXIMIZING TAX RATE
In the end, ḡ reflects the value that society puts on marginal consumption
of the rich
30 77
EXTENSIONS AND LIMITATIONS
31 77
GENERAL NON-LINEAR INCOME TAX T (z )
(2) Marginal tax rate schedule T 0 (z ) describing how (a) lump-sum grant is
taxed away, (b) how tax liability increases with income
ddz
Behavioral response:
z = - d e z/(1-T’(z))
Tax loss: T’(z) z h(z)dz
= -h(z) e z T’(z)/(1-T’(z)) dzd
Optimum dM + dW + dB = 0
33 77
GENERAL NON-LINEAR INCOME TAX
1 − G (z )
T 0 (z ) =
1 − G ( z ) + α ( z ) · e(z )
34 77
2.5
Empirical Pareto Coefficient
1.5 1 2
⇒ Desirable reform
35 77
MIRRLEES MODEL
The difference to before: we need to specify the structural primitives.
Why did we not talk about this in the earlier derivations? Did we ignore
the incentive compatibility constraints?
36 77
Elasticity of labor to taxes
Recall we derive elasticities on the linearized budget set. If marginal tax
rate is τ, labor supply is: l = l (n (1 − τ )). Why the n (1 − τ )? Why only
n (1 − τ )?
n (1 − τ ) = v 0 (l )
d (n (1 − τ )) = v 00 (l )dl
dl (1 − τ )n (1 − τ )n v 0 (l )
⇒e= = =
d (n (1 − tau )) l lv 00 (l ) lv 00 (l )
Incentive compatibility:
38 77
Direct Revelation Mechanism and Incentive Compatibility
We want to max social welfare and have exogenous revenue requirement
(non transfer-related E ).
Incentive compatibility:
z (n ) 0 z (n 0 )
c (n ) − v ≥ c (n ) − v ∀n, n0
n n
That’s a lot of constraints!
38 77
Envelope Theorem and First order Approach
Replace the infinity of constraints with agents’ first-order condition. If we
take derivative of utility wrt type n at truth-telling
0
dun 0 z 0 (n ) 0 z (n ) dn z (n ) 0 z (n )
= c (n ) − v + 2 v
dn n n dn n n
Envelope condition:
dun ln v 0 (ln )
=
dn n
R R R
maxcn ,un ,zn G (un )f (n )dn s.t. cn f (n )dn ≤ nln f (n )dn − E
n n n
ln v 0 (ln )
and s.t. du
dn =
n
n
State variable: un .
40 77
Hamiltonian and Optimal Control
FOCs:
∂H φ (n )
= p · [n − v 0 (ln )]f (n) + · [v 0 (ln ) + ln v 00 (ln )] = 0
∂ln n
∂H dφ(n )
= [ G 0 ( un ) − p ] f ( n ) = −
∂un dn
Transversality: limn→∞ φ(n ) = 0 and φ(0) = 0.
41 77
Rearranging the FOCs
Take the integral of the FOC wrt un to solve for φ(n ):
Z ∞
Integrate this same FOC over the full space, using transversality conditions:
Z ∞
p= G 0 (un )f (m )dm
0
How can we make the tax rate appear? Use the agent’s FOC.
n − v 0 (ln ) = nT 0 (zn )
42 77
Obtaining the Optimal Tax Formula
(1−T 0 (zn ))n
Recall that e = lv 00 (l )
43 77
Let’s go from types to observable income
How do we go from type distribution to income distribution?
dzn dl
= l + (1 − τ )n = ln · (1 + e )
dn d (m (1 − τ ))
(intuition?)
45 77
Recap:
Yield same formula if can make the link between types and income
distributions.
! R ∞
T 0 ( zn ) 1 n (1 − gm )dF (m)
= 1+ (primitives)
1 − T 0 (zn ) e ( zn ) nf (n )
1 1 − H ( zn )
= · (1 − G (zn )) (incomes)
e ( zn ) zn h (zn )
46 77
NUMERICAL SIMULATIONS
1 z 1+ 1e
u (c, z ) = c − 1
·
1+ e
n
47 77
NUMERICAL SIMULATIONS
Z ∞
T 0 (z (n )) 1 1 G 0 (u (m ))
= 1+ 1− f (m )dm,
1 − T 0 (z (n )) e nf (n ) n λ
Iterative Fixed Point method: start with T00 , compute z 0 (n ) using individual
FOC,R get T 0 (0) using govt budget, compute u 0 (n ), get λ using
λ = G 0 (u )f , use formula to estimate T10 , iterate till convergence
48 77
NUMERICAL SIMULATION RESULTS
1 − G (z )
T 0 (z ) =
1 − G ( z ) + α ( z ) · e(z )
Take utility function with e constant
49 77
FIGURE 5 − Optimal Tax Simulations
Utilitarian Criterion, Utility type I Utilitarian Criterion, Utility type II
1 1
0.2 0.2
0 0
$0 $100,000 $200,000 $300,000 $0 $100,000 $200,000 $300,000
Wage Income z Wage Income z
0.2 0.2
0 0
$0
$100,000 $200,000 $300,000 $0 $100,000 $200,000 $300,000
Source: Saez (2001), p. 224 Wage Income z Wage Income z
EXTENSION 1: MIGRATION EFFECTS
Migration issues may be particularly important at the top end (brain drain).
50 77
ELASTICITY OF MIGRATION TO TAXES
Migration responds to average taxes (or total taxes, since income fixed).
∂P (c|z ) z − T (z )
ηm (z ) =
∂c P (c|z )
T (z )
Fiscal cost of raising taxes by dT (z ) is: B = − z−T (z ) · P (c|z ) · ηm
T (z ) 1
= · (1 − g (z ))
z − T (z ) ηm (z )
51 77
MIGRATION EFFECTS IN THE STANDARD MODEL
52 77
EXTENSION 2: RENT SEEKING EFFECTS
Pay may not be equal to the marginal economic product for top income
earners. Why? Overpaid or underpaid?
u i (c, η, y ) = c − hi (y ) − ki (η)
53 77
RENT SEEKING ELASTICITIES
Given tax, individual maximizes:
u i (c, y , η) = η · y − T (η · y ) − hi (y ) − ki (η)
What will yi and ηi depend on?
1−τ dz
Total compensation elasticity e: e = z d (1−τ ) (what is it driven by?)
1−τ dy
Real labor supply elasticity ey : ey = y d (1−τ ) ≥ 0.
db 1−τ
Thus the bargaining elasticity component eb = d (1−τ ) z
= s · e with
db/d (1−τ )
s= dz/d (1−τ )
T = τ [y (1 − τ ) + b (1 − τ ) − z ∗ ]q − E (b )
1 + a · eb a(y /z )ey
τ∗ = = 1−
1+a·e 1+a·e
How does τ change with e, ey , and eb ? When is τ ∗ = 1 optimal?
∗
Every tax payer with income z above z ∗ pays additional dτdz ∗ , valued at
(1 − g (z ))dτdz ∗ .
Z ∞
∗
M = dτdz (1 − g (z ))h(z )dz
z∗
56 77
BEHAVIORAL EFFECT PART 1: SUBSTITUTION
In [z ∗ , z ∗ + dz ∗ ], income changes by dz.
Marginal tax rate changes directly by dτ, but also additionally indirectly
by dT 0 (z ) = T 00 (z )dz. Why? When is this not the case?
dτ + dT 0 (z ) dτ
dz = −ε(cz ) z ∗ ⇒ dz = −ε(cz ) z ∗
1 − T 0 (z ) 1 − T 0 (z ) + ε(cz ) z ∗ T 00 (z )
Define the virtual density: density that would occur at z if tax schedule
replaced by linearized tax schedule. What is the linearized schedule (τ, R )
such that income is (1 − τ )z + R?
h ∗ (z ) h (z )
=
0
1 − T (z ) 1 − T (z ) + ε(cz ) z ∗ T 00 (z )
0
58 77
BEHAVIORAL EFFECT PART 1: SUBSTITUTION
T 0 (z ) ∗ ∗
E = −ε(cz ) z ∗ h (z )dτdz ∗
1 − T 0 (z )
59 77
BEHAVIORAL EFFECT PART 2: INCOME EFFECT
Taxpayers with income above z ∗ pay −dR = dτdz ∗ additional taxes. Their
change in income is:
T 00 dz dτdz ∗ dτdz ∗
dz = −ε(cz ) z − η ⇒ dz = −η
1 − T0 1 − T 0 (z ) 1 − T 0 (z ) + zε(cz ) T 00 (z )
Why?
Z ∞
∗ T 0 (z ) ∗
I = dτdz −η(z ) h (z )dz
z∗ 1 − T 0 (z )
At the optimum: M + E + I = 0.
60 77
PUTTING THE EFFECTS TOGETHER
T 0 (z ) 1 1 − H (z ∗ )
= c
1 − T 0 (z ) ε(z ) z ∗ h ∗ ( z ∗ )
Z ∞ Z ∞
h (z ) T 0 (z ) h ∗ (z )
× (1 − g (z )) dz + −η dz
z∗ 1 − H (z ∗ ) z∗ 1 − T 0 (z ) 1 − H (z ∗ )
First-order differential equation. See Saez (2001) Appendix for solution (is
standard).
z˙n 1+ε(uzn )
Recall with a linear tax: zn = n .
What happens with nonlinear tax? See Saez (2001) Appendix for derivation.
(i.e, commodity taxes not useful) under two assumptions on utility functions
u h (c1 , .., cK , z )
Note: With weaker linear income taxation tool (Diamond-Mirrlees AER ’71,
Diamond JpubE ’75), need v (c1 , .., cK ) homothetic (linear Engel curves,
Deaton EMA ’81) to obtain no commodity tax result
64 77
Generalization of Atkinson-Stiglitz to Heterogeneous Tastes –
Saez (2002)
Can we generalize AS to case with heterogeneous consumption
preferences?
Budget constraint q · c ≤ z − T (z ).
Imagine dt1 .
P h
Mechanical revenue effect: dM1 = = C1 dt1 .
h c1 dt1
P
Welfare effect (envelope theorem): dU1 = − h g h c1h dt1 .
P
Behavioral labor supply response: dB1 = − h T 0 (z h )dzth1 with
∂z h
dzth1 = dt1 ∂q 1
.
66 77
Can commodity taxation improve welfare? (II)
Find a small income tax reform that “mimics” commodity tax change:
dT (z ) = C1 (z )dt1 .
Mechanical
P Revenue effect:
P
dMT = h dT (z ) = h C1 (z h )dt1 = C1 dt1 = dM1 (why?)
h
P
Welfare effect: dWT = − h g h C1 (z h )dt1
P
Behavioral effect: dBT = − h T 0 (z h )dzTh .
67 77
Can commodity taxation improve welfare? (III)
X X
dW dzTh dT (z h ) dz1h
=− g h [c1h − C1 (z h )] + T 0 (z h ) · −
dt1 dT (z h ) dt1 dt1
| h {z } |h {z }
Pure Welfare Effect Behavioral Effect
68 77
Behavioral Effects under Commodity and Income Taxation
Since T 0 (z h ) ≥ 0 (remember), increasing dt1 > 0 more efficient than
equivalent income tax increase if labor supply increase from commodity tax
change larger than that of income tax change.
zch dc1h zRh
E [dzth1 ] = −dt1 E +E c h
1 + T 00 (z )zch dz 1 + T 00 (z )zch 1
zch dC1 (z ) zRh
E [dzTh ] = −dt1 E +E C1 ( z )
1 + T 00 (z )zch dz 1 + T 00 (z )zch
This is the key assumption. What does it say? Why is this not
mechanically true?
h h h h
= limdz→0 E (c1 |z =z +dzdz)−E (c1 |z =z ) is cross-sectional variation in
dC1 (z )
dz
consumption of good 1 when income changes.
h
dc
What is E dz1 |z h = z ?
70 77
Assumptions needed for behavioral effects to be the same under
Commodity and Income Taxation
Imagine 2 groups:
Group A’: Individuals from group A who are forced to reduce their income
to z − dz. Reduce their
consumption relative to group A by
0 dc1h h
dc1 = E dz |z = z dz.
71 77
WHEN ATKINSON-STIGLITZ ASSUMPTIONS FAIL
Thought experiment we just did was: force high earners to work less and
earn only as much as low earners: if high earners consume more of good k
than low earners, taxing good 1 is desirable.
1) High earners are “different” (since if left to chose, chose to work more. If
they have a relatively higher/lower taste for good 1 (independently of
income), tax more/less good 1. [indirect tagging] Cigarettes? Fancy wine?
How would you see this empirically?
Atkinson, A.B. and J. Stiglitz “The design of tax structure: Direct versus indirect
taxation”, Journal of Public Economics, Vol. 6, 1976, 55-75. (web)
Besley, T. and S. Coate “Workfare versus Welfare: Incentives Arguments for Work
Requirements in Poverty-Alleviation Programs”, American Economic Review, Vol.
82, 1992, 249-261. (web)
Brewer, M., E. Saez, and A. Shephard “Means Testing and Tax Rates on Earnings”,
in The Mirrlees Review: Reforming the Tax System for the 21st Century, Oxford
University Press, 2010. (web)
73 77
Cremer, H., F. Gahvari, and N. Ladoux “Externalities and optimal taxation”, Journal
of Public Economics, Vol. 70, 1998, 343-364. (web)
Deaton, A. “Optimal Taxes and the Structure of Preferences”, Econometrica, Vol. 49,
1981, 1245-1260 (web)
Edgeworth, F. “The Pure Theory of Taxation”, The Economic Journal, Vol. 7, 1897,
550-571. (web)
77
Kaplow, L. “On the undesirability of commodity taxation even when income taxation
is not optimal”, Journal of Public Economics, Vol. 90, 2006, 1235-1250. (web)
Kleven, H., C. Kreiner and E. Saez “The Optimal Income Taxation of Couples”,
Econometrica, Vol. 77, 2009, 537-560. (web)
Lehmann, E., L. Simula, A. Trannoy “Tax Me if You Can! Optimal Nonlinear Income
Tax between Competing Governments,” Quarterly Journal of Economics 129(4),
2014, 1995-2030. (web)
75 77
Mankiw, G. and M. Weinzierl “The Optimal Taxation of Height: A Case Study of
Utilitarian Income Redistribution”, AEJ: Economic Policy, Vol. 2, 2010, 155-176.
(web)
77
Sadka, E. “On Income Distribution, Incentives Effects and Optimal Income
Taxation”, Review of Economic Studies, Vol. 43, 1976, 261-268. (web)
Seade, Jesus K. “On the shape of optimal tax schedules.” Journal of public
Economics 7.2 (1977): 203-235. (web)
77 77