Dickey e Fuller (1981)
Dickey e Fuller (1981)
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at .
http://www.jstor.org/page/info/about/policies/terms.jsp
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact support@jstor.org.
The Econometric Society is collaborating with JSTOR to digitize, preserve and extend access to Econometrica.
http://www.jstor.org
Econometrica,Vol. 49, No. 4 (July, 1981)
Let the time series Y, satisfy Y, = a + pY,_- + e,, where Y1 is fixed and the e, are
normal independent (0, a2) random variables. The likelihood ratio test of the hypothesis
that (a, p) = (0, 1) is investigated and a limit representation for the test statistic is pre-
sented. Percentage points for the limiting distribution and for finite sample distributions
are estimated. The distribution of the least squares estimator of a is also discussed. A
similar investigation is conducted for the model containing a time trend.
1. INTRODUCTION
= Y(o) - - )9
% pj(_
a, a,i /L
where
(1.) t ot +;(n--
e n) + (- t- (t = , ,. , )
An alternative model for Y is 2
where Y1 is fixed and e, NI(0, 2). Let X denote the (n-1) x 3 matrix whose
ith row is (1,i - In, Y,) and let Y' = (Y2, Y3, . .. , Yn).Then the least squares
1057
1058 D. A. DICKEY AND W. A. FULLER
(1.4) = (XX)-
as= (&x,I3s,,$s)' x Y.
Let C, denote the ijth element of (X'X)'. Then the "regression t statistics" are
where
(1.7) = (n-4)-1Y'[I-X(X'X)-IX']Y.
Se2T
We shall study the likelihood ratio test of the hypothesis that (a, p) = (0, 1) for
model (1.2), the likelihood ratio test of the hypothesis that (a, 3,p) = (0,0, 1) for
model (1.3), and the likelihood ratio test of the hypothesis that (a, A, p) = (a, 0, 1)
for model (1.3). We also investigate the distributions of a,_,,Talk &a, 8,-,Tar, and T,,T
under the null model.
The likelihood ratio statistics are derived in Section 2 and the limiting
distributions presented in Section 3. Percentage points for the distributions
obtained by Monte Carlo methods are given in Section 4. In Section 5, it is
shown that the limit distributions of the test statistics are unchanged when {et} in
(1.1) and (1.3) is replaced by a stationary pth order autoregressive process whose
coefficients must be estimated. In Section 6 the powers of the likelihood ratio
tests (DI,02 and 03 are compared to the powers of other test statistics. Section 7
contains an illustration of the use of the test statistics.
We construct the likelihood ratio statistics for the null hypothesis that the true
model is a random walk with zero drift. We consider first the test that (a, 3,p)
= (0,0, 1) in model (1.3) against the alternative that the null is not true. The
logarithm of the likelihood function for a sample of n observations from model
(1.3), conditional on Y1, is
log L = (n - l)log(27) - (n - l)log a
n
t
-(2a) E Y- ,Bt -2 n)_ _-1]2.
t=2
Under the null hypothesis, Ho: (a, 3,p) = (0,0, 1), the likelihood is maximized
with respect to a2 to obtain
n
aO2= (n -l1)- Y-t_ 1)2.
t=2
( 0, '), where O, was defined in (1.4) and 62 = (n - 4)(n - 1)- 'S2. Therefore the
likelihood ratio is
Thus, the likelihood ratio test rejects the null hypothesis for large values of
02' where '2 iS the usual regression "F test" of the hypothesis HO:(a, /, p) =
(0,0,1).
In a similar manner, it can be shown that the likelihood ratio statistic for
testing HO:(a, p) = (0, 1) against HA: Not HO, for the model (1.1) is
[1 + 2(n 3)-'(DI 2
where
= (2S
4?1 )'[(n - l)0- (n - 3)SJ.j
The likelihood ratio test of the hypothesis HO:(a, /3,p) = (a, 0, 1) in model
(1.3) is a monotone function of
2 _ _ y( _ 1))2)}(-)S2
4> Ser)[(-1 ) {2 (y(o)
The statistics 02 and 0I3 are the common regression "F tests" one would
construct for the hypotheses. The null hypothesis for test D3 iS that the time series
is a random walk with drift a. It is easily demonstrated that the distribution of
the test statistic 03 does not depend upon a.
3. LIMITING DISTRIBUTIONS
Under the null hypotheses, the statistics introduced in Sections 1 and 2 can be
expressed as functions of a few sample statistics. Let
n t-1 2
(3.1) Fn = (n-1)-2 ej
n
Tn = (n- 1) 2 et= (n- 1)f(0),
t=2
t =2
n-1
n-i
(n - 51)/2 -
Vn= E (n -t)(t I)e,.
t= 1
1060 D. A. DICKEY AND W. A. FULLER
and
(n -
l)(A, -1) = (In-W 2) 4 [(Tn + (n - 1)2e,)
r = yi2Zi2 T= 2
i=l i=l
00 00
(2i 2), -
and {Z,}j is a sequence of normal independent (0, 1) random variables.
Therefore, given that (a, p) = (0, 1),
1 0 (n- 1) Wn
X'X = (n-1) 0 12-'n(n-2) 2(n-1)2Vn
(3.4) Dn 'X'XDn' - A,
(3.5) a-Dn1'(X'YX/XO ) f
where
1 0 W
0 1 !2 v
A= ? 12
w Iv r
2
and f' [T, IT- W, (T2 -1)]. The matrix A is invertible with probability 1
and it is readily verified that
Q + W2 6VW -w
(3.6) A-'=Q-[ 6VW 12Q + 36V2 -6Vj,
-W -6V I
where Q = -W2 -3 V2. Thus
aDn(A-_ 0) -A f
The third element of A -!f is the limit random variable for n ( - 1) given in
Dickey and Fuller [7]. Using the fact that S2 converges in probability to a2, we
obtain
A 2
,< Q2- ( Q + W2)2 (1, 0, O)A-f,
2 1,O)A- ,
--r Q 1Q+3 V2) (O,
P2
9 tAI=-l[2+ 1(2TW)2 + 2
and
A W2-3V2) 2[(2T-W)(T-6V)-1].
The limiting distributions hold for any fixed Y1 and for e, a sequence of
independent identically distributed random variables.
1062 D. A. DICKEY AND W. A. FULLER
TABLE I
EMPIRICAL DISTRIBUTION OF 7;A FOR (a, p) = (0, 1) IN Yt = a + pYt-i + e,.
(Symmetric Distribution)
TABLE II
EMPIRICAL DISTRIBUTION OF TaT FOR (a, fi, p) = (0, 0, 1) IN Yt = a + 8it + pYt-l + e,.
(Symmetric Distribution)
TABLE III
EMPIRICAL DISTRIBUTION OF 7;6, FOR (a, /3, p) = (0, 0, 1) IN Yt = a + 8it + pYt-l + e,
(Symmetric Distribution)
TABLE IV
EMPIRICALDISTRIBUTIONOF (DI FOR (a, p) = (0, 1) IN Yt = a + pY,_ 1 + et
TABLE V
EMPIRICALDISTRIBUTIONOF 02 FOR (a, /3, p) = (0, 0, 1) IN Yt = a + 8t + pY_ I+ et
TABLE VI
EMPIRICALDISTRIBUTIONOF (D3FOR (a, /3, p) = (a, 0, 1) IN Yt = a + 8t + pY,_ I + et
4. SIMULATION
Tables I-VI contain percentiles for the null distributions of the "regression t
A A A
statistics" (TaL, Ta,r,, iT,), and "regression F tests" ((D,, (21)03).The null model is
given in each table.
The empirical distributions of the statistics for finite samples were created
from statistics for samples generated by the model with Y, = 0 and Y, = Y- l +
e, t = 2,3, ... , n, for n = 25, 50, 100, 250, and 500. Three replicates of 50,000
samples were generated for n = 25, two for n = 50, 100, and 250, and one for
n = 500. The simulation of the limit case was conducted using the procedure
given in Dickey [6]. Three replicates of 50,000 were generated for the limit case.
For symmetric distributions, cells equidistant from zero were pooled to create a
symmetric histogram.
For each of the six estimators and for each sample size, the 0.01, 0.025, 0.05,
0.10, 0.90, 0.95, 0.975, and 0.99 percentage points of the distributions were
calculated. These empirical percentiles were then plotted against n. Based on the
plots, regression functions of the form P = a + /8ny were fitted to the percentiles
of the empirical distributions. Because several observations on each percentile
were available for n = 25, 50, 100, 250, and for the limit case, regression F tests
for lack of fit for the smoothing regressions were computed. Of the 36 lack of fit
statistics computed, 7 were significant at the 0.25 level, 2 at the 0.05 level, and
none at the 0.01 level. The regression smoothed percentiles are given in Tables I
through VI.
David [5, Section 2.5] gives a method for constructing distribution free
confidence intervals for the percentiles of a distribution based on empirical
percentiles. In Tables I through VI the number in the row labeled "s.e" is the
largest of the two half lengths of the 68.26% confidence intervals constructed for
n = 25 and for the limit case. These entries provide an upper bound for the
estimated standard errors of the regression smoothed percentiles.
The histogram1TaI, for 50,000sampleswith n = 25 is shownin Figure 1. Figure
2 contains the histogram for i-, constructed from 50,000 samples of size n = 25
F---- I I _ _
-6 -3 0 3 6
FIGURE 1.-Histogram for 50,000 values of T. constructed with n = 25.
LIKELIHOOD RATIO STATISTICS 1065
-6 ~~-3 0 6
FIGURE 2.-Histogram for 50,000 values of q, constructed with n = 25.
generated with 0' = (0,0, 1). The distributions are symmetric and the histograms
were constructed to be symmetric. The distributions of the T statistics are
distinctive in two respects; the distribution is bimodal and the "spread" of the
distribution is much larger than that of Student's t distribution.
(5.1) Y=0,
Yt=Yt - I + Zt (t = 2, 39. . . ),
where
is a stationary autoregressive process and the et are NID(O, a2) random variables.
The model can also be written
p
Yt = pYt-1 + E 0(Y-i-Y, I) + et,
= (a,
of the diagonal elements of H1, let Y'n /8, P, 01, 2 ... 9 9)9 and let Yndenote
the least squares estimator of 'yn.Then
Mn(Yn-Y,)
=[MYn
HTOMMnnMMn 7- 1,
where
n-p
( Is[ t -
-
gn (n -p + 1)] Y+_l Zt+-lS*** Zt )'et+P.
t=1
n
n - I
tt-(n -
+p )] Zt+p-j= OP(n-22)
t=-
n
1: Yt_lZt-1= Op(n).
t= 2
Therefore
[1 0 -2
HI, 0 l rF-232 V 9
,r-2wW -232 v I
H22 is the p x p correlation matrix of the process Zt, and r, W, and V are as
defined in (3.2). It follows that the limiting distribution of the vector composed of
the first three elements of M A(, -yn) is the same as the limiting distribution of
n 2an2 -t ( (
1)]
discussed in Section 3. Similar results are easily obtained for the regression that
does not contain the time trend.
Tables VII-IX were constructed to give information on the power of the tests.
In Table VII the power was computed for samples of size n = 100 generated for
model (1.1) with p = 0.8, 0.9, 0.95, 1.00, 1.02, and 1.05 and a = 0.0, 0.5, and 1.0.
LIKELIHOOD RATIO STATISTICS 1067
ForFor Ti T
a a n(T- n(-
4D3 42 41
Statistic
=0 0 1) 1)
0.710.86
power 0.460.57
power 0.410.78 0.00
0.57
is is p
a =
0.500.550.770.87 0.470.83 0.50
0.57 0.8
computed
computed
0.620.530.890.88 0.650.93 1.00
0.72
from
from
0.220.050.560.13
0.43
0.390.73 1.00 POWER
OF
0.040.060.060.11
0.07
0.040.08 0.00 Two
p
=
a
0.11
0.050.040.120.02 0.090.21 0.50 0.95 SIDED
SAMPLES
0.050.050.060.05 0.050.05 0.00
0.05
OF
a p=
0.050.050.260.14
0.05
0.970.98 0.50 1.0 SIZE
0.050.050.350.07 100(
1.001.00 1.00
0.04 YI
TABLE VIII
EMPIRICAL POWER OF Two SIDED SIZE 0.05 TESTS AGAINST THE STATIONARY ALTERNATIVE
FOR SAMPLESOF n = 50, 100, AND 250 (20,000 SAMPLES)
Statistic 50 100 250 50 100 250 50 100 250 50 100 250 50 100 250
>1 0.25 0.80 1.0 0.09 0.24 0.94 0.05 0.09 0.37 0.05 0.05 0.05 0.05 0.05 0.05
>2 0.09 0.44 1.0 0.04 0.10 0.64 0.04 0.04 0.15 0.04 0.04 0.04 0.05 0.05 0.05
4>3 0.17 0.59 1.0 0.08 0.16 0.78 0.06 0.07 0.23 0.06 0.05 0.06 0.05 0.05 0.05
n(p - 1) 0.30 0.87 1.0 0.10 0.29 0.97 0.05 0.09 0.43 0.05 0.04 0.05 0.05 0.05 0.05
T9 0.20 0.74 1.0 0.07 0.19 0.91 0.04 0.06 0.31 0.04 0.04 0.04 0.05 0.05 0.05
n(pA- 1) 0.14 0.57 1.0 0.06 0.14 0.78 0.05 0.05 0.21 0.05 0.04 0.04 0.05 0.05 0.05
iT 0.11 0.48 1.0 0.05 0.11 0.69 0.04 0.05 0.16 0.05 0.04 0.04 0.05 0.05 0.05
TABLE IX
EMPIRICAL POWER OF ONE SIDED SIZE 0.05 TESTS FOR SAMPLES OF SIZE 100( YI FIXED)
Statistic 0.00 0.50 1.00 0.00 0.50 1.00 0.00 0.50 1.00 0.00 0.50 1.00 0.00 0.50 1.00
d 0.97 0.95 0.89 0.53 0.31 0.03 0.23 0.01 0.00 0.08 0.00 0.00 0.05 0.00 0.00
n(pA- 1) 0.95 0.95 0.96 0.46 0.42 0.29 0.19 0.06 0.00 0.07 0.00 0.00 0.05 0.00 0.00
i 0.86 0.90 0.95 0.30 0.43 0.73 0.12 0.21 0.66 0.06 0.06 0.20 0.05 0.00 0.00
n(pA- 1) 0.73 0.72 0.72 0.24 0.20 0.12 0.10 0.06 0.01 0.05 0.04 0.02 0.05 0.05 0.06
iT 0.64 0.67 0.78 0.18 0.22 0.34 0.08 0.09 0.15 0.05 0.04 0.03 0.05 0.05 0.05
The statistic (DIis the likelihood ratio test of (a, p) = (0, 1) against the alterna-
tive (a, p) 7P(0, 1) for model (1.1). The statistic )2 iS the likelihood ratio test of
(a, 3,p) = (0,0, 1) against the general alternative of model (1.3). Note that in
models (1.1) and (1.3) the initial value Y1 is fixed. Because the alternative is
broader for 02, 02 displays smaller power than J, in Table VII where the
parameter /3 = 0 for all examples of the table. Both 01l and 02 display bias,
having power less than the size for p = 0.99. The power of both tests increases as
a increases.
The statistic I3 is the likelihood ratio test of (a, /, p) = (a, 0, 1) against the
general alternative of model (1.3). In Table VII the power of I3 is between those
of (D, and (2 for p <.99. At p = 1.02 and a = 0, the power of I3 is considerably
less than the powers of (D and (2. No bias is evident inA3.
We have included in Table VII the statistics A,,, A-n, and T discussed by
A
Fuller [8, Section 8.5]. The null distributions of the statistics and T are
computed under model (1.1) with the assumption that (a, p) = (0, 1). The distri-
butions of the statistics for (a, 1), a 7 0 differ from those with a = 0. The null
LIKELIHOOD RATIO STATISTICS 1069
Y =
-p2)-2e (t=1),
where the et are NID(0, 1) random variables. The power was computed for
20,000 samples of each of the three sample sizes. Generally speaking pL is the
most powerful of the tests considered. The test I3 is the most powerful of the
tests that permit the null model to contain drift.
Table IX contains the power for one sided tests when the true model is (1.1)
with p < 1. Included in this table is the von Neumann ratio
n -1n
d = n[E Y-n] ( Yt-t l)29
t=l ~~t=2
where
n
Yn=n-l .
t= 1
Sargan [15] gives percentiles for d when Yt is generated by model (1.1) with
(a, p) = (0, 1). For sample sizes 50 and 100 and significance level 0.05 we use the
percentiles from Sargan's paper. The jth percentile of the limit distribution for d
is the reciprocal of the (100-j)th percentile in the table of Anderson and
Darling [2, p. 203]. For finite sample sizes not considered by Sargan, we use the
asymptotic percentiles as critical values for the power calculations of Table IX.
Fuller [9] has constructed modifications of the statistic d that are applicable to
higher order autoregressive processes and to model (1.3). The methods used to
generate the samples of Table IX are those used to generate the samples of Table
VII with p < 1. The statistic d is an appropriate test when the alternative is that
Y, is a stationary first order autoregressive time series. It displays good power for
this alternative (that is, when a = 0 and p < 1). The statistic n(p - 1) is only
slightly less powerful than d for a = 0 and maintains somewhat better power for
a &0. For p < 1 and a # 0 the estimator P is closer to one on the average than
the corresponding estimator associated with a = 0. Therefore, the tests A andT,
display poor power for values of p close to one and a #0 O.Because the estimator
p, converges to p for p < 1, there is some sample size for any p < 1 for which the
1070 D. A. DICKEY AND W. A. FULLER
TABLE X
MODELS AND TEST STATISTICS
Test
Null Model Alternative Model Statistic
= c + Yt- I +Y
Yt= a +,_t + pY,I+e, A A
Yt = Yt- I + et Yt =a +pYt- I + et d
P < I
aPoor power for Y, not stationary, a =, 0, small n, and p less than, but close to one.
statistics will have power greater than the size. Because the null distributions are
derived under the assumption that (a, p) = (0, 1), there is no sample size for
which the tests d, p, and are appropriate if the alternative includes a :# 0 and
p= 1.
The test statistics discussed in this section and the hypotheses for which they
are appropriate are summarized in Table X.
7. EXAMPLE
To illustrate the use of the tables we study the logarithm of the quarterly
Federal Reserve Board Production Index 1950-1 through 1977-4. We assume
that the time series is adequately represented by the model
where e, are independent identically distributed (0, a2) random variables. The
ordinary least squares estimates are
-
= 0.065966 - 0.056448 5-95
=
3(0.000533)
where 0.000533 = 0.056448/106 is the residual mean square for the full model
regression. As there are 110 observations in the regression the 97.5 per cent point
of the distribution of (12, as given in Table V, is 5.59. Therefore the hypothesis
Po= ,B = 0 and a1 = 1 is rejected at the 2.5 per cent level.
To test the hypothesis that f,3 = 0 and a1 = 1 against the general alternative
(7.1) we compute
The 95 per cent point of the distribution is given in Table VI as 6.49 and the
90 per cent point as 5.47. Therefore at the 5 per cent level one could accept the
hypothesis that the second order autoregressive process has a unit root with
possible drift under the maintained hypothesis that the process is second order.
The null hypothesis would be rejected at the 10 per cent level. We note that on
the basis of Table 8.5.2 of Fuller [8] the statistic
;.-0.119.
0? 9- 6
3.61
TT=
0.033
would lead to rejection of the hypothesis of a unit root at the 10 per cent level if
a two sided test is performed. If the alternative is that both roots are less than
one in absolute value the hypothesis of a unit root is rejected at the 5 per cent
level.
REFERENCES
[1] ANDERSON, T. W.: "On Asymptotic Distributions of Estimates of Parameters of Stochastic
Difference Equations," Annals of Mathematical Statistics, 30(1959), 676-687.
[2] ANDERSON, T. W., AND D. A. DARLING: "Asymptotic Theory of Certain 'Goodness of Fit'
Criteria Based on Stochastic Processes," Annals of Mathematical Statistics, 23(1952), 193-212.
[3] Box, G. E. P., AND G. M. JENKINS: Time Series Analysis Forecasting and Control. San Francisco:
Holden-Day, 1970.
[4] BULMER, M. G.: "The Statistical Analysis of Density Dependency," Biometrics, 31(1975),
901-911.
[5] DAVID, H. A.: Order Statistics. New York: John Wiley and Sons, 1970.
[6] DICKEY, D. A.: "Estimation and Hypothesis Testing in Nonstationary Time Series," Iowa State
University Ph.D. thesis, 1976.
1072 D. A. DICKEY AND W. A. FULLER
[7] DICKEY,D. A., AND W. A. FULLER: "Distribution of the Estimators for Autoregressive Time
Series with a Unit Root," Journal of the American Statistical Association, 74(1979), 427-431.
[8] FULLER, W. A.: Introductionto Statistical Time Series. New York: John Wiley and Sons, 1976.
[9] : "Testing the Autoregressive Process for a Unit Root," Report to the U.S. Census
Bureau, 1979.
[10] HASZA, D.: "Estimation in Nonstationary Time Series," Iowa State University Ph.D. thesis,
1977.
[11] MANN, H. B., AND A. WALD: "On Stochastic Limit and Order Relationships," Annals of
Mathematical Statistics, 14(1943), 217-226.
[12] RAo, M. M.: "Consistency and Limit Distributions of Estimators of Parameters in Explosive
Stochastic Difference Equations," Annals of Mathematical Statistics, 32(1961), 195-218.
[13] : "Asymptotic Distribution of An Estimator of the Boundary Parameter of an Unstable
Process," Annals of Statistics, 6(1978), 185-190.
[14] RUBIN,H.: "Consistency of Maximum-Likelihood Estimates in the Explosive Case," in Statisti-
cal Inference in Dynamic Economic Models, ed. by T. C. Koopmans, 1950.
[15] SARGAN,J. D.: "The Durbin-Watson Ratio of the Gaussian Random Walk," London School of
Economics manuscript, 1973.