Macroeconomic Forecasting with Indexes
Macroeconomic Forecasting with Indexes
To cite this article: James H Stock & Mark W Watson (2002) Macroeconomic Forecasting Using Diffusion Indexes,
Journal of Business & Economic Statistics, 20:2, 147-162, DOI: 10.1198/073500102317351921
Taylor & Francis makes every effort to ensure the accuracy of all the information (the “Content”)
contained in the publications on our platform. However, Taylor & Francis, our agents, and our
licensors make no representations or warranties whatsoever as to the accuracy, completeness, or
suitability for any purpose of the Content. Any opinions and views expressed in this publication are
the opinions and views of the authors, and are not the views of or endorsed by Taylor & Francis.
The accuracy of the Content should not be relied upon and should be independently verified with
primary sources of information. Taylor and Francis shall not be liable for any losses, actions, claims,
proceedings, demands, costs, expenses, damages, and other liabilities whatsoever or howsoever
caused arising directly or indirectly in connection with, in relation to or arising out of the use of the
Content.
This article may be used for research, teaching, and private study purposes. Any substantial
or systematic reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or
distribution in any form to anyone is expressly forbidden. Terms & Conditions of access and use can
be found at http://amstat.tandfonline.com/page/terms-and-conditions
Macroeconomic Forecasting Using
Diffusion Indexes
James H. Stock
Kennedy School of Government, Harvard University, and National Bureau of Economic Research,
Cambridge, MA 02138
Mark W. Watson
Woodrow Wilson School, Princeton University, Princeton, NJ 08544, and National Bureau of
Economic Research
This article studies forecasting a macroeconomic time series variable using a large number of predictors.
The predictors are summarized using a small number of indexes constructed by principal component
analysis. An approximate dynamic factor model serves as the statistical framework for the estimation of
the indexes and construction of the forecasts. The method is used to construct 6-, 12-, and 24-month-
ahead forecasts for eight monthly U.S. macroeconomic time series using 215 predictors in simulated real
time from 1970 through 1998. During this sample period these new forecasts outperformed univariate
Downloaded by [Linköping University Library] at 10:51 12 August 2013
147
148 Journal of Business & Economic Statistics, April 2002
ing simulated out-of-sample mean square forecast errors that 2.2 Estimation and Forecasting
are one-third less than those of the benchmark models.
Because 8Ft 91 h 1 ‚h 4L5, and ƒh 4L5 are unknown, forecasts
of yT Ch based on (2.4) and (2.5) are constructed using a two-
2. ECONOMETRIC FRAMEWORK step procedure. First, the sample data 8X t 9TtD1 are used to esti-
2.1 An Approximate Dynamic Factor Model mate a time series of factors (the diffusion indexes), 8Fbt 9TtD1 .
Second, the estimators O h 1 ‚O h 4L5 and ƒOh 4L5 are obtained
We begin with a discussion of the statistical model that by regressing ytC1 onto a constant, Fbt and yt (and lags). The
motivates the D I forecasts. Let ytC1 denote the scalar series forecast of yTh Ch is then formed as O h C ‚O h 4L5FbT C ƒOh 4L5yT .
to be forecast and let, Xt be an N -dimensional multiple time Stock and Watson (1998) developed theoretical results for
series of predictor variables, observed for t D 11 : : : 1 T , where this two-step procedure applied to (2.3) and (2.4). The factors
yt and Xt are both taken to have mean 0. (The different time are estimated by principal components because these estima-
subscripts used for y and X emphasize the forecasting rela- tors are readily calculated even for very large N and because
tionship.) We suppose that 4Xt 1 ytC1 5 admit a dynamic factor principal components can be generalized to handle data irreg-
model representation with rN common dynamic factors ft , ularities as discussed later. Under a set of moment conditions
for 4…1 e1 F 5 and an asymptotic rank condition on å, the feasi-
ytC1 D ‚4L5ft C ƒ4L5yt C …tC1 1 (2.1)
ble forecast is asymptotically rst-order ef cient in the sense
Xit D ‹i 4L5ft C eit 1 (2.2) that its mean square forecast error (MSE) approaches the
Downloaded by [Linköping University Library] at 10:51 12 August 2013
United States. The complete dataset spans 1959:1 to 1998:12. where Fbt is the vector of k estimated factors. Results for three
Four of these eight variables are the measures of real economic variants of (3.3) are reported. The rst, denoted in the tables
activity used to construct the Index of Coincident Economic by D I-AR, Lag, includes lags of the factors and lags of yt , with
Indicators maintained by the Conference Board (formerly by k and lag orders m and p estimated by Bayesian information
the U.S. Department of Commerce): total industrial production criterion (B IC), with 1 µ k µ 4, 1 µ m µ 3, and 0 µ p µ 6.
(ip); real personal income less transfers (gmyxpq); real man- Thus the smallest candidate model that B IC can choose here
ufacturing and trade sales (msmtq); and number of employ- includes only a single contemporaneous factor and excludes
ees on nonagricultural payrolls (lpnag). (Additional details are yt . The second, denoted D I-AR, includes contemporaneous Fbt ,
given in Appendix B, which lists series by the mnemonics that is, m D 1, and k and p are chosen by B IC with 1 µ
given here in parenthesis.) The remaining four series are price k µ 12 and 0 µ p µ 6. The third, denoted D I, includes only
indexes: the consumer price index (punew); the personal con- contemporaneous Fbt , so p D 01 m D 1, and k is chosen by B IC,
sumption expenditure implicit price de ator (gmdc); the con- 1 µ k µ 12.
sumer price index (CP I) less food and energy (puxx); and the The full dataset used to estimate the factors contains 215
producer price index for nished goods (pwfsa). These series monthly time series for the United States from 1959:1 to
and the predictor series were taken from the May 1999 release 1998:12. The series were selected judgmentally to represent 14
of the DR I/McGraw–Hill Basic Economics database (formerly main categories of macroeconomic time series: real output and
Citibase). In general these series represent the fully revised income; employment and hours; real retail, manufacturing, and
historical series available as of May 1999, and in this regard trade sales; consumption; housing starts and sales; real inven-
Downloaded by [Linköping University Library] at 10:51 12 August 2013
the forecasting results will differ from results that would be tories and inventory-sales ratios; orders and un lled orders;
calculated using real-time data. stock prices; exchange rates; interest rates; money and credit
For each series, several forecasting models are compared at quantity aggregates; price indexes; average hourly earnings;
the 6-, 12-, and 24-month forecasting horizons: D I forecasts and miscellaneous. The list of series is given in Appendix B
based on estimated factors, a benchmark univariate autoregres- and is similar to lists we have used elsewhere (Stock and
sion, and benchmark multivariate models. For both the real Watson 1996, 1999). These series were taken from a some-
and the price series, one of the benchmark multivariate models what longer list, from which we eliminated series with gross
is a trivariate vector autoregression, and a second is based on problems, such as rede nitions. However, no further pruning
leading economic indicators. As a further comparison, in a- was performed.
tion forecasts are also computed using an unemployment- The theory outlined in Section 2 assumes that Xt is I(0),
based Phillips curve. so these 215 series were subjected to three preliminary steps:
Our focus is on multistep-ahead prediction, and most of the possible transformation by taking logarithms, possible rst dif-
forecasting regressions are projections of an h-step-ahead vari- ferencing, and screening for outliers. The decision to take log-
able ytCh
h
onto t-dated predictors, sometimes including lagged arithms or to rst difference the series was made judgmentally
transformed values yt of the variable of interest. The real vari- after preliminary data analysis, including inspection of the data
ables are modeled as being I(1) in logarithms. Because all and unit root tests. In general, logarithms were taken for all
four real variables are treated identically, consider industrial nonnegative series that were not already in rates or percentage
production, for which units. Most series were rst differenced. A code summarizing
these transformations is given for each series in Appendix B.
h
ytCh D 41200=h5 ln4 IPtCh = IPt 5 After these transformations, all series were further standard-
ized to have sample mean zero and unit sample variance.
and yt D 1200 ln4 IPt = IPtƒ1 50 (3.1) Finally, the transformed data were screened automatically for
The price indexes are modeled as being I(2) in logarithms. outliers (generally taken to be coding errors or exceptional
The I(2) speci cation is consistent with standard Phillips curve events such as labor strikes), and observations exceeding 10
equations and is a good description of the series over much of times the interquartile range from the median were replaced
the sample period. However, I(1) speci cations also provide by missing values.
adequate descriptions of the data, particularly in the early part Using this transformed and screened dataset, three sets of
of the sample. Stock and Watson (1999) found little difference empirical factors were constructed. The rst was computed
in I(1) and I(2) factor model forecasts for these prices over the using principal components from the subset of 149 variables
sample period studied here, so for the sake of brevity we limit available for the full sample period (the balanced panel). The
our analysis to the I(2) speci cation. Accordingly, for the CP I second set of factors was computed using the nonbalanced
(and similarly for the other price series), panel of all 215 series using the methods of Appendix A. The
third set of factors was computed by stacking the 149 variables
in the balanced panel with their rst lags, so the augmented
h
ytCh D 41200=h5 ln4CP ItCh =CP It 5 ƒ 1200 ln4CP It =CP Itƒ1 5
data vector has dimension 298. Empirical factors were then
and yt D 1200ã ln4CP It =CP Itƒ1 50 (3.2) estimated by the principal components of the stacked data, as
discussed in Section 2.
Diffusion Index Forecasts. Following (2.5), the most gen- Autoregressive Forecast. The autoregressive forecast is a
eral D I forecasting function is univariate forecast based on (3.3), where the terms involving
X
m X
p Fb are excluded. The lag order p was selected recursively by
yOTh Ch—T D O h C ‚O 0hj F
bT ƒjC1 C ƒOhj yT ƒjC1 1 (3.3) B IC with 0 µ p µ 6, where p D 0 indicates that yt and its lags
jD1 jD1 are excluded.
150 Journal of Business & Economic Statistics, April 2002
Vector Autoregressive Forecast. The rst multivariate new orders in durable goods industries (mdoq), the nominal
benchmark model is a vector autoregression (VAR) with p M1 money supply (fm1), the federal funds overnight interest
lags each of three variables. One version of the VAR used rate (fyff), and the interest rate spread between 1-year U.S.
p D 4 lags, and another version selected p recursively by B IC. treasury bonds and the federal funds rate (sfygt1). The remain-
The xed-lag VARs performed somewhat better than the B IC ing variable is the trade-weighted exchange rate listed in the
selected lag lengths (which often set p D 1), and we report previous paragraph.
results for the xed lag speci cations in the results to follow. In all cases, the leading indicators were transformed so that
The variables in the VAR are a measure of the monthly growth Wt is I(0). This entailed taking logarithms of variables not
in real activity, the change in monthly in ation, and the change already in rates and differencing all variables except the inter-
in the 90-day U.S. treasury bill rate. When used to forecast the est rate spreads, housing starts, the index of vendor perfor-
real series, the relevant real activity variable was used and the mance, and the help wanted index.
in ation measure was CP I in ation. For forecasting in ation, For each variable to be forecast, p and m in (3.4) were
the relevant price series was used and the real activity measure determined by recursive B IC with 1 µ m µ 4 and 0 µ p µ 6,
was industrial production. Multistep forecasts were computed so 28 possible models were compared in each time period.
by iterating the VAR forward. This contrasts to the autoregres- Phillips Curve Forecasts. The unemployment-based
sive forecasts, which were computed by h-step-ahead projec- Phillips curve is considered by many to have been a reliable
tion rather than iteration. method for forecasting in ation over this period (Gordon
1982; Congressional Budget Of ce 1996; Fuhrer 1995; Gor-
Downloaded by [Linköping University Library] at 10:51 12 August 2013
ber of people working part-time in nonagricultural industries factors, and so forth were then reestimated, information cri-
because of slack work; real manufacturers’ un lled orders in teria were recomputed, and models were selected using data
durable goods industries; a trade-weighted index of nominal from 1959:1 through 1970:2, and forecasts from these models
exchange rates between the United States and the U.K., West were then computed for y197022
h
Ch . The nal simulated out of
Germany, France, Italy, and Japan; and the National Associ- sample forecast was made in 1998212 ƒ h for y1998212h
.
ation of Purchasing Managers’ index of vendor performance
(the percent of companies reporting slower deliveries). 4. EMPIRICAL RESULTS
For the in ation forecasts, eight leading indicators are used.
4.1 Forecasting Results
These variables were chosen because of their good individ-
ual performance in previous in ation forecasting exercises. In The results for the real variables are reported in detail in
particular these variables performed well in at least one of the Table 1 for 12-month-ahead forecasts, and summaries for 6-
historical episodes considered by Staiger, Stock, and Watson and 24-month-ahead forecasts are reported in Table 2. Two
(1997) (also see Stock and Watson 1999). Seven of these vari- sets of statistics are reported. The rst is the MSE of the can-
ables are also used in the factor-estimation step in the diffu- didate forecasting model, computed relative to the MSE of the
sion index forecasts: the total unemployment rate (lhur), real univariate autoregressive forecast (so the autoregressive fore-
manufacturing and trade sales (msmtq), housing starts (hsbr), cast has a relative MSE of 1.00). For example, the simulated
Stock and Watson: Macroeconomic Forecasting Using Diffusion Indexes 151
Industrial production Personal income Mfg & trade sales Nonag. employment
Forecast
method Rel. MSE O Rel. MSE O Rel. MSE O Rel. MSE O
Benchmark models
AR 1000 1000 1000 1000
LI 086 (027) 057 (013) 097 (021) .52 (.15) 082 (025) 063 (017) 089 (023) .56 (.14)
VAR 097 (007) 075 (068) 098 (005) .68 (.34) 098 (004) 073 (058) 1005 (009) .22 (.41)
Full dataset (N D 215)
DI-AR, Lag 057 (027) 076 (013) 077 (014) .76 (.13) 048 (025) 099 (015) 091 (013) .63 (.18)
DI-AR 063 (025) 071 (012) 086 (016) .61 (.12) 057 (024) 084 (018) 099 (031) .51 (.20)
DI 052 (026) 088 (017) 086 (016) .61 (.12) 056 (023) 094 (020) 092 (026) .55 (.20)
Balanced panel (N D 149)
DI-AR, Lag 067 (025) 070 (013) 082 (015) .70 (.13) 056 (023) 091 (016) 088 (014) .68 (.18)
DI-AR 067 (025) 070 (012) 092 (014) .57 (.12) 061 (023) 080 (017) 088 (022) .58 (.17)
DI 059 (025) 081 (017) 092 (014) .57 (.12) 057 (023) 091 (018) 084 (021) .62 (.16)
Stacked balance panel
DI-AR 065 (025) 070 (012) 093 (015) .56 (.12) 061 (022) 089 (019) 1002 (030) .49 (.14)
Downloaded by [Linköping University Library] at 10:51 12 August 2013
DI 062 (025) 081 (018) 093 (015) .56 (.12) 066 (021) 085 (020) 095 (024) .53 (.14)
Full dataset; m D 1, p D BIC, k ’ xed
DI-AR, k D 1 1006 (011) 027 (034) 1003 (008) .34 (.41) 098 (006) 063 (046) 1001 (009) .49 (.24)
DI-AR, k D 2 063 (025) 076 (014) 078 (014) .77 (.14) 053 (024) 093 (015) 077 (013) .82 (.15)
DI-AR, k D 3 056 (026) 084 (014) 077 (015) .77 (.13) 052 (023) 099 (016) 084 (014) .75 (.20)
DI-AR, k D 4 054 (026) 085 (014) 076 (015) .78 (.14) 051 (023) 1001 (016) 083 (015) .73 (.19)
Full dataset; m D 1, p D 0, k ’ xed
DI, k D 1 1003 (007) 030 (049) 1001 (009) .46 (.34) 098 (005) 067 (049) 1001 (009) .48 (.24)
DI, k D 2 055 (025) 089 (015) 078 (014) .76 (.13) 057 (024) 095 (017) 078 (013) .83 (.16)
DI, k D 3 051 (025) 1000 (016) 077 (015) .77 (.13) 060 (021) 1002 (019) 084 (014) .76 (.19)
DI, k D 4 049 (025) 1000 (016) 076 (015) .78 (.14) 059 (022) 1003 (020) 082 (015) .75 (.18)
RMSE, AR Model .049 .027 .045 .017
out of sample MSE of the leading indicator (L I) forecast of models in 10 of the 12 variable-horizon combinations, the
industrial production is 86% that of the autoregressive fore- exceptions being 6- and 12-month-ahead forecasts of employ-
cast at the 12-month horizon. Autocorrelation consistent stan- ment. In most cases the performance of the simpler D I fore-
dard errors for these relative MSEs, calculated following West casts, which exclude lags of F bt and yt , is comparable to or
(1996), are reported in parentheses. The second set of statistics even better than that of the D I-AR, Lag forecasts. This is
is the coef cient on the candidate forecast from the forecast rather surprising, because it implies that essentially all the
combining regression, predictable dynamics of these series are accounted for by
the estimated factors. In some cases, the improvement over
h1 AR
h
ytCh D yOtCh—t
h C 41 ƒ 5yOtCh—t C uhtCh 1 (4.1) the benchmark forecasts are quite substantial; for example,
for industrial production at the 12-month horizon the D I-AR,
h1 AR
where yOtCh—t
h
is the candidate h-step-ahead forecast and yOtCh—t Lag forecast has a forecast error variance 57% that of the
is the benchmark h-step-ahead autoregressive forecast. Het- autoregressive model and two-thirds that of the leading indi-
eroscedastic autocorrelation robust (HAC) standard errors for cator model. The relative improvements are more modest at
are reported in parentheses. For example, is estimated the 6-month horizon. At the 24-month horizon, the multivari-
to be .57 when the candidate forecast is the leading indica- ate benchmark forecasts break down and perform worse than
tor forecast at the 12-month horizon, with a standard error of the univariate forecast; however, the D I-AR, Lag, D I-AR, and
.13, so the hypothesis that the weight on the leading indica- D I forecasts continue to outperform the autoregressive bench-
tor forecast is 0 ( D 0) is rejected at the 5% level, but so is mark very substantially.
the hypothesis that the leading indicator forecast receives unit The performance of comparable models is usually better
weight. when the empirical factors from the full dataset are used, rel-
We now turn to the results for the real variables. First con- ative to those from the balanced panel subset. Performance is
sider the D I forecasts with factors estimated using the full not improved by using empirical factors from augmenting the
dataset (the unbalanced panel). These forecasts with B IC fac- balanced panel with its rst lag; for these real series, doing
tor selection generally improve substantially over the bench- so does comparably to, or somewhat worse than, using the
mark univariate and multivariate forecasts. The D I-AR, Lag empirical factors from the unstacked balanced panel.
model, which allows recursive B IC selection across own lags Inspection of the nal panels of Tables 1 and 2 reveals
and lags of the factors, outperforms all three benchmark a striking nding: simply using D I or D I-AR forecasts with
152 Journal of Business & Economic Statistics, April 2002
Table 2. Simulated Out-of-Sample Forecasting Results: Real Variables, 6- and 24-Month Horizons
Industrial production Personal income Mfg & trade sales Nonag. employment
Forecast
method Rel. MSE O Rel. MSE O Rel. MSE O Rel. MSE O
A. Horizon D 6 months
Benchmark models
AR 1000 1000 1000 1000
LI 070 (025) .68 (.13) 083 (015) 064 (011) 077 (019) .68 (.14) 075 (019) 067 (012)
VAR 1001 (005) .43 (.39) 099 (003) 063 (043) 099 (004) .64 (.45) 1006 (007) 012 (034)
Full dataset (N D 215)
DI-AR, Lag 069 (025) .69 (.14) 077 (012) 086 (015) 063 (018) .89 (.17) 094 (016) 056 (018)
DI-AR 077 (030) .62 (.16) 081 (016) 066 (013) 070 (020) .76 (.17) 1002 (032) 049 (019)
DI 074 (025) .68 (.17) 081 (016) 065 (013) 067 (020) .79 (.18) 096 (028) 052 (019)
Balanced panel (N D 149)
DI-AR, Lag 073 (025) .68 (.16) 079 (013) 078 (013) 066 (017) .87 (.17) 093 (017) 058 (021)
DI-AR 078 (028) .62 (.16) 081 (015) 066 (011) 076 (019) .70 (.17) 097 (028) 052 (019)
DI 073 (024) .69 (.15) 081 (015) 066 (011) 068 (019) .81 (.17) 095 (026) 053 (018)
Full dataset; m D 1, p D BIC, k ’ xed
Downloaded by [Linköping University Library] at 10:51 12 August 2013
DI-AR, k D 1 097 (015) .58 (.33) 091 (007) 080 (023) 099 (011) .52 (.29) 094 (012) 060 (019)
DI-AR, k D 2 067 (022) .77 (.15) 076 (011) 090 (014) 064 (018) .86 (.16) 084 (013) 071 (016)
DI-AR, k D 3 064 (023) .81 (.15) 075 (012) 089 (014) 064 (018) .88 (.17) 088 (014) 066 (017)
DI-AR, k D 4 064 (023) .80 (.15) 074 (013) 087 (014) 063 (018) .87 (.15) 091 (016) 060 (018)
RMSE, AR Model .030 .016 .028 .008
B. Horizon D 24 months
Benchmark models
AR 1000 1000 1000 1000
LI 1009 (028) .45 (.14) 1029 (031) 030 (020) 1008 (021) .45 (.14) 1007 (031) 047 (015)
VAR 1001 (010) .44 (.48) 098 (006) 063 (034) 1003 (006) .13 (.85) 1006 (013) 035 (031)
Full dataset (N D 215)
DI-AR, Lag 057 (024) .88 (.13) 070 (020) 094 (023) 066 (018) .95 (.18) 082 (015) 088 (026)
DI-AR 059 (025) .88 (.15) 076 (022) 080 (026) 070 (020) .89 (.19) 074 (019) 097 (024)
DI 055 (026) .91 (.14) 076 (022) 080 (025) 070 (020) .89 (.19) 074 (019) 097 (024)
Balanced panel (N D 149)
DI-AR, Lag 057 (025) .87 (.14) 076 (019) 086 (023) 064 (020) .94 (.18) 074 (017) 1006 (025)
DI-AR 058 (025) .87 (.14) 083 (020) 074 (024) 067 (019) .93 (.18) 076 (018) 094 (025)
DI 058 (025) .87 (.14) 083 (020) 074 (024) 067 (020) .94 (.19) 075 (018) 094 (024)
Full dataset; m D 1, p D BIC, k ’ xed
DI-AR, k D 1 1012 (019) .10 (.46) 1007 (009) 081(1000) 097 (004) .90 (.62) 1003 (007) 033 (046)
DI-AR, k D 2 076 (019) .68 (.11) 088 (013) 068 (017) 065 (020) .87 (.14) 072 (016) 099 (017)
DI-AR, k D 3 058 (024) .89 (.13) 072 (019) 090 (018) 070 (017) .89 (.14) 079 (016) 095 (024)
DI-AR, k D 4 056 (024) .90 (.14) 070 (020) 093 (023) 067 (018) .95 (.18) 078 (016) 096 (024)
RMSE, AR Model .075 .046 .070 .031
two factors captures most of the forecasting improvement. In on the D I-AR lag. Thus most of the forecasting gains seem to
most cases, incorporating B IC factor and lag order selection come from using a single factor.
provides little or no improvement over just using two fac- As with the real variables, forecasts based on the stacked
tors, with no lags of the factors and no lagged dependent data perform less well than those based on the unstacked data.
variables. Although the full dataset forecasts are typically better than
The results for the price series are given in Tables 3 and 4. the balanced panel subset forecasts for the 6- and 12-month
There are three notable differences in these results, relative to horizons, at the 24-month horizon the balanced panel forecasts
those for the real variables. First, the D I-AR, Lag forecasts slightly outperform the full dataset forecasts.
outperform all the benchmark forecasts less often, in only 6 Additional analysis of factor-based forecasts of CP I and
of the 12 variable-horizon combinations. Second, including consumption de ator in ation, and additional comparisons of
lagged in ation dramatically improves the forecasts, and with- these forecasts to other Phillips-curve forecasts and to fore-
out this the D I forecasts are actually worse than the autore- casts based on other leading indicators, were presented by
gressive forecasts. Third, other factor forecasts generally out- Stock and Watson (1999). Three ndings from that study are
perform the D I-AR, Lag forecasts. Notably, the full data set worth noting here. First, the D I-AR and D I-AR, Lag forecasts
D I-AR forecast with k D 1 (and no lagged factors) outperforms are found to perform well relative to a large number of addi-
all the benchmarks in 11 of 12 cases and typically improves tional multivariate benchmarks. Second, the forecasts reported
Stock and Watson: Macroeconomic Forecasting Using Diffusion Indexes 153
Table 3. Simulated Out-of-Sample Forecasting Results: Price In‘ ation, 12-Month Horizon
CPI Consumption de‘ ator CPI exc. food & energy Producer price index
Forecast
method Rel. MSE O Rel. MSE O Rel. MSE O Rel. MSE O
Benchmark models
AR 1000 1000 1000 1000
LI 079 (015) 076 (015) 095 (012) 058 (017) 1000 (016) 050 (021) 082 (015) 075 (019)
Phillips Curve 082 (013) 095 (020) 092 (010) 072 (023) 079 (018) 080 (022) 087 (014) 096 (030)
VAR 091 (009) 074 (020) 1002 (006) 045 (020) 099 (005) 056 (021) 1029 (014) 025 (012)
Full dataset (N D 215)
DI-AR, Lag 072 (014) 091 (014) 090 (009) 065 (013) 084 (015) 076 (020) 083 (013) 078 (021)
DI-AR 071 (016) 083 (013) 090 (010) 062 (013) 085 (015) 074 (020) 082 (014) 075 (020)
DI 1030 (016) 034 (008) 1040 (016) 025 (008) 1055 (031) 024 (006) 2040 (088) 013 (007)
Balanced panel (N D 149)
DI-AR, Lag 070 (014) 094 (012) 090 (008) 067 (015) 084 (015) 077 (021) 086 (011) 077 (021)
DI-AR 069 (015) 088 (013) 087 (010) 066 (012) 085 (015) 073 (020) 085 (014) 071 (019)
DI 1030 (016) 032 (008) 1034 (013) 026 (009) 1057 (033) 020 (007) 2044 (087) 014 (006)
Stacked balance panel
Downloaded by [Linköping University Library] at 10:51 12 August 2013
DI-AR 073 (015) 082 (012) 087 (009) 065 (012) 085 (015) 077 (021) 081 (014) 075 (020)
DI 1054 (031) 028 (008) 1051 (018) 025 (008) 1055 (032) 023 (006) 3006(1089) 011 (006)
Full dataset; m D 1, p D BIC, k ’ xed
DI-AR, k D 1 064 (015) 1014 (014) 077 (012) 096 (016) 071 (017) 1025 (023) 076 (016) 095 (024)
DI-AR, k D 2 067 (014) 1007 (013) 083 (009) 083 (014) 072 (017) 097 (019) 077 (015) 093 (023)
DI-AR, k D 3 076 (013) 091 (015) 094 (007) 061 (014) 086 (014) 073 (020) 086 (011) 078 (021)
DI-AR, k D 4 074 (014) 089 (015) 091 (009) 064 (014) 087 (015) 072 (021) 082 (013) 079 (021)
Full dataset; m D 1, p D 0, k ’ xed
DI, k D 1 1060 (034) 025 (007) 1056 (020) 022 (009) 1055 (031) 023 (006) 2076(1061) 012 (007)
DI, k D 2 1056 (031) 026 (007) 1058 (020) 021 (008) 1062 (039) 022 (007) 2072(1056) 013 (007)
DI, k D 3 1057 (032) 024 (008) 1060 (020) 017 (008) 1069 (043) 018 (007) 2068(1049) 013 (007)
DI, k D 4 1056 (025) 025 (007) 1056 (019) 021 (008) 1067 (040) 019 (007) 2055 (099) 016 (006)
RMSE, AR Model .021 .015 .019 .033
here can be further improved on using a single-factor fore- Nevertheless, the nding that good forecasts can be made with
cast, where the factor is computed from a set of variables that only one or two factors suggests brie y characterizing the rst
measure only real economic activity. Forecasts based on this few factors.
real economic activity factor have MSEs approximately 10% Figure 1 therefore displays the R2 of the regressions of the
less than the best forecasts reported in Table 3. Finally, sim- 215 individual time series against each of the rst six empiri-
ilar rankings of methods are obtained using I(1) forecasting cal factors from the balanced panel subset, estimated over the
models, rather than the I(2) models used here, that is, when full sample period. These R2 are plotted as bar charts with
rst rather than second differences of log prices are used for one chart for each factor. (The series are grouped by category
the forecasting equation and factor estimation. and ordered numerically using the ordering in the Appendix.)
In interpreting these results, it should be stressed that the Broadly speaking, the rst factor loads primarily on output
multivariate leading indicator models are sophisticated fore- and employment; the second factor on interest rate spreads,
casting tools that provide a stiff benchmark against which unemployment rates, and capacity utilization rates; the third,
to judge the diffusion index forecasts. In our judgment, the on interest rates; the fourth, on stock returns; the fth, on in a-
performance of the leading indicator models reported here tion; and the sixth, on housing starts. Taken together, these six
overstates their true potential out of sample performance, factors account for 39% of the variance of the 215 monthly
because the lists of leading indicators used to construct the time series in the full dataset, as measured by the trace-R2 ;
forecasts were chosen by model selection methods based the rst 12 factors together account for 53% of the variance
on their forecasting performance over the past two decades, of these series. (The contributions to the trace-R2 by the rst
as discussed in Section 3. In this light, we consider the six factors are, respectively, .137, .085, .048, .040, .034, and
performance of the various diffusion index models to be par- .041, for a total of .385.)
ticularly encouraging.
5. DISCUSSION AND CONCLUSIONS
4.2 Empirical Factors
We nd two features of the empirical results surprising and
Because the factors are identi ed only up to a k k matrix, intriguing. First, only six factors account for much of the
detailed discussion of the individual factors is unwarranted. variance of our 215 time series. One interpretation of this
154 Journal of Business & Economic Statistics, April 2002
Table 4. Simulated Out-of-Sample Forecasting Results: Price In‘ ation, 6- and 24-Month Horizons
CPI Consumption de‘ ator CPI exc. food & energy Producer price index
Forecast
method Rel. MSE O Rel. MSE O Rel. MSE O Rel. MSE O
A. Horizon D 6 months
Benchmark models
AR 1000 1000 1000 1000
LI 082 (012) 078 (016) 1004 (009) 042 (016) 1010 (016) 032 (027) 1000 (009) 051 (019)
Phillips Curve 090 (011) 080 (027) 099 (006) 054 (023) 090 (011) 068 (019) 1002 (004) 034 (037)
VAR 1004 (008) 041 (016) 1015 (007) 008 (020) 1000 (005) 050 (021) 1034 (016) 019 (012)
Full dataset (N D 215)
DI-AR, Lag 073 (014) 1005 (018) 091 (008) 071 (017) 083 (013) 089 (025) 087 (011) 087 (026)
DI-AR 074 (014) 1001 (019) 089 (008) 079 (018) 083 (013) 089 (025) 087 (010) 087 (026)
DI 1057 (025) 021 (008) 1068 (026) 010 (008) 1074 (043) 013 (007) 2042 (074) 005 (007)
Balanced panel (N D 149)
DI-AR, Lag 079 (013) 1000 (022) 097 (007) 059 (018) 085 (013) 085 (025) 091 (009) 078 (027)
DI-AR 078 (013) 094 (021) 096 (008) 060 (018) 085 (013) 085 (025) 091 (009) 082 (029)
DI 1059 (026) 019 (008) 1064 (021) 009 (008) 1073 (043) 013 (007) 2042 (070) 007 (007)
Downloaded by [Linköping University Library] at 10:51 12 August 2013
result is that there are only a few important sources of macro- estimating the factors with mixed frequency data is outlined
economic variability. Second, just a few factors are needed to in Appendix A. Third, we considered only U.S. data, and it
forecast real activity, and the most accurate forecasts of in a- would be useful to study the relative forecasting performance
tion use lags of in ation together with a single factor. This of these methods for other countries. Fourth, the estimated
suggests that a very small state vector may be necessary for factors that we used here were based on simple estimators
forecasting macroeconomic time series. and it would be useful to study other estimators designed to
These results raise several issues for future empirical and exploit the heteroscedasticity and serial correlation in the data
theoretical research. We mention ve here. First, classical dif- to improve ef ciency. Finally, our results are based on 215
fusion indexes are computed using nonlinear transformations time series chosen judgementally from the large number of
of the data, but our indexes are linear functions of the data. available macroeconomic time series. Would there be addi-
This raises the possibility that further forecasting gains can tional improvements if we were to use 500 series or much
be realized using a nonlinear version of the dynamic factor loss by restricting ourselves to only 100 series? Alternatively,
model. Second, the results reported here rely on monthly data, the problem of systematically selecting many series from
but data from other sampling frequencies (weekly, quarterly) very many series is a dif cult problem that requires further
may improve the forecasts. A computational algorithm for research.
Stock and Watson: Macroeconomic Forecasting Using Diffusion Indexes 155
Downloaded by [Linköping University Library] at 10:51 12 August 2013
Figure 1. R2 Between Factors and Individual Time Series, Grouped by Category (see Appendix B). Categories: real output and income
(Out); employment and hours (Emp); real retail, manufacturing, and trade sales (RTS); consumption (PCE); housing starts and sales (HSS); real
inventories and inventory-sales ratios (Inv); orders and un’ lled orders (Ord); stock prices (SPr); exchange rates (EXR); interest rates (Int); money
and credit quantity aggregates (Mon); price indexes (Pri); average hourly earnings (AHE); miscellaneous (Oth).
156 Journal of Business & Economic Statistics, April 2002
monthly and quarterly) data. In this case, a modi cation of ing values for Fb in the EM iteration can be obtained using
standard principal component estimation is necessary. To moti- estimates from the balanced panel subset.
vate the modi cation, consider the least squares estimators of We now provide some additional details on the calcu-
å and Ft from (2.4) from a balanced panel. The objective lation of X bit for some important special cases. Let X i D
function is 4Xi 1 : : : 1 XiT 50 , and let X !i be the vector of observations on the
ith variable. Suppose that X !i D Ai X i for some known matrix
X
N X
T
V 4F 1 å5 D 4Xit ƒ ‹0i Ft 52 1 (A.1) Ai , as can be done in the cases of missing values and tempo-
ral aggregation, for example. Then E4X i — X ! 5 D E4X i — Xi 5 D
!
iD1 tD1
where ‹i is the ith row of å. (A.1) can be minimized by the ized inverse of Ai A0i . The particulars of these calculations are
usual eigenvalue calculations and Fbt are the principal compo- now presented for some important special cases. In the rst
nents of Xt . four special cases discussed, this level of generality is unnec-
When the panel is unbalanced, least squares estimators of essary and the formula for X bit follows quite simply from the
Ft can be calculated from the objective function nature of the data irregularity.
A. Missing Observations. Suppose some observations on
X
N X
T
Xit are missing. Then, during iteration j, the elements of
V ! 4F 1 å5 D Iit 4Xit ƒ ‹0i Ft 52 1 (A.2)
iD1 tD1 the estimated balanced panel are constructed as X bit D X it if
b b b
Xit observed and Xit D ‹i Ft otherwise. The estimate of F
0
where Iit D 1 if Xit is available and 0 otherwise. Minimization is then updated by computing the eigenvectors correspond-
P bb bD
of (A.2) requires iterative methods. This appendix summarizes ing to the largest r eigenvalues of N ƒ1 i X i X i , where X i
an iterative method based on the EM algorithm that has proved 4Xbi1 1 X
bi2 1 : : : 1 X
biT 5 . The estimate of å is updated by the ordi-
0
to be easy and effective. nary least squares regression of X b onto this updated estimate
To motivate this EM algorithm, notice that V 4F 1 å5 is pro- of F .
portional to the log-likelihood under the assumption that Xit B. Mixed Monthly and Quarterly Data— I(0) Stock Vari-
are iid N 4‹0i Ft 1 15, in which case the least squares estimators ables. A series that is observed quarterly and is a stock vari-
are the Gaussian maximum likelihood estimators. Because V ! able would be the point-in-time level of a variable at the end
is just a missing data version of V and because minimization of the quarter, say, the level of inventories at the end of the
of V is computationally simple, a simple EM algorithm can quarter. If this series is I(0), then it is handled as in case A;
be constructed to minimize V ! . that is, it is treated as a monthly series with missing observa-
The jth iteration of the algorithm is de ned as follows. Let tions in the rst and second months of the quarter.
b and Fb denote estimates of å and F constructed from the
å C. Mixed Monthly and Quarterly Data— I(0) Flow Vari-
4j ƒ 15st iteration, and let ables. A quarterly ow variable is the average (or sum)
of unobserved monthly values. If this series is I(0), it can
Q4X ! 1 Fb1 b
å1 F 1 å5 D EFb1 b — !
å 6V 4F 1 å5 X 71 (A.3) be treated as follows. The unobserved monthly series, Xit ,
q q
is measured only as the time aggregate Xit , where Xit D
q
where X ! denotes the full set of observed data and 41=354Xi1 tƒ2 C Xi1 tƒ1 C Xit 5 for t D 31 61 91 121 : : : 1 and Xit
EFb1 b — !
å 6V 4F 1 å5 X 7 is the expected value of the complete data is missing for all other values of t. In this case estima-
log-likelihood V 4F 1 å5, evaluated using the conditional den- tion proceeds as in case A but with X bit D ‹O 0i Fbt C eOit , where
sity of X — X ! evaluated at Fb and b eOit D Xi’ ƒ ‹O 0i 4Fb’ ƒ2 C Fb’ ƒ1 C Fb’ 5=3, where ’ D 3 when t D
q
å. The estimates of F and
b1 b
å at iteration j solve MinF 1 å Q4X ! 1 F å1 F 1 å5. 11 21 31 ’ D 6, when t D 41 51 6, and so forth.
Stock and Watson: Macroeconomic Forecasting Using Diffusion Indexes 157
D. Mixed Monthly and Quarterly Data— I(1) stock vari- APPENDIX B: DATA DESCRIPTION
ables. Suppose that underlying monthly data are I(1) and let
Xit1 denote the quarterly rst difference stock variable, assumed The time series used to construct the diffusion index fore-
to be measured in the third month of every quarter, and let casts discussed in Section 5 are presented here. The format is
Xit denote the monthly rst difference of the variable. Then
q q as follows: series number, series mnemonic, data span used,
Xit D 4Xi1 tƒ2 C Xi1 tƒ1 C Xit 5 for t D 31 61 91 121 : : : 1 and X it is
transformation code, and brief series description. The transfor-
missing for all other values of t. In this case estimation pro-
bit D ‹O 0i Fbt C 41=35eOit , where eOit D mation codes are 1 D no transformation, 2 D rst difference,
ceeds as in case A but with X
q O b’ ƒ2 C Fb’ ƒ1 C Fb’ 5, where ’ D 3 when t D 11 21 31 ’ D 4 D logarithm, 5 D rst difference of logarithms, 6 D second
Xi’ ƒ ‹i 4F
0
difference of logarithms. An asterisk after the date denotes a
6, when t D 41 51 6, and so forth.
E. Mixed Monthly and Quarterly Data— I(1) Flow Vari- series that was included in the unbalanced panel but not the
ables. Construction of X bit is more dif cult here than in the balanced panel, either because of missing data or because of
earlier cases. Here the general regression formula given above gross outliers that were treated as missing data. The series
can be implemented after specifying X !i and Ai . Let the quar- either were taken directly from the DR I-McGraw–Hill Basic
q
terly rst differences be denoted by Xit , which is assumed to Economics database, in which case the original mnemonics
be observed at the end of every quarter. The vector of obser- are used, or were produced by author calculations based on
q q q
vations is then X !i D 4Xi3 1 Xi6 1 : : : 1 Xi’ 50 , where ’ denotes the data from that database, in which case the author calcula-
tions and original DR I-McGraw series mnemonics are sum-
Downloaded by [Linköping University Library] at 10:51 12 August 2013
30. lhnag 1959:01–1998:12 5 civilian labor force: employed, nonagric. industries (thous., sa)
31. lhur 1959:01–1998:12 1 unemployment rate: all workers, 16 years & over (%, sa)
32. lhu680 1959:01–1998:12 1 unemploy. by duration: average (mean) duration in weeks (sa)
33. lhu5 1959:01–1998:12 1 unemploy. by duration: persons unempl. less than 5 wks (thous., sa)
34. lhu14 1959:01–1998:12 1 unemploy. by duration: persons unempl. 5 to 14 wks (thous., sa)
35. lhu15 1959:01–1998:12 1 unemploy. by duration: persons unempl. 15 wks C (thous., sa)
36. lhu26 1959:01–1998:12 1 unemploy. by duration: persons unempl. 15 to 26 wks (thous., sa)
37. lpnag 1959:01–1998:12 5 employees on nonag. payrolls: total (thous., sa)
38. lp 1959:01–1998:12 5 employees on nonag. payrolls: total, private (thous., sa)
39. lpgd 1959:01–1998:12 5 employees on nonag. payrolls: goods-producing (thous., sa)
40. lpmi 1959:01–1998:12ü 5 employees on nonag. payrolls: mining (thous., sa)
41. lpcc 1959:01–1998:12 5 employees on nonag. payrolls: contract construction (thous., sa)
42. lpem 1959:01–1998:12 5 employees on nonag. payrolls: manufacturing (thous., sa)
43. lped 1959:01–1998:12 5 employees on nonag. payrolls: durable goods (thous., sa)
44. lpen 1959:01–1998:12 5 employees on nonag. payrolls: nondurable goods (thous., sa)
45. lpsp 1959:01–1998:12 5 employees on nonag. payrolls: service-producing (thous., sa)
46. lptu 1959:01–1998:12ü 5 employees on nonag. payrolls: trans. & public utilities (thous., sa)
Downloaded by [Linköping University Library] at 10:51 12 August 2013
47. lpt 1959:01–1998:12 5 employees on nonag. payrolls: wholesale & retail trade (thous., sa)
48. lpfr 1959:01–1998:12 5 employees on nonag. payrolls: nance, insur. & real estate (thous., sa)
49. lps 1959:01–1998:12 5 employees on nonag. payrolls: services (thous., sa)
50. lpgov 1959:01–1998:12 5 employees on nonag. payrolls: government (thous., sa)
51. lw 1964:01–1998:12ü 2 avg. weekly hrs. of prod. wkrs.: total private (sa)
52. lphrm 1959:01–1998:12 1 avg. weekly hrs. of production wkrs.: manufacturing (sa)
53. lpmosa 1959:01–1998:12 1 avg. weekly hrs. of prod. wkrs.: mfg., overtime hrs. (sa)
54. pmemp 1959:01–1998:12 1 NAPM employment index (percent)
99. ivsrwq 1959:01–1998:12 2 ratio for mfg & trade: wholesaler; inventory/sales (87$)(s.a.)
100. ivsrrq 1959:01–1998:12 2 ratio for mfg & trade: retail trade; inventory/sales (87$)(s.a.)
101. pmnv 1959:01–1998:12 1 napm inventories index (percent)
Orders and un lled orders (Ord)
102. pmno 1959:01–1998:12 1 napm new orders index (percent)
103. pmdel 1959:01–1998:12 1 napm vendor deliveries index (percent)
104. mocmq 1959:01–1998:12 5 new orders (net)-consumer goods & materials, 1992 dollars (bci)
105. mdoq 1959:01–1998:12 5 new orders, durable goods industries, 1992 dollars (bci)
106. msondq 1959:01–1998:12 5 new orders, nondefense capital goods, in 1992 dollars (bci)
107. mo 1959:01–1998:12 5 mfg new orders: all manufacturing industries, total (mil$, sa)
108. mowu 1959:01–1998:12 5 mfg new orders: mfg industries with un lled orders (mil$, sa)
109. mdo 1959:01–1998:12 5 mfg new orders: durable goods industries, total (mil$, sa)
110. mduwu 1959:01–1998:12 5 mfg new orders: durable goods indust with un lled orders (mil$, sa)
111. mno 1959:01–1998:12 5 mfg new orders: nondurable goods industries, total (mil$, sa)
112. mnou 1959:01–1998:12 5 mfg new orders: nondurable gds ind. with un lled orders (mil$, sa)
113. mu 1959:01–1998:12 5 mfg un lled orders: all manufacturing industries, total (mil$, sa)
114. mdu 1959:01–1998:12 5 mfg un lled orders: durable goods industries, total (mil$, sa)
115. mnu 1959:01–1998:12 5 mfg un lled orders: nondurable goods industries, total (mil$, sa)
116. mpcon 1959:01–1998:12 5 contracts & orders for plant & equipment (bil$, sa)
117. mpconq 1959:01–1998:12 5 contracts & orders for plant & equipment in 1992 dollars (bci)
Stock prices (SPr)
118. fsncom 1959:01–1998:12 5 NYSE common stock price index: composite 412=31=65 D 505
119. fsnin 1966:01–1998:12ü 5 NYSE common stock price index: industrial 412=31=65 D 505
120. fsntr 1966:01–1998:12ü 5 NYSE common stock price index: transportation 412=31=65 D 505
121. fsnut 1966:01–1998:12ü 5 NYSE common stock price index: utility 412=31=65 D 505
122. fsn 1966:01–1998:12ü 5 NYSE common stock price index: nance 412=31=65 D 505
123. fspcom 1959:01–1998:12 5 S&P’s common stock price index: composite (1941–43 D 10)
124. fspin 1959:01–1998:12 5 S&P’s common stock price index: industrials (1941–43 D 10)
125. fspcap 1959:01–1998:12 5 S&P’s common stock price index: capital goods (1941–43 D 10)
126. fsptr 1970:01–1998:12ü 5 S&P’s common stock price index: transportation (1970 D 10)
127. fsput 1959:01–1998:12 5 S&P’s common stock price index: utilities (1941–43 D 10)
128. fsp 1970:01–1998:12ü 5 S&P’s common stock price index: nancial 41970 D 105
129. fsdxp 1959:01–1998:12 1 S&P’s composite common stock: dividend yield (% per annum)
130. fspxe 1959:01–1998:12 1 S&P’s composite common stock: price-earnings ratio (%, nsa)
131. fsnvv3 1974:01–1998:07ü 5 NYSE mkt composition: reptd share vol by size, 5000 C shrs,%
Exchange rates (EXR)
132. exrus 1959:01–1998:12 5 United States effective exchange rate (merm) (index no.)
133. exrger 1959:01–1998:12 5 foreign exchange rate: Germany (deutsche mark per U.S.$)
160 Journal of Business & Economic Statistics, April 2002
134. exrsw 1959:01–1998:12 5 foreign exchange rate: Switzerland (swiss franc per U.S.$)
135. exrjan 1959:01–1998:12 5 foreign exchange rate: Japan (yen per U.S.$)
136. exruk 1959:01–1998:12ü 5 foreign exchange rate: United Kingdom (cents per pound)
137. exrcan 1959:01–1998:12 5 foreign exchange rate: Canada (canadian $ per U.S.$)
Interest rates ( Int)
138. fyff 1959:01–1998:12ü 2 interest rate: federal funds (effective) (% per annum, nsa)
139. fycp90 1959:01–1998:12ü 2 interest rate: 90 day commercial paper, (ac) (% per ann, nsa)
140. fygm3 1959:01–1998:12ü 2 interest rate: U.S. treasury bills, sec mkt, 3-mo. (% per ann, nsa)
141. fygm6 1959:01–1998:12ü 2 interest rate: U.S. treasury bills, sec mkt, 6-mo. (% per ann, nsa)
142. fygt1 1959:01–1998:12ü 2 interest rate: U.S. treasury const maturities, 1-yr. (% per ann, nsa)
143. fygt5 1959:01–1998:12 2 interest rate: U.S. treasury const maturities, 5-yr. (% per ann, nsa)
144. fygt10 1959:01–1998:12 2 interest rate: U.S. treasury const maturities, 10-yr. (% per ann, nsa)
145. fyaaac 1959:01–1998:12 2 bond yield: moody’s aaa corporate (% per annum)
146. fybaac 1959:01–1998:12 2 bond yield: moody’s baa corporate (% per annum)
147. fwa t 1973:01–1994:04ü 1 weighted avg foreign interest rate (%, sa)
148. fyfha 1959:01–1998:12 2 secondary market yields on fha mortgages (% per annum)
149. sfycp 1959:01–1998:12 1 spread fycp - fyff
Downloaded by [Linköping University Library] at 10:51 12 August 2013
187. pu81 1967:01–1998:12ü 6 cpi-u: food & beverages (82–84 D 100, sa)
188. puh 1967:01–1998:12ü 6 cpi-u: housing (82–84 D 100, sa)
189. pu83 1959:01–1998:12 6 cpi-u: apparel & upkeep (82–84 D 100, sa)
190. pu84 1959:01–1998:12 6 cpi-u: transportation (82–84 D 100, sa)
191. pu85 1959:01–1998:12 6 cpi-u: medical care (82–84 D 100, sa)
192. puc 1959:01–1998:12 6 cpi-u: commodities (82–84 D 100, sa)
193. pucd 1959:01–1998:12 6 cpi-u: durables (82–84 D 100, sa)
194. pus 1959:01–1998:12 6 cpi-u: services (82–84 D 100, sa)
195. puxf 1959:01–1998:12 6 cpi-u: all items less food (82–84 D 100, sa)
196. puxhs 1959:01–1998:12 6 cpi-u: all items less shelter (82–84 D 100, sa)
197. puxm 1959:01–1998:12 6 cpi-u: all items less medical care (82–84 D 100, sa)
198. pcgold 1975:01–1998:12ü 6 commodities price: gold, london noon x, avg of daily rate, $ per oz
199. gmdc 1959:01–1998:12 6 pce, impl pr de : pce 41987 D 1005
200. gmdcd 1959:01–1998:12 6 pce, impl pr de : pce; durables 41987 D 1005
201. gmdcn 1959:01–1998:12 6 pce, impl pr de : pce; nondurables 41987 D 1005
202. gmdcs 1959:01–1998:12 6 pce, impl pr de : pce; services 41987 D 1005
Average hourly earnings (AHE)
Downloaded by [Linköping University Library] at 10:51 12 August 2013
203. leh 1964:01–1998:12ü 6 avg hr earnings of prod wkrs: total private nonagric ($, sa)
204. lehcc 1959:01–1998:12 6 avg hr earnings of constr wkrs: construction ($, sa)
205. lehm 1959:01–1998:12 6 avg hr earnings of prod wkrs: manufacturing ($, sa)
206. lehtu 1964:01–1998:12ü 6 avg hr earnings of nonsupv wkrs: trans & public util ($, sa)
207. lehtt 1964:01–1998:12ü 6 avg hr earnings of prod wkrs: wholesale & retail trade (sa)
208. lehfr 1964:01–1998:12ü 6 avg hr earnings of nonsupv wkrs: nance, insur, real est ($, sa)
209. lehs 1964:01–1998:12ü 6 avg hr earnings of nonsupv wkrs: services ($, sa)
Miscellaneous (Oth)
210. fste 1986:01–1998:12ü 5 U.S. mdse exports: total exports (f.a.s. value) (mil.$, s.a.)
211. fstm 1986:01–1998:12ü 5 U.S. mdse imports: general imports (c.i.f. value) (mil.$, s.a.)
212. ftmd 1986:01–1998:12ü 5 U.S. mdse imports: general imports (customs value) (mil.$, s.a.)
213. fstb 1986:01–1998:12ü 2 U.S. mdse trade balance: exports less imports (fas/cif) (mil.$, s.a.)
214. ftb 1986:01–1998:12ü 2 U.S. mdse trade balance: exp. (fas) less imp. (custom) (mil.$, s.a.)
215. hhsntn 1959:01–1998:12 1 u. of mich. index of consumer expectations (bcd-83)
[Received May 2000. Revised March 2001.] (1998), “Lets Get Real: A Dynamic Factor Analytical Approach to
Disaggregated Business Cycle,” Review of Economic Studies, 65, 453–474.
Fuhrer, J. C. (1995), “The Phillips Curve is Alive and Well,” New England
REFERENCES Economic Review of the Federal Reserve Bank of Boston, March/April,
41–56.
Burns, A. F., and Mitchell, W. C. (1947), Measuring Business Cycles, Geweke, J. (1977), “The Dynamic Factor Analysis of Economic Time Series,”
New York: National Bureau of Economic Research. in Latent Variables in Socio-Economic Models, eds. D. J. Aigner and A. S.
Chamberlain, G., and Rothschild, M. (1983), “Arbitrage Factor Structure, and Goldberger, Amsterdam: North–Holland.
Mean-Variance Analysis of Large Asset Markets,” Econometrica, 51, 5. Gordon, R. J. (1982), “Price Inertia and Ineffectiveness in the United States,”
Congressional Budget Of ce (1996), The Economic and Budget Outlook: Fis- Journal of Political Economy, 90, 1087–1117.
cal Years 1997–2006, Washington, DC: Author. (1997), “The Time-Varying NA IRU and its Implications for Economic
Connnor, G., and Korajczyk, R. A. (1986), “Performance Measurement Policy,” Journal of Economic Perspectives, 11–32.
With the Arbitrage Pricing Theory,” Journal of Financial Economics 15, Quah, D., and Sargent, T. J. (1983), “A Dynamic Index Model for Large Cross
373–394.
Sections,” in Business Cycles, Indicators, and Forecasting, eds. J. H. Stock
(1988), “Risk and Return in an Equilibrium APT: Application of a
and M. W. Watson, Chicago: University of Chicago Press, 285–306.
New Test Methodology,” Journal of Financial Economics, 21, 255–289.
Sargent, T. J., and Sims, C. A. (1977), “Business Cycle Modeling without
(1993), “A Test for the Number of Factors in an Approximate Factor
Model,” Journal of Finance, 48, 4. Pretending to Have Too Much A-Priori Economic Theory,” in New Meth-
Engle, R. F., and Watson, M. W. (1981), “A One-Factor Multivariate Time ods in Business Cycle Research, ed. C. Sims et al., Minneapolis: Federal
Series Model of Metropolitan Wage Rates,” Journal of the American Sta- Reserve Bank of Minneapolis.
tistical Association, 76, 376, 774–781. Singleton, K. J. (1980), “A Latent Time Series Model of the Cyclical Behav-
Forni, M., Hallin, M., Lippi, M., and Reichlin, L. (2000), “The General- ior of Interest Rates,” International Economic Review, 21, 559–575.
ized Dynamnic Factor Model: Identi cation and Estimation,” The Review Staiger, D., Stock, J. H., and Watson, M. W. (1997), “The NA IRU, Unemploy-
of Economics and Statistics, 82, 4, 540–552. ment, and Monetary Policy,” Journal of Economic Perspectives, 11, 33–51.
Forni, M., and Reichlin, L. (1996), “Dynamic Common Factors in Large Stock, J. H., and Watson, M. W. (1989), “New Indexes of Coincident and
Cross-Sections,” Empirical Economics, 21, 27–42. Leading Economic Indicators,” NBER Macroeconomics Annual, 351–393.
162 Journal of Business & Economic Statistics, April 2002
(1991), “A Probability Model of the Coincident Economic Indica- Stock, J. H., and Watson, M. W. (1999), “Forecasting In ation,” Journal of
tors,” in Leading Economic Indicators: New Approaches and Forecasting Monetary Economics 44, 293–335.
Records, eds. K. Lahiri and G. H. Moore, New York: Cambridge Univer- (2000), “Forecasting Using Principal Components From a Large Num-
sity Press, 63–85. ber of Predictors,” manuscript.
Tootell, G. M. B. (1994), “Restructuring, the NA IRU, and the Phillips Curve,”
(1996), “Evidence on Structural Instability in Macroeconomic Time New England Economic Review of the Federal Reserve Bank of Boston,
Series Relations,” Journal of Business and Economic Statistics, 14, 11–30. Sept./Oct., 31–44.
(1998), “Diffusion Indexes,” working paper 6702, National Bureau of West, K. D. (1996), “Asymptotic Inference About Predictive Ability,” Econo-
Economic Research. metrica, 64, 1067–1084.
Downloaded by [Linköping University Library] at 10:51 12 August 2013