Diagnostic Smooth Tests of ®T: Metrika (2000) 52: 237 252
Diagnostic Smooth Tests of ®T: Metrika (2000) 52: 237 252
Abstract. Smooth tests are frequently used for testing the goodness of ®t of
a parametric family of distributions. One reason for the popularity of the
smooth tests are the diagnostic properties commonly attributed to them.
In recent years, however, it has been realized that these tests are strictly non-
diagnostic when used conventionally. The paper examines how the smooth
test statistics must be rescaled in order to obtain procedures having diagnostic
properties at least for large sample sizes.
1 Introduction
X
sk
1 X n
^2
Cn; k U^n;2 j with U^n; j p hj Xi ; Q^n : 1
js1
n i1
x Q x Q 2 x
h1 x; Q p ; h2 x; Q p :
Q 2Q
For j 1, the equation EQ hj X ; Q 0 means that the expected value of the
Poisson distribution equals Q, whereas, for j 2, it stands
P nfor the equality of
expectation and variance. Since, putting Q^n X n n 1 j1 Xj ,
!
X
n
1 Xn
h1 Xi ; Q^n p Xi nX n 0;
i1 Q^n i1
the ®rst component U^n; 1 is zero; hence s 1 in (1). The reason for this is that
the usual estimator of Q is the method of moments estimator of the mean.
Furthermore,
C^ 2 U^ 2 1 Dn n 2 ;
n; 1 n; 2
2n
Pn
where Dn j1 Xj X n 2 =X n denotes Fisher's index of dispersion. Thus,
the smooth test for Poissonity based on the ®rst nonzero component U^n; 2 and
the dispersion test are equivalent.
The diagnostic properties commonly attributed to the smooth tests are one
reason for their popularity: it is assumed that such tests, in case of rejection of
a hypothesis H0 , provide some kind of directed diagnosis regarding the kind
of departure from H0 of the underlying distribution. The dispersion test, for
example, examines the ratio of variance to expected value, which equals one in
case of the Poisson distribution. If the ratio of empirical variance to arithmetic
mean is not su½ciently close to one, the hypothesis of Poissonity is rejected
(on a given signi®cance level). Furthermore, it is believed that the hypothetical
Diagnostic smooth tests of ®t 239
model has been rejected because the variance of the underlying true distribu-
tion di¨ers from its expectation. Henze and Klar (1996) showed that the con-
ventional use of the dispersion test does not qualify to this diagnosis since, on
one hand, the conclusion refers to a nonparametric population parameter but,
on the other hand, it is drawn from the distributional behaviour of the test
statistic under the `narrow' parametric hypothesis of Poissonity. In Henze
(1997) and Henze and Klar (1996), it was also shown how to modify this and
similar tests to obtain procedures having diagnostic properties at least for
large sample sizes. The correct approach is to formulate a nonparametric hy-
pothesis that is tailored to the desired kind of directed diagnosis.
In case of the dispersion test, this hypothesis is the equality of expected
value and variance. The rejection of this nonparametric hypothesis on a cer-
tain signi®cance level leads to the conclusion that the variance of the under-
lying distribution di¨ers from its expected value.
It is the aim of this paper to generalize these results to smooth tests of
arbitrary order. To this end, the joint limit distribution of components is de-
rived in Section 2 not only under the parametric hypothesis, but also under a
suitable nonparametric class of distributions.
Under the parametric hypothesis, and if the maximum likelihood estima-
tion method is used, the usual representation of the asymptotic distribution of
components as limit law of score statistics is regained.
In the nonparametric class of distributions, however, the appropriate esti-
mation method is the method of moments, and we show how the components
have to be rescaled in order to obtain test statistics which are asymptotically
distribution-free within this class.
Section 3 treats smooth tests for certain exponential families for which re-
scaling is particularly easy. This applies to all cases treated in Rayner and Best
(1989).
Since the assertion about the limiting distribution of components in Section
2 remains valid without the assumption that the polynomials hj are orthogonal,
diagnostic smooth tests can be constructed even when no orthogonal system
exists. As an example, smooth tests of ®t for bivariate Poissonity are discussed
in Section 4.
where 0 U k0 U s and
1 X n
U^n; j p hj Xi ; Q^n :
n i1
p ^ 1 X n
n Qn Q p l Xi ; Q oPQ 1;
n i1
The ®rst result establishes the joint asymptotic distribution of the random
vector U^n U^n; k0 1 ; . . . ; U^n; k0 k 0 under H0 , where the prime denotes trans-
pose. Thereby, h x; Q hk0 1 x; Q; . . . ; hk0 k x; Q 0 , and `Q h x; Q is the
k s-matrix with entries qhi x; Q=qQj , where Q Q1 ; . . . ; Qs 0 .
Pj
2.1 Theorem. a) Suppose that the coe½cients aji Q in hj x; Q i0 aji Qx i ,
k0 1 U j U k0 k, have a continuous derivative with respect to Q. Let Q^n
satisfy (R1). Then the limiting distribution of U^n under PQ is k-variate normal
N 0; S with covariance matrix
S EQ v X ; Qv X ; Q 0 ;
EQ `Q h X ; Q CQ ;
S Ik CQ I Q 1 CQ0 ;
where Ik denotes the identity matrix of order k, and I Q is the Fisher infor-
mation matrix with entries
q q
Ijk Q EQ log f X ; Q log f X ; Q :
qQj qQk
p P n
Proof: Let Un; j g 1= n i1 hj Xi ; g so that U^n; j Un; j Q^n . By the
mean value theorem,
1 X n
p hj Xi ; Q l Xi ; Q 0 E`g hj X ; gjgQ oPQ 1:
n i1
q q
0 EQ hi X ; Q hi x; Q f x; Qm dx
qQj qQj
q hi x; Q f x; Q
m dx
qQj
qhi x; Q q log f x; Q
f x; Qm dx hi x; Q f x; Qm dx
qQj qQj
qhi X ; Q q log f X ; Q
EQ EQ hi X ; Q :
qQj qQj
Remarks:
U^n; j 0;
j 1; . . . ; s:
Pj
Proof: Write hj x; Q l0 al Qx l . Since EQ hj X ; Q 0 for j V 1,
X
j
al Qml Q 0; Q A Y:
l0
1 1X n
p U^n; j hj Xi ; Q n
n n i1
X
j
al Q n ml
l0
X
j
al Q n ml Q n 0: 9
l0
metric family PQ , other estimation methods may be used instead of the (non-
parametric)
p method of moments. Then, for example, the ®rst component U^n; 1
n X n m1 Q^n examines whether m1 Q^n is a `reasonable' estimator of the
mean. In general, this will not be the case if P B fPQ g. A large or small value
of U^n; 1 indicates that the parametric model may not be appropriate. If the aim
is to use the test as a `diagnostic' procedure, the moment estimator of Q has to
be used, as will be explained in the following.
The equations EQ hj X ; Q 0, j 1; . . . ; s, determine the moments m1 ; . . . ;
ms , which depend on Q1 ; . . . ; Qs . Conversely, Q is a unique function of m1 ; . . . ; ms
by assumption. On the set
j
P : P : jxj P dx < y; j 1; . . . ; s; Q Q m1 ; . . . ; ms A Y ;
Tj P : hj ; d P dP; 5
2.3 Theorem. a) Suppose that, under H0 , the covariance matrix S in 2.1 is the
identitiy matrix, and, hence, C ^ 2 has a limiting chi-squared distribution with k
n; k
degrees of freedom under H0 . Further, let the support of P A P0 consist of at
least s k 1 elements. Then SP EP h X ; d Ph X ; d P 0 is nonsingular,
and U^n0 SP 1 U^n has a limiting wk2 -distribution under P as well.
b) If n V s k 1 and if the distribution function of P is continuous, then the
k k-matrix
X
n
^n 1
H h Xi ; Q^n h Xi ; Q^n 0
n i1
~ 2 : U^ 0 H
C ^ 1^
n; k n n Un
qhr x; Q X r
qai Q i X r
x ci Qhi x; Q:
qQj i0
qQj i0
0 0
where cs1 ; . . . ; csk A R are determined by cs1 ; . . . ; csk . Therefore, the
matrix SP is nonsingular. Using well-known results, assertion a) follows.
To prove b), note that H ^n is the empirical version of SP . Hence, the state-
ment about the distribution of C ~ 2 follows from a) if H ^n is nonsingular with
n; k
probability one.
To this end, let n V s k 1 and assume that X1 ; . . . ; Xsk1 take di¨erent
values. Adding s 1 rows X1j ; . . . ; Xsk1
j
; j 0; . . . ; s, to the k sk1-
matrix with rows
ax u Q x
pQ x ; ax V 0;
h Q
1X n
EQ tj X tj xi ; j 1; . . . ; s;
n i1
de®ning the method of moments estimators coincide. By Lemma 2.2, the ®rst
s components are zero. Since
X
s
`Q log pQ x `Q log C Q tj x`Q zj Q; 6
j1
where X n and s ^n denote the arithmetic mean and the empirical standard
deviation, respectively. In the class
Diagnostic smooth tests of ®t 247
P0 fP A P : T3 P T4 P 0; EP X 8 < yg;
the statistic
U^n;2 3 H
^44 2U^n; 3 U^n; 4 H
^34 U^ 2 H^
n; 4 33
~2
Cn; 2 ^33 H^44 H ^ 2
H 34
has a limiting w22 -distribution. The test based on C ~ 2 has diagnostic properties.
n; 2
If the skewness and/or the kurtosis of the underlying distribution di¨er from
the corresponding values ( 0 and 3) of the normal distribution, the non-
parametric hypothesis H ~0 : P A P0 will be rejected, at least for su½ciently large
sample sizes: if a A 0; 1 and w2;2 1 a denotes the 1 a-quantile of the w22 -
distribution, then
~ 2 > w2
lim P C
n!y n; 2 2; 1 a 1: 8
D
where N1 and N2 are independent unit normal random variables, and !
denotes weak convergence. The weights are given by
^2
Table 1. Percentage of 10000 Monte Carlo samples declared signi®cant by the tests based on Cn; 2
and C~n;22 for several distributions from P0 (a 0:1)
Distribution g1 g2 ^2
C C~n;22
n; 2
n 50 200 1000 10000 n 50 200 1000 10000
N 0; 1 1.0 1.0 10.6 10.2 10.1 10.1 10.2 10.1 10.0 10.2
NU 0:7; 3:506 .59 .36 5.2 2.3 1.2 0.8 8.1 4.7 6.2 9.4
NU 0:5; 5:331 .41 .40 4.8 1.3 0.5 0.4 11.2 5.9 6.7 9.1
NU 0:7; 1:307 1.2 1.4 11.9 15.1 16.3 17.0 15.5 14.9 12.3 10.7
NU 0:5; 1:258 1.4 1.9 14.9 20.3 22.4 25.1 21.9 20.5 14.6 11.6
NU 0:3; 1:180 1.8 3.0 20.0 31.5 35.5 37.3 32.2 28.7 18.9 11.5
tU 9; 5:892 .41 .45 4.8 1.5 0.7 0.5 10.9 6.0 6.3 10.0
tU 85; 1:306 1.5 2.2 14.9 22.9 26.3 28.4 22.9 23.7 16.6 11.1
tU 20; 1:494 1.8 4.2 15.7 28.2 37.0 42.7 26.5 29.3 22.3 13.8
X
min r; s
l1 l2 l3 l1 l3 r i l2 l3 s i l3i
f r; s; Q e ;
i0
r i! s i!i!
min r; s
! !
X r s l3 i
mr; s lr s
1 l2 i! :
i0 i i l1 l2
Hence, EX l1 , EY l2 and Cov X ; Y l3 . Higher non-central mo-
ments mr;0 s EX r Y s r s U 3 are given by
Diagnostic smooth tests of ®t 249
m2;0 0 l2
1 l1 ;
m3;0 0 l3 2
1 3l1 l1 ;
m2;0 1 l2
1 l2 2l1 l3 l3 l1 l2 ;
0 0 0
similar equations hold for m 0; 2 ; m 0; 3 and m1; 2 . Therefore, the corresponding
central moments mr; s E X EX Y EY s are m2; 0 m3;0 l1 , m 0; 2
r
4.1 Example. In the work of Rayner and Best (1995), smooth tests of ®t are
derived as score tests with respect to certain alternatives; but again, there is the
problem that the `smooth alternatives' do not exist.
Rayner and Best (1995) use the maximum likelihood method. The MLE
of l1 and l2 are given by l^1 X and l^2 Y , respectively.
Pn The MLE l^3 of
l3 di¨ers from the empirical covariance SXY n 1 i1 Xi X Yi Y
(i.e. from the corresponding estimator derived by the method of moments)
and has to be computed by an iterative technique (see Kocherlakota and
Kocherlakota (1992), Section 4.7).
Therefore, a test can be based on h1 x; y; Q x l1 y l2
x EX y EY . The appropriate centered component
1 X n
U^n; 1 p Xi l^1 Yi l^2 l^3
n i1
p
n SXY l^3
has a limiting normal distribution with expectation 0 under the hypothesis
of bivariate Poissonity. Computing the variance of U^n; 1 , normalizing and
squaring, yields a test statistic which has a limiting w12 -distribution (Rayner
and Best (1995), Section 3).
Obviously, the smooth test which rejects the hypothesis of bivariate
Poissonity for large values of U^n;2 1 is not a test having diagnostic properties: a
signi®cant test result does not necessarily mean that the covariance structure
of the data at hand is incompatible with that of a bivariate Poisson distri-
bution for a given signi®cance level (since l3 can take any positive value). It
rather means that the MLE di¨ers signi®cantly from the nonparametric mo-
ment estimator, indicating that the hypothetical parametric model may not be
appropriate.
250 B. Klar
1 X n
U^n; 1 p Xi l^1 2 l^1
n i1
p
n SXX l^1 ;
p
analogously, the second component is U^n; 2 n SYY l^2 .
It is not di½cult to verify that condition (R1) on page 5 is satis®ed with
l1 x; y; Q x l1 ; l2 x; y; Q y l2 and l3 x; y; Q xy l3 x l1 l2
y l2 l1 . Since Eqh1 =ql1 1 and Eqh1 =ql2 Eqh1 =ql3 0, the ®rst
entry in the covariance matrix S in Theorem 2.1 a) is
1
f md
4; 0 2d
m3; 0 l^1 2 l^1 md
0; 4 2md
0; 3 l^2 2 l^2
n
md
2; 2 md
2; 1 md
1; 2 SXY l^1 l^2 2 gC
^2
n
md
0; 4 2md
0; 3 l^2 2 l^2 SXX l^1 2
2 md
2; 2 md
2; 1 md
1; 2 SXY l^1 l^2 SXX l^1 SYY l^2
md
4; 0 2d
m3; 0 l^1 2 l^1 SYY l^2 2
Diagnostic smooth tests of ®t 251
has a limiting chi-squared distribution with two degrees of freedom in the class
of all bivariate discrete distributions having the property that the expectation of
each marginal distribution equals its variance.
Remark: Using the moments of the bivariate Poisson distribution, the cova-
riance matrix, after simpli®cation, takes the form
!
m4; 0 l2
1 l1 m2; 2 l1 l2 l3
S0 :
m 0; 4 l2
2 l2
This is the covariance matrix given in Rayner and Best (1995), Section 4. The
corresponding statistic has a limiting w22 -distribution under the hypothesis of
bivariate Poissonity, but not within the wider nonparametric class of distri-
butions de®ned above.
The last example showed how tests with diagnostic properties can be con-
structed in multivariate settings where no orthogonal system is available. A
general objection against the use of moment estimators is the low e½ciency
compared with other estimators; in case of the bivariate Poisson distribution,
the moment estimator SXY of l3 is not e½cient for large values of the corre-
lation coe½cient r (see Kocherlakota and Kocherlakota (1992), p. 108). This
does not a¨ect the testing procedure; but possibly one would prefer to con-
tinue working with an e½cient estimator. Yet this argument does not meet
the intention of a diagnostic test. The aim of a directed test is to examine
whether the data coincide with a certain (simple) model in important charac-
teristics (the ®rst moments). If this is the case, the theoretical model is used
even if it is not the `true' underlying distribution. The term `e½ciency', how-
ever, is meaningful only within the parametric model.
Similarly, the statement that diagnostic tests do not always have good
power as tests of ®t is directed at the parametric model. However, diagnostic
tests are not a new goodness of ®t statistic for testing the parametric model
but they aim to test a di¨erent (nonparametric) hypothesis. Therefore, com-
parisons with the power of goodness of ®t tests for the parametric model are
of limited meaning.
Acknowledgements. This work is based on a part of the author's doctoral thesis at the University
of Karlsruhe, written under the supervision of Professor Norbert Henze, whose guidance is
gratefully appreciated.
References
Bargal AI, Thomas DR (1983) Smooth goodness of ®t tests for the Weibull distribution with
singly censored data. Commun. Statist.-Theory Meth. 12:1431±1447
Boulerice B, Ducharme GR (1995) A note on smooth tests of goodness of ®t for location-scale
families. Biometrika 82:437±438
Bowman
p KO, Shenton BR (1975) Omnibus test contours for departures from normality based on
b1 and b2 . Biometrika 62:243±250
Chihara TS (1978) An introduction to orthogonal polynomials. New York, Gordon and Breach
Gastwirth JL, Owens MEB (1977) On classical tests of normality. Biometrika 64:135±139
Henze N (1997) Do components of smooth tests of ®t have diagnostic properties? Metrika 45:121±
130
252 B. Klar
Henze N, Klar B (1996) Properly rescaled components of smooth tests of ®t are diagnostic. Aus-
tral. Journ. Statist. 38:61±74
Johnson NL, Kotz S, Kemp AW (1992) Univariate discrete distributions. J. Wiley & Sons, New
York
Kallenberg WCM, Ledwina T, Rafajlowicz E (1997) Testing bivariate independence and nor-
mality. SankhyaÅ Ser. A 59:42±59
Kocherlakota S, Kocherlakota K (1992) Bivariate discrete distributions. Marcel Dekker, New
York
Koziol JA (1986) Assesing multivariate normality: a compendium. Commun. Statist.-Theory
Meth. 15:2763±2783
Koziol JA (1987) An alternative formulation of Neyman's smooth goodness of ®t tests under
composite alternatives. Metrika 34:17±24
Lehmann EL (1983) Theory of point estimation. J. Wiley & Sons, New York
Mardia KV, Kent JT (1991) Rao score tests for goodness of ®t and independence. Biometrika
78:355±363
Neyman J (1937) Smooth test for goodness of ®t. Skandinavisk Aktuarietidskrift 20:149±199
Rayner JCW, Best DJ (1986) Neyman-type smooth tests for location-scale families. Biometrika
73:437±446
Rayner JCW, Best DJ (1988) Smooth tests of goodness of ®t for regular distributions. Commun.
Statist.-Theory Meth. 17:3235±3267
Rayner JCW, Best DJ (1989) Smooth tests of goodness of ®t. New York, Oxford University Press
Rayner JCW, Best DJ (1995) Smooth tests for the bivariate Poisson distribution. Austral. J.
Statist. 37:233±245
Thomas DR, Pierce DA (1979) Neyman's smooth goodness-of-®t test when the hypothesis is
composite. J. Amer. Statist. Ass. 74:441±445