Behavioral Path Model Rules
Behavioral Path Model Rules
Multivariate Behavioral
Research
Publication details, including instructions for
authors and subscription information:
http://www.tandfonline.com/loi/hmbr20
Changing a Causal
Hypothesis without
Changing the Fit: some
Rules for Generating
Equivalent Path Models
Ingeborg Stelzl
Version of record first published: 10 Jun
2010.
This article may be used for research, teaching, and private study
purposes. Any substantial or systematic reproduction, redistribution,
reselling, loan, sub-licensing, systematic supply, or distribution in any
form to anyone is expressly forbidden.
The publisher does not give any warranty express or implied or make
any representation that the contents will be complete or accurate
or up to date. The accuracy of any instructions, formulae, and drug
doses should be independently verified with primary sources. The
publisher shall not be liable for any loss, actions, claims, proceedings,
demand, or costs or damages whatsoever or howsoever caused arising
directly or indirectly in connection with or arising out of the use of this
material.
Downloaded by [McGill University Library] at 01:14 20 December 2012
Multivariate Behavioral Research, 1986,21,309331
Since computer programs have been available for estimating and testing linear caiusal
models (e.g. LISREL by J6reskog & Stirborn) these models have been used increasi~igly
in the behavioral sciences. This paper discusses the problem that very different caiusal
structures may fit the same set of data equally well. Equivalence of models is defined as
Downloaded by [McGill University Library] at 01:14 20 December 2012
undecidability in principle and four rules are presented showing how equivalent models
may be generated. Rules I and If refer to inversions in the causal order of variables and
Rules III and IV to replacements of paths by the assumption of correlated residuals.
Three examples are given to illustrate how the N ~ are S applied.
The author thanks the editor and two anonymous reviewers for their helpful
comments on an earlier version of this paper.
All oomspondence to the author should be addressed to: Ingeborg Stelzl, Fachber-
eich Psychologie der Universiat Marburg, Gutenbergstr. 18, D 355 Marburg, F.R.G.
where
Pi = vector of path coefficients for variable i, containing the
partial regression weights of the i-th variable on the
variables XI, X2...Xi-1
Z(i-l, = covariance matrix of the variables XI, X2..Xid1
Ziill = inverse of $i-ll
= vector of covariances of Xi with XI, X2..Xi-I
ofi)
A recursive path diagram as given in Equation 1 is called
complete if all paths are free parameters. A complete model, however,
is not testable, because the number of parameters is equal to tlhe
number of elements in the covariance matrix 2. To get a testablle
model some restrictions must be introduced to reduce the number of
parameters. For instance, some parameters might be replaced by kcd
constants, if their numerical values are known a priori from theory.
" The number of parameters might also be reduced by constraining some
parameters to be equal. In practice, however, the most common
JULY 1986 311
lngeborg Stelzl
restriction is to assume that some path coefficients are zero and to omit
these paths in the path diagram. Since the structural equations for
these models are special cases of the equational system given as
Equation 1, Equation 2 applies for these models too.
If, for instance, a psychologist assumes that Xg (= level of
education) has no direct influence on Xh (= neuroticism) and therefore
omits the path from Xgto Xhthat means that in Equation 1
Phg =0
Downloaded by [McGill University Library] at 01:14 20 December 2012
Notation
The general structure of a complete recursive path diagram is
determined when the causal order of the variables is known. For the
general structure of a model with zero constraints to be known, the
causal order and the zero paths must be given, as shown in Figure 1.
We will use this way of notation for the equation system of a path
diagram with zero constraints.
312 MULTIVARIATE BEHAVIORAL RESEARCH '
lngeborg Stelzl
Variable:
I
I
A, B, C , D - - . . - - - Q
Position: 1, 2 , 3 , U . . .n ....
Figure 1. Path diagram with three zero-paths: AC, AD, BC.
when different path models lead to the same restrictions in the partial
correlation will be our main concern in the following.
Rule I
Downloaded by [McGill University Library] at 01:14 20 December 2012
Figure 2. Path diagram with one zero-path: EM. All other path coefficients are free parameters.
If we look for changes in the causal order that can be made without
changing the set of variables partialled out in the critical partial
regression equations for K and P, we will find that we may:
Interchange the variables on position 1 to s-1;
Interchange the variables on positions s+ 1 to t-1;
Interchange the variables on positions t+ 1 to n.
These changes correspond to Rules Ia and Ib given above. Further-
more, one may interchange E and K according to Rule Ic. Interchang-
ing the positions of I and P, however, would change the set of variables
Downloaded by [McGill University Library] at 01:14 20 December 2012
Rule 11:
In Rule I only those changes in causal orderings were considered
which leave the set of predictors in the critical regression equation
(that is the equation with the zero restriction) unchanged. We now ask
for cases in which a partial correlation remains zero, in spite of
deleting or adding a variable to the set of variables, which are
partialled out. The answer is given by Rule 11.
Let Po be a recursive path diagram in which the variable E has no
direct influence on the variables M and N. Let E be at position g and M
and N on the adjacent positions h and h+l (with g < h), as shown in
Figure 4.
When we invert the positions of M and N and keep the restriction
of zero.paths from E to M and N, the new model PI is equivalent to 9 0 .
This rule can be derived fmm the well known formula for the
change in partial correlations that occurs, when predictors are added.
The restriction of Po are:
318 MULTIVARIATE BEHAVIORAL RESEARCH
lngeborg Stelzl
p(MEIA.. D,F..L,N) -
p(NEIA..D,F..L) = 0 or p(NE/A..D,F..L) = 0.
0 or p(ME1A.. D,F..L,N) = 0.
From the general formula for partial correlations (Lord & Novick,
1968, p.280) we have
Downloaded by [McGill University Library] at 01:14 20 December 2012
Hence
Furthermore ,
p(ME/A.D,F. L,N) =
.
[p(MEIA.D,F..L) - p(MN1A.DP.L) p(ENIA.DF.L)I/Denorn. =
EO - p(IJ1A.DF.L) .O]/Denom. = 0,
as required in the second restriction of P I . Adding N to the
predictors of M does not effect the zero-path EM.
Rule 111:
Again let Po be a recursive path diagram from n variables. Let
variable E be at position g and variable M at position h, with g < h:
Variable A,B ....B....M .....Q
Position 1,2 ....g ...h .....a
All%path coefficients for paths leading to M are free parameters. All
other path coefficients may be free or constrained.
The substitution of the path from E to M by assuming correlated
residuals results in an equivalent model if all path coefficients for M
are free parameters.
If you omit the path from E to M and replace it with correlated
residuals, the path coefficients for all other variables (besides M)
remain unchanged, because they are regressed on the same predictors
in Po and P I . Only the regression equation for M is changed, since E is
no longer taken as a determiner of M and is deleted from the set of
variables on which M is regressed. Thereby the regression coeacients
of the remaining predictors of M may be changed and thus must be re-
computed. The remaining portion of covariance between E and M,
which cannot be attributed to common predictors, is now "explained"
as covariance of residuals. If there are no constraints concerning the
path coefficients of M this way of substituting always produces a
solution for PI.Conversely, if there are no constraints in the path
coefficients for M (the variable for which a predictor is added), one can
always substitute correlated residuals by a directed path.
Rule IV:
Let Po again be a recursive path diagram in which variable E has
no direct effect on variable M and N.Let E be at position g and M and
N at the adjacent positions h and h + 1,with g < h, as has been shown
in Figure 4.
320 MULTIVARIATE BEHAVIORAL RESEARCH
lngeborg Stelzl
Concerning the paths other than EM and EN, we now make the
Downloaded by [McGill University Library] at 01:14 20 December 2012
following assumptions: The path coefficients for all other paths leading
to N are free. The path coefficientsfor all other paths leading to M may
be free or constrained. The path coefficients for all other variables
(besides M and N)may be free or constrained.
The path from M to N can be replaced by the assumption of
correlated residuals, if all variables (in this example variable E) ibr
which fixed zero-paths are assumed for variable N have fixed zero-
paths also for variable M. (Since the reverse need not be true, this
condition is somewhat weaker than the condition given in Rule 111 .1
When the path from M to N is replaced by correlated residuals
only the regression equation for N is changed. Therefore the path
coefficients (including the fixed zeros) for all other variables, besid,es
N, will remain unchanged.
In the equation for N the predictor M is deleted. But as has alrea~dy
been shown in deriving Rule 11, this will not affect the zero-path EIV:
The constraint of Pl that
Examples
help one avoid overlooking a rival hypothesis that is equally well supported by the
data and to prevent unwarranted conclusions.
It rnay happen, however, that a researcher has a covariance matrix but haq no
model. In this case our rules will not help. Dempster (1972) and Wermuth (1976,
1980)present a technique, which they call covariance selection analysis, and give
backward and forward strategies for finding partial correlations that are zero in a
covariance matrix. This technique might be used in such situations. A broader
discussion of methods for developing and modifying a model with the data already
at hand is given by Learner (1978).
Example 1
In Figure 5 a path-diagram involving four variables is shown. The
order of the variables is A, B, C, D with zero-constraints for the paths
from A to C and D.
Rule Ia allows the inversion of A and B
Rule I1 allows the inversion of C and D
Rule Ic allows the inversion of A and C
(or A and D, if CD have been inverted first).
The combination of these possibilities results in eight different
permutations, as shown in Figure 6.
These eight permutations correspond to only six different path
diagrams, which are given in Figure 7. Two diagrams (7 and 8) are
identical to diagrams already given (3 and 4) because they differ only
in the positions of variables, which are on adjacent positions and have
zero-paths between them.
The 6 path diagrams generated by the rules given above are
equivalent. That means that there is no way of deciding between them,
whatever the covariance matrix of the variables may be.
Besides inverting positions of variables, there are further possibil-
ities for generating equivalent models by assuming covariances in the
residuals instead of directed paths, according to Rules III and IV.In
diagrams 1 to 4, you may replace the paths between A and B and
between C and D, and in diagrams 5 and 6 any of the paths between C,
B, and D by the assumption of correlated errors.
322 MULTIVARIATE BEHAVIORAL RESEARCH
A
- B
lngeborg Ste~lzl
C D
I
II B A
LD Cl
Figure 6. Models resulting from the path diagram given in Figure 5 by permutationaccording to Rule I and
II.
Example I1
The data are taken from Sewell, Haller and Ohlendorf (1970) and
have already been reanalysed by other authors (Wiley, 1973). In the
LISREL-manual (Joreskog & Sijrbom, 1984, p. 111.40)a simple model is
suggested from which the reader may start with his modificatio~is.
This model and the LISREL-estimates for its parameters are given in
Figure 8. The model has three zero-paths: AE, BE and BC. We will now
apply our rules to see which models are equivalent to the model given
in Figure 8.
Rule I: The two zero-paths BE and AE do not impose any
restrictions on the order of variables A to D. Were these zero-paths the
only ones, the sequence of variables A, B, C, D would be completely
free. The zero-path BC, however, sets considerable limits to the
admissable permutations: D and E must stay to the right of B or C, so
that the path from C to D for example cannot be inverted. Indeed, these
JULY 1986 323
lngeborg Stelzl
Downloaded by [McGill University Library] at 01:14 20 December 2012
7 same as 3
8 same as 4
are only two inversions left: the inversion of A and B, which is trivial,
and the inversion of B and C, which does not make sense considering
the contents of the variables.
Rule I I The conditions for applying this rule are not met.
Rule 111: Any of the paths leading to D might be replaced by
correlating residuals. The path coefficients for the other paths leading
to D would thereby be changed.
Rule IV: The conditions for applying this rule are not met.
Yet, the model given in Figure 8 does not fit the data and each of
the equivalent models would of course give the same bad fit. The
modification indices suggest adding a path from B to E. If one does so,
324 MULTIVARIATE BEHAVIORAL RESEARCH
lngeborg Stelzl
Mental
A=~bility - .589 Academic
C = Performance
Educational
-
=Aspiration
tatus us -
Socioeconomic .245 Signif i c a n t
D=~ther's
Downloaded by [McGill University Library] at 01:14 20 December 2012
A B C D E
Figure 8. Path diagramand estimated parametersfor me model given in the LISREL-Manualas Computer
Exercise 5 "Educational Attainment".
Figure 9. Path diagram and estimated parameters for the model given in Figure 8 with two paths adtled.
Mental
,,*=Ability - 589 Academic
t =P e r f o r m a n c ~
I-
B=~tatus -
Socioeconomic .127 Sianif i c a n t
U = ~ 6 e r ' s~ n f l u e i c e .-?73/
Downloaded by [McGill University Library] at 01:14 20 December 2012
Figure 10. Path diagram and estimatedparametersfor the model given in Figure 9 with the direction of the
path between D and E inverted.
may be plausible for the path from C to E, because both variables are
influenced by scholastic motivation. The results are given in Figure
11.Similarly, any of the paths leading to D or E might be replaced by
correlated residuals. This would lead to many equivalent models with
varying coefficients for the paths left, in the model.
Example ZZZ
In our last example latent variables are involved. The model given
in Figure 12 is part of a more complex model which Lohmoller (1984)
fitted to the data of Marjoribanks (1972). The numerical values
Performance
Figure 11. Path diagram and estimated parametersfor the model given in Figure 9 with the path from C to
E replaced by mrrelated residuals.
the inner model), which account for the correlations between the
observables. Since there are 66 correlations but only 15 paramett?rs,
there are 41 degrees of freedom and the model seems to he highly
restrictive.
Father Mother
I
I
Dominance ..J,
-\
Test 4
Activities
Figure 12. Pam diagram with three latent variable*. The numerical values for the path wefflcients i r ~the
inner model and for the factor loadings are taken from LohrnoIIer (1984, p.3-09 and p.4-05).
verified from the tracing rules of path aqalysis (Van de Geer, 1971, p.
124) or the equations given by Jiireskog and Siirbom (1984, p.I,8), two
equivalent structural equation models combined with the same mea-
surement model produce the same covariance matrices and therefore
the two combined models are equivalent too. The numerical values for
the path coefficients which result when the ordering of the latent
variables is changed to AA,C, B are given in Figure 13.
Looking at Lohmoller's solution, an interesting empirical result
might be that direct dominance pressure does harm to the intellectual
abilities of the child, while indirect pressure, which raises the child's
intellectual activities, has positive effects. But in the model shown in
Figure 13, which as an equivalent model must give exactly the same
fit, the path from parental dominance to mental ability is almost zero.
Therefore, without further information, one cannot draw the psgcho-
logical conclwion that parental dominance has a direct harmful effect
on IQ, because the mqdel of Figure 13, which is a plausible rival
hypothesis and is in conflict with this result, cannot be excluded.
As has been stated above, Figure 12 presents only part of Loh-
miillex's model. His complete model contains 6 latent variables and 18
observables. In this model there are many zero-constraintsand invert-
ing the path fkom "Parental Dominance" to "intellectual Activity"
would not result in an equivalent model, But also in his model this
path may be replaced by carrelating residuals, and this modifloation
again leads to a solytion with a path coefficientof about zero for the
path from "'Parental Dominanioe" to "Mental Ability."
So, which kind of information woqld help US to decide about the
direction of the path between B and C? We neeld a more complex model
with more prior knowledge. In Figure 14 two madels (A and B ) are
presented to &ow how a further variable X qight be added, so thpt in
the resulting model changiqg; t h position
~ of $ and It:would no longer
lead to eqvivalence. Mowever, according to Rule 111replacing the path
from B ta C would still had, m equivalent moddLTherefwe in model C
a further variable Y is intmduc~dand t h , ~path f?am Y to C is fixed to
328 MULTIVARIATE BEHAVIORAL RESEARCH
lngeborg Stelzl
Father Mother
Parental
A =
Dominance
Test 1
Downloaded by [McGill University Library] at 01:14 20 December 2012
B=
Activities
C I ( m
~ a
Figure 13. Path diagram, equivalent b the diagram given in Figure 12.
n
X A B C
n
A B X C
m
Y X A B C
Figure 14. Three mcdels, where inverting BC doss not lead to equivalence
zero. Since there is now a constraint in the equation for C Rule 111does
not apply any longer. Since now the two models generated by inverting
the path BC or by replacing it by correlated errors are no longer
equivalent, one may at least hope that the data will aIIow acceptrmce
of one model and rejection of the others.
JULY 1986 329
lngeborg Stelzl
So far we have been concerned with only the inner model and have
not questioned the measurement model. Further equivalent models
might be found when the latent variables are redefined, for example by
rotating factors, by defining second order factors and the like. Yet,
with a highly restrictive factor pattern which contains many zero-
loadings, rotation will not lead to more simplicity but only destroy a lot
of zero-loadings. So we will not go into further detail.
Discussion
Downloaded by [McGill University Library] at 01:14 20 December 2012
The examples given above were chosen to show how the rules for
generating equivalent models are applied. As already stated, equiva-
lence means that there is no way to decide between different models,
whatever the covariance matrix looks like. But psychologists who are
engaged in applied research are not interested in all theoretically
possible covariance matrices, but in the ones they actually have. Hence
they are not interested only in equivalent models but in all models that
might fit their data-irrespective of the question of whether these
models might be discernible from other covariance matrices and
therefore not be equivalent. Their problems go beyond what has been
discussed here. Furthermore, researchers do not know the population
covariance matrix exactly but only have estimates for it. Therefore
those models which could be exduded if the population matrix were
known. but cannot be excluded with the data at hand remain in the pool
of possible alternatives.
Thus we have three kinds of undecidability:
1. Undecidability in principle by equivalence of the models,
2. Undecidability given a special covariance matrix for the popu-
lation, and
3. Undecidability with the data at hand.
This paper only dealt with the first of these issues, indecidability
in principle. Yet, our discussion appears to be relevant for every
researcher using covariance structure analysis: Before gathering data
for a hypothetical model, it will be usefhl to generate equivalent
models according to the rules given above. By doing so, one will see
beforehand what will definitely not be decided by the data. Perhaps
one will find that there is undecidedness only in some minor details.
Perhaps one will be able to modify one's model to get a more stable
structure. In some cases, one may even conclude that one should not
gather the data a t all. In any case, it will not be a waste of time to
consider these problems before carrying out an empirical investiga-
tion.
330 MULTIVARIATE BEHAVIORAL RESEARCH
lngeborg Stelzl
References
Lord, F.M., & Novick, M. R. (1968). Statistical theories of mental test scores. Reading,
MA: Addison-Wealey.
Land, K. C. (1969). Principles of path analysis. In E. F. Borgatta (Ed.), Sociologrical
methodology. 4 . 3 - 3 7 . San Francisco: Jossey-Bass.
Land, K. C. (1973). Identification, parameter estimation, and hypothesis testing in
recursive sociological models. In A. S. Goldberger & 0.D. Duncan (Eds.), Structural
equntion models in the social sciences. 4.1 9 4 9 . New York: Seminar Press.
Learner, E. E. (1978). SpeciF:atwn searches--Ad hot inference with nowperimental
dnta, New York: Wiley.
LohmGller, J. B. (198.4). LVPLS, Progmm Manual. Latent variables path analysis with
partial least-squares estimation. Koeln: Zentralarchiv fiir empirische Sozialifors-
chung.
Malinvaud, E. (19801. Statistical methods of econometrics (3rd ed.). Amsterdam: Ncrrth-
Holland Publishing Company.
Majoribanks, K. (1972). Environment, social class, and mental abilities. Jourmzl of
Educational Psychology, 63,103-109.
McDonald, R. P. (19'78).A simple comprehensive model for the analysis of covariimce
structures. British Journal of Mathematical and Statistical Psychology, 31, 59--72.
McDonald, R. P. (1980). A simple comprehensive model for the analysis of covari,ance
structures: Some remarks on applications. British Jourml of Mathematical and
StcrtiStical Psychology, 33,161-188.
Sewell, W. H., Haller, A. O., & Ohlendorf, G. W. (1970). The educational and eaarly
occupational attainment process: Revisions and Replications. American Sociolo~~ical
Review, 35,1014-1027.
Van de Geer, J. P. (1976). Introduction to multwariate analysis for the social sciences.
San Francisco: Freeman.
Wermuth, N. (1976). Model search among multiplicative models. Biometries, 32, 253-
263.
~ e G u t hN.
, (1980). Linear recursive equations, covariance selection, and path analy-
sis. Journal of the American Statistical Association, 75, 963-972.
Wiley, D. E. (1973). The identification problem for structural equation models with
unmeasured variables. h A. S. Goldberger & 0.D. Duncan (Eds.), Structural
equation h l s in the social sciences (pp. 69-89). New York: Seminar Press.