How Few Countries Will

Hctuxu

Uploaded by

sudangsuroy9

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views7 pages

How Few Countries Will

Hctuxu

Uploaded by

sudangsuroy9

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Survey Research Methods (2012)

Vol.6, No.2, pp. 87-93

ISSN 1864-3361 © European Survey Research Association
http://www.surveymethods.org

How few countries will do? Comparative survey analysis from a

Bayesian perspective
Joop Hox
Utrecht University

Rens van de Schoot Suzette Matthijsse

Utrecht University Utrecht University and Dept of Public Health,
University Medical Center, Rotterdam

Meuleman and Billiet (2009) have carried out a simulation study aimed at the question how
many countries are needed for accurate multilevel SEM estimation in comparative studies.
The authors concluded that a sample of 50 to 100 countries is needed for accurate estimation.
Recently, Bayesian estimation methods have been introduced in structural equation modeling
which should work well with much lower sample sizes. The current study reanalyzes the
simulation of Meuleman and Billiet using Bayesian estimation to ﬁnd the lowest number of
countries needed when conducting multilevel SEM. The main result of our simulations is that
a sample of about 20 countries is suﬃcient for accurate Bayesian estimation, which makes
multilevel SEM practicable for the number of countries commonly available in large scale
comparative surveys.
Keywords: Multilevel SEM, sample size, cross-national research, Bayesian estimation

1 Introduction assumed, multigroup SEM can be used to investigate the de-

gree of equivalence of structural (substantive) models across
International cross-cultural and other comparative sur- countries.
veys involve a number of analysis issues. Measurement in-
struments must often be translated into different languages, When the number of countries is large, multi-group SEM
which raises the issue of measurement equivalence. Can we becomes unwieldy. The setups become complicated, espe-
assume that these instruments measure the same constructs cially if subtle differences in measurement properties must be
in the same way? We need to assess whether we have mea- included. The statistical model for the structural differences
surement equivalence, and if not we need to investigate how also becomes complicated. Multi-group SEM is a fixed ef-
we may correct measures in order to achieve measurement fects model, which means that it takes each group or country
equivalence. Next, the analysis focuses on examining rela- as given and the set of countries as the complete universe
tionships within and between countries (or other contexts). to generalize to. Unless many equality constraints are im-
That is, relationships can be established at the individual posed, SEM estimates a unique set of parameter values for
level within each country, but in comparative research the each different country, which results in a large model. Mul-
central issue is often the question whether such relationships tilevel modeling (MLM) offers a different approach. Multi-
are the same or different across countries. Finally, if we es- level modeling treats the countries as a sample from a larger
tablish differences between countries, the question is whether population. Instead of estimating a different parameter value
country characteristics can explain such differences. for each country, it assumes a (normal) distribution of pa-
The classic approach to deal with these questions is rameter values and estimates its mean and variance. This
structural equation modeling (SEM) using a multi-group makes MLM much more parsimonious than SEM when the
analysis. This analysis method makes it possible to test number of countries increases. In addition, differences be-
equivalence of measurement models; special procedures for tween countries can be modeled formally using country-level
categorical data enable SEM to be used to estimate and variables. For a general introduction to multilevel modeling,
test Item Response (IRT) models. Criteria for measurement we refer to Goldstein (2011), Raudenbush and Bryk (2002)
equivalence were already formulated by Jöreskog (1971), for and Hox (2010). Multilevel modeling for comparative sur-
a review we refer to Vandenberg and Lance (2000), while for veys has been discussed by Hox, de Leeuw and Brinkhuis
a discussion in the context of comparative surveys we refer to (2010) and Van de Vijver, van Hemert and Poortinga (2008).
Harkness et al. (2010). If measurement equivalence may be We mention in passing that multilevel modeling of compara-
tive survey data not only poses statistical questions, but also
methodological questions about the design. The statistical
model assumes random sampling at all levels, while the sur-
Contact information: Joop Hox, Utrecht University, the Nether- vey design in fact does not use sampling at the country level.
lands, e-mail: j.hox@uu.nl We can still use multilevel modeling, but its use is based on

87
88 JOOP HOX, RENS VAN DE SCHOOT AND SUZETTE MATTHIJSSE

the advantages of a model based approach where we can ex- sion, Meuleman and Billiet confirm the suggestion that about
plicitly include country level explanatory variables and coun- 50 countries is the minimum sample size at the second level
try level residual variation in the model, rather than a sample for accurate estimation in multilevel SEM.
design based argumentation. We refer to Groves (1989) for a The sample size requirements suggested by the simula-
discussion of these two perspectives. tion studies reviewed above imply that for most comparative
When multigroup SEM is used, the number of countries surveys the country level sample sizes are problematic. For
is not a principled issue. Multigroup SEM can be used to instance, the European Social Survey round four (2008) in-
compare any number of groups. If the number of groups is cludes 30 countries (http://www.europeansocialsurvey.org),
huge, there may be practical analysis issues, such as the ca- the third wave of SHARE (2008-2009) includes 13 coun-
pacity of the software or the computer (or even the interpre- tries (http://www.share-project.org), the 2007 wave of
tational capacity of the analyst), but there is no formal lower the mathematics survey TIMMS includes 36-48 countries
or upper limit on the number of groups. In multilevel anal- (http://nces.ed.gov/timss), and the 2009 large scale educa-
ysis, the second level sample size (in comparative surveys tional assessment PISA sponsored by the OECD includes 65
generally the number of countries) is an issue. The second countries (http://www.opisa.oecd.org). These country level
level sample size must be large enough to permit accurate sample sizes suggest that only the larger collaborative com-
parameter estimates and associated standard errors. parative surveys involve enough countries to consider em-
Simulations have shown that multilevel regression mod- ploying multilevel SEM, but the majority appears too small
eling can be used with second-level samples as low as 20, to employ multilevel structural equation modeling.
provided that the interpretation focuses on the regression co- Recently, Bayesian estimation methods have been intro-
efficients (Maas and Hox 2005). However, accurate estima- duced in structural equation modeling (Lee 2007). Bayesian
tion and testing of variances requires much larger sample estimation works well with lower sample sizes, and will not
sizes, Maas and Hox (2005) suggest 50 groups as a lower produce inadmissible parameter estimates such as negative
limit when variances are important. Structural equation mod- variances. Bayesian methods generally imply prior informa-
eling with latent variables relies on (co)variances, which sug- tion in the analysis, but when uninformative priors are used
gests that for multilevel SEM even larger samples are needed this has only a small effect on the resulting parameter esti-
for accurate estimation. Indeed, a simulation involving a mates.
two-level confirmative factor model shows that with fewer The goal of the current paper is to examine how well
than 50 groups, the group level model parameters and their Bayesian estimation deals with the problem of estimating pa-
corresponding standard errors are not estimated with accept- rameters in a multilevel SEM model with a small sample size
able accuracy (Hox, Maas and Brinkhuis 2010). These simu- at the country level. The paper starts with an introduction
lations suggest that for accurate estimation at least 50 groups of Bayesian estimation methods and the issues involved in
should be available. a Bayesian multilevel SEM analysis. Next, it describes the
Meuleman and Billiet (2009) have carried out a simula- simulation design which is patterned after Meuleman and
tion study directly aimed at the question how many countries Billiet (2009). Our simulation design explicitly studies the
are needed for accurate multilevel SEM estimation in com- accuracy of the estimation method with very small numbers
parative surveys. They specified within country sample sizes of countries. The results and their implications for the anal-
to follow the sample sizes typically achieved in the European ysis of comparative surveys are discussed in detail.
Social Survey. The number of countries was varied from 20 We provide a basic introduction of Bayesian statistics,
to 100. The simulation model at both the individual and the but interested researchers could further refer to Lynch (2007)
country level is a confirmative one-factor model for four in- for an introduction to Bayesian estimation, and for technical
dicator variables, plus a structural effect predicting the factor details to Gelman, Carlin, Stern, and Rubin (2004). Bayesian
from an exogenous observed variable. Meuleman and Billiet structural equation modeling is discussed by Lee (2007) and
(2009) conclude that a sample of 20 countries is simply not Bayesian multilevel modeling by Hox (2010). In this pa-
enough for accurate estimation. They do not suggest a spe- per we use the software Mplus (Muthén and Muthén 1998-
cific lower limit for the country level sample size; instead, 2010) because it is often used by applied researchers. For the
they discuss how model complexity and goal of the analy- technical implementation of Bayesian statistics in Mplus, see
sis affect the country level sample size requirements. How- Asparouhov and Muthén (2010).
ever, their simulation results indicate that if we require that
the 95% confidence interval for country level factor loadings 2. Estimation methods in
lies in fact between 90 and 99 percent, which corresponds to multilevel SEM
a bias of about 5%, we require at least 60 countries. For 60
countries, the empirical alpha level for a test that the struc- In this section we describe briefly different estimation
tural effect equals zero is 0.083, which is acceptable. With methods for multilevel SEM, including Bayesian estimation.
40 countries, the empirical alpha level is 0.103, which is not For a more elaborate accessible introduction we refer to Hox
acceptable (cf. Boomsma and Hoogland 2001). The power (2010), and for a statistical treatment we refer to Kaplan
for a medium size structural effect at the country level is (2009).
0.523 with 60 countries, well below the value of 0.80 that Multilevel SEM assumes sampling at both individual
Cohen (1988) recommends as a worth pursuing. In conclu- and country levels. The individual data are collected in a
HOW FEW COUNTRIES WILL DO? COMPARATIVE SURVEY ANALYSIS FROM A BAYESIAN PERSPECTIVE 89
p-variate vector Yi j (subscript i for individuals, j for groups). More formally, let M be a statistical model with a vector
The data Yi j are decomposed into a between groups (Group of unknown parameters θ, for example regression parame-
level) component YB = Y j , and a within groups (individ- ters and correlations, and let Y be the observed data set with
ual level) component YW = Yi j − Y j . These two compo- sample size n. In Bayesian estimation, θ is considered to be
nents are orthogonal and additive, thus YT = YB + YW . The random and the behavior of θ under Y in such a Bayesian
population covariance model can be described by
matrices
are also orthogonal and ad-
ditive, thus T = B + W . Multilevel structural equa-
tion modeling assumes that the population covariance ma- p(θ|Y, M) ∝ p(θ|M) × p(Y|θ, M) (2)

trices B and W are described by distinct models for the
between groups and within groups structure. Several ap- where p(Y|θ, M) is the likelihood function, the information
proaches have been proposed to estimate the parameters of about the parameters in the data, p(θ|M) is the prior distribu-
the multilevel SEM. Muthén (1989) suggests to approximate tion, the information about the parameters before observing
the full maximum likelihood solution by assuming equal the data, and p = (θ|Y, M) is the posterior distribution, the in-
group sizes, which leads to a limited information estimation formation about the parameters after observing the data and
method called MUML (for Muthén’s Maximum Likelihood). taking the prior information into account.
For the prior distribution, we have a fundamental choice be-
A more accurate way to estimate a model for B and W
is a Weighted Least Squares (WLS) method implemented in tween using an informative prior or an uninformative prior.
Mplus. Full maximum likelihood estimation for multilevel An informative prior is a peaked distribution with a small
structural equation modeling requires to model the raw data. variance, which expresses a strong belief about the unknown
This minimizes the fit function given by population parameter, and has a substantial effect on the pos-
terior distribution. In contrast, an uninformative or diffuse
prior serves to produce the posterior, but has very little influ-

N
N −1 ence. An example of an uninformative prior is the uniform
F= log| i| + log(xi − μi ) i (xi − μi ), (1) distribution, which simply states that all possible values for
i=1
i=1 the unknown parameter are equally likely. Another exam-
where the subscript i refers to the observed ple of an uninformative prior is a very flat normal distribu-
cases, xi to those
variables observed for case i, and μi and i contain the pop- tion specified with an enormous variance. Sometimes such
ulation means and covariances of the variables observed for a prior is called an ignorance prior, to indicate that we know
case i. Mehta and Neale (2005) show that models for mul- nothing about the unknown parameter. However, this is not
tilevel data, with individuals nested within groups, can be accurate, since total ignorance does not exist. All priors add
expressed as a structural equation model. The fit function some information to the data, but diffuse priors add very lit-
(1) applies, with clusters as units of observation, and indi- tle information, and therefore do not have much influence
viduals within clusters as variables. Unbalanced data, here on the posterior. For our analyses we used the default prior
unequal numbers of individuals within clusters, are included specifications of Mplus which uses uninformative priors.
the same way as incomplete data in standard
SEM. The two- If the posterior distribution has a mathematically simple
stage approaches that model B and W separately (MUML form, the known characteristics of the distribution can be
and WLS) include only random intercepts in the between used to produce point estimates and confidence intervals.
groups model, the full ML representation can incorporate However, in complex models the posterior is generally a
random slopes as well (Mehta and Neale 2005). Maximum complicated multivariate distribution, which is often math-
likelihood estimation assumes large samples, and relies on ematically intractable. Therefore, simulation techniques are
numerical methods to integrate out random effects. In com- used to generate random draws from the multivariate poste-
parison, Bayesian methods are reliable in small samples, and rior distribution. These simulation procedures are known as
are better able to deal with complex models. The Bayesian Markov Chain Monte Carlo (MCMC) simulation. MCMC
approach is fundamentally different from classical statistics simulation is used to produce a large number of random
(Barnett 2008). In classical statistics, the population param- draws from the posterior distribution, which is then used to
eter has one specific value, only we happen to not know it. compute a point estimate and a confidence interval (for an
In Bayesian statistics, we express the uncertainty about the introduction to Bayesian estimation including MCMC meth-
population value of a model parameter by assigning to it a ods see Lynch 2007). Typically, the marginal (univariate)
probability distribution of possible values. This probability distribution of each parameter is used.
distribution is called the prior distribution, because it is spec- Given a set of initial values from a specific multivariate dis-
ified independently from the data. After we have collected tribution, MCMC procedures generate a new random draw
our data, this distribution is combined with the Likelihood of from the same distribution. Suppose that Z (1) is a draw from
the data to produce a posterior distribution, which describes a target distribution f (Z). Using MCMC methods, we gener-
our uncertainty about the population values after observing ate a series of new draws: Z (1) → Z (2) → . . . → Z (t) . MCMC
our data. Typically, the variance of the posterior distribution methods are attractive because, even if Z (1) is not from the
is smaller than the variance of the prior distribution, which target distribution f (Z), if t is sufficiently large, in the end
means that observing the data has reduced our uncertainty Z (t) is a draw from the target distribution f (Z). Having good
about the possible population values. initial values for Z (1) helps, because it speeds up the conver-
90 JOOP HOX, RENS VAN DE SCHOOT AND SUZETTE MATTHIJSSE

(a) within (individual) level (b) between (country) level

Figure 1. Path diagram for within (individual) and between (country) level

gence on the target distribution, so the classical maximum is more difficult to determine than the mean, the mean of the
likelihood estimates are often used as initial values for Z (1) . posterior distribution is also often used. In skewed posterior
The number of iterations t needed before the target distri- distributions, the median is an attractive choice. In Bayesian
bution is reached is referred to as the ‘burn in’ period of the estimation, the standard deviation of the posterior distribu-
MCMC algorithm. It is important that the burn in is com- tion is comparable to the standard error in classical statis-
plete. To check if enough iterations of the algorithm have tics. However, the confidence interval generally is based on
passed to converge on the target distribution, several diagnos- the 1/2 α and 100 − 1/2 α percentiles around the point esti-
tics are used. A useful diagnostic is a graph of the successive mate. In the Bayesian terminology, this is referred to as the
values produced by the algorithm. A different procedure is 100 − α% credibility interval. Mplus by default uses the me-
to start the MCMC procedure several times with widely dif- dian of the posterior distribution for the point estimate, and
ferent initial values. If essentially identical distributions are the percentile-based 95% credibility interval, which we have
obtained after t iterations, we decide that t has been large followed in our simulations. Bayesian methods have some
enough to converge on the target distribution (Gelman and advantages over classical methods. To begin, in contrast to
Rubin 1992). the asymptotic maximum likelihood method, they are valid
An additional issue in MCMC methods is that successive in small samples. Given the correct probability distribution,
draws are dependent. Depending on the distribution and the the estimates are always proper, which solves the problem of
amount of information in the data, they can be strongly cor- negative variance estimates. Finally, since the random draws
related. Logically, we would prefer independent draws to are taken from the correct distribution, there is no assumption
use as simulated draws from the posterior distribution. One of normality when variances are estimated. In this study, we
way to reach independence is to omit a number of succes- examine if Bayesian estimation will help in drawing correct
sive estimates before a new draw is used for estimation. This inferences in multilevel SEM if the number of groups (coun-
process is called thinning. To decide how many iterations tries) is relatively small. The simulation studies cited in the
must be deleted between two successive draws, it is useful introduction typically find that at smaller country level sam-
to inspect the autocorrelations between successive draws. If ple sizes the parameter estimates themselves are unbiased,
the autocorrelations are high, we must delete many estimates. but that the standard errors are underestimated, which leads
Alternatively, since each draw still gives some information, to poor control of the alpha level and undercoverage for the
we may keep all draws, but use an extremely large number confidence intervals. We expect that the credibility intervals
of draws. in our Bayesian estimation will perform better at the country
level sample sizes usually encountered in comparative survey
The mode of the marginal posterior distribution is an at- research.
tractive point estimate of the unknown parameter, because it
is the most likely value, and therefore the Bayesian equiva-
lent of the maximum likelihood estimator. Since the mode
HOW FEW COUNTRIES WILL DO? COMPARATIVE SURVEY ANALYSIS FROM A BAYESIAN PERSPECTIVE 91
3. Simulation design Table 2: Statistical power for detecting the country level structural
effect, for various effect sizes and country level sample sizes
The simulation design in this study closely follows Meule-
Number of countries
man and Billiet (2009). The model at both the individual and
the country level is a one-factor model with four indicators. Bayesian estimation ML estimation1
There is one structural effect from an observed exogenous 10 15 20 20 40 60
variable on the factor. Figure 1 shows the path diagram with Effect size
the population parameter values. None (0.00) 0.03 0.05 0.05 0.16 0.10 0.08
The simulated data were generated from a population that Small (0.10) 0.04 0.06 0.06 0.18 0.15 0.16
has the same characteristics as used in Meuleman and Billiet Medium (0.25) 0.08 0.13 0.15 0.31 0.41 0.53
(2009:48): Large (0.50) 0.26 0.43 0.58 0.75 0.94 0.99
• The observed variables have a multivariate distribu- Very large (0.75) 0.67 0.89 0.97 1.00 1.00 1.00
1
tion. Parameters estimated by Meuleman and Billiet 2009.

• The intraclass correlation of the observed indicators is

0.08. Table 1 shows that, compared to ML estimation, Bayesian
estimation tends to result in a much larger bias for the coun-
• The within level unstandardized factor loadings are try level residual variance estimates, but to less bias for the
0.90, 0.90, 0.75 and 0.70. country level factor loadings and the structural effect. The
95% credibility intervals show a much better coverage in
• The between level factor unstandardized loadings are Bayesian estimation than their maximum likelihood based
0.27, 0.27, 0.28 and 0.28. counterparts. For example, with 20 countries the between
level factor loadings have a mean absolute bias of 0.03 in
• The within level independent variable has an unstan- Bayesian estimation, and -0.07 in Maximum Likelihood es-
dardized effect of 0.25. timation. The actual coverage of the nominal 95% interval is
0.94 in Bayesian estimation, and 0.84 with Maximum Like-
• The between level independent variable has an effect lihood estimation, which is woefully inadequate.
that is manipulated. One condition has an effect size Table 2 shows the proportion of p-values below 0.05, for
of 0.00. The other effect sizes were manipulated to various effect sizes. For an effect size of zero, the table shows
be 0.10 (small), 0.25 (medium), 0.50 (large) and 0.75 the operating alpha level, which indicates the prevalence of
(very large), following Cohen’s (1988) suggestions for the type I error. It is clear that ML estimation does not con-
effect sizes. trol the alpha level well, with an operating alpha level of 16%
with twenty countries. Thus, if the nominal alpha level is set
• The within level sample size is 1755. at the common value of 0.05, the prevalence of type I errors
Meuleman and Billiet generate data for five different num- is actually 0.16. The alpha level is much better controlled
bers of countries: 20, 40, 60, 80 and 100. We have generated in Bayesian estimation, where even at 10 countries the op-
data for 10, 15 and 20 countries, with 1000 replications for erating alpha level is 0.03, which is reasonably close to the
each condition in our simulation design. nominal alpha level of 0.05.
We have used Mplus 6.1 for our simulation. Mplus has Table 2 also shows that with a small number of countries the
a set of commands that can be used to tweak the Bayesian power in both Bayesian and Maximum Likelihood to detect
estimation process. Assuming that most users will use the anything but the largest effects is low. When the effect size
default settings, we have not attempted to modify the de- is not zero, ML estimation does reject the null hypothesis
fault settings. The major issue here is to let Mplus automat- more often than Bayesian estimation. For example, with 20
ically decide how long the burn-in must be. Mplus uses the countries the power to detect a large effect is 0.58 in Bayesian
Gelman-Rubin potential scale reduction (PSR; Gelman and estimation and 0.75 in Maximum Likelihood estimation. As
Rubin 1992) to decide when the chain has converged. By we showed above, this increased power is at the expense of a
default, two independent MCMC chains are produced, and very poorly controlled alpha level.
the between and within chain variation is compared. When
the between chain variance is smaller than 0.05, convergence 5. Discussion
is assumed. Lee (2007) discusses this and other Bayesian The results of the simulation show that Bayesian estima-
model checks, we will come back to this issue in the discus- tion indeed can get away with far fewer countries than Max-
sion.1 imum Likelihood estimation. Both the parameter estimates
and the coverage of the 95% interval are surprisingly good.
4. Results However, the between level residual error variances are esti-
mated very poorly. We come back to this issue later in the
The simulation results are summarized in Table 1, which
also reports a selection of the results obtained by Meuleman 1
One simulation run encountered convergence problems, which
and Billiet (2009). were solved by setting this convergence criterion to 0.01.
92 JOOP HOX, RENS VAN DE SCHOOT AND SUZETTE MATTHIJSSE

Table 1: Mean absolute bias for various country level sample sizes

Number of countries
Bayesian estimation ML estimation1
10 15 20 20 40 60
Parameter bias
Within factor loadings 0.00 0.00 0.00 0.00 0.00 0.00
Within error variances 0.00 0.00 0.00 0.00 0.00 0.00
Within structural eﬀect 0.00 0.00 0.00 0.00 0.00 0.00
Between factor loadings 0.50 0.04 0.03 -.07 -.03 -.02
Between error variances 0.59 0.33 0.24 -.10 -.05 -.04
Between structural eﬀect -.05 -.05 -.04 0.11 0.05 0.04

Coverage
Within factor loadings 0.95 0.96 0.95 0.93 0.94 0.94
Within error variances 0.93 0.94 0.93 0.93 0.94 0.94
Within structural eﬀect 0.95 0.95 0.96 0.93 0.94 0.94
Between factor loadings 0.96 0.96 0.94 0.84 0.89 0.91
Between error variances 0.95 0.95 0.94 0.81 0.88 0.90
Between structural eﬀect 0.96 0.94 0.95 0.85 0.90 0.92
1
Parameters estimated by Meuleman and Billiet 2009.

discussion, when we discuss convergence problems in the as we have countries, we recommend inspection of autocor-
Bayesian context. With respect to statistical power, it is clear relations and setting much stricter criteria for convergence.
that Bayesian estimation does not solve the problem of small In fact, if we deviate from the software defaults and set the
sample, only very large country level effects can be discov- convergence criteria much stricter, the bias in the residual
ered when the number of countries is small. variances at the country level becomes much smaller, at the
The results also show that Bayesian estimation is not magic. cost of a much increased computation time.
With ten countries, problems start to show in the summary ta- Softwarewise, we have simply specified a different estima-
bles, but they are clearer when the simulation output is stud- tion method. From a principled standpoint, we have chosen
ied in more detail. For the condition with ten countries, each a different kind of statistics. As a result, the 95% credibil-
simulation run contains some outliers for the estimates of the ity interval now may correctly be interpreted as the interval
error variances and corresponding standard errors, with esti- that contains the population parameter with 95% probability.
mates up to twenty times the population values. Such outliers In our power table, we presented p-values. In the Bayesian
would be recognized as such in a real analysis. The between case, this is not the normal p-value, but the so-called poste-
model contains a total of 10 parameters, so it is not surprising rior predictive p-value. This is roughly interpreted as a stan-
that problems arise when the number of countries approaches dard p-value, but it is actually a different entity. Bayesian
the number of parameters in the between model. Simplifying modeling in general prefers that decisions about parameters
the model, for instance by using the mean of the observed in- are based on credibility intervals, and that decisions about
dicators instead of a latent variable would make estimation models are based on comparative evidence, such as informa-
easier. tion criteria or Bayes factors. A discussion of these issues is
We briefly mentioned convergence problems and outlying beyond the scope of this paper, but we believe that applied
estimates. In MCMC estimation, convergence means con- researchers should be aware that doing a Bayesian analysis
vergence of the chain to the correct distribution. In our simu- is not just choosing a different estimation method.
lation, we have decided to emulate a relatively nave user and
therefore to follow all defaults implemented in the software In our analysis, we have chosen the default uninformative
(Mplus 6.1). We also used an automatic cut-off criterion to priors provided by Mplus. Other choices are possible. One
decide whether convergence had been reached. In one sim- interesting option is using an informative prior. For example,
ulation run, we needed to change the default criterion to a the default prior for a factor loading in Mplus is a normal
more strict value. Textbooks introducing Bayesian statistics distribution with a mean of zero and a very large variance
caution users to always use diagnostic tools such as plots of (1010 ). We have more prior knowledge than that. If we model
the iteration history (trace plots, c.f. Gelman, Carlin, Stern seven-point answer scales with an underlying factor, using
and Rubin 2004; Lynch 2007), and we completely agree with standard identifying constraints, we know that the (absolute)
such recommendations. Obviously, in a simulation, visually factor loadings will not exceed, say, the value ten. Why not
inspecting trace plots for 15,000 replications times 20 param- use a prior distribution that reflects this knowledge? In doing
eters is not possible. In applied Bayesian analysis, we con- so, we would become real subjectivist statisticians, a posi-
sider such inspection mandatory. In addition, especially in tion that is far away from mainstream statistics. If we im-
modeling situations as extreme as having as many parameters pose priors that describe only realistic parameter values, the
convergence problem discussed above will disappear. But in
HOW FEW COUNTRIES WILL DO? COMPARATIVE SURVEY ANALYSIS FROM A BAYESIAN PERSPECTIVE 93
small samples, such prior information could easily dominate Harkness, J. A., Braun, M., Edwards, B., Johnson, T. P., Lyberg,
the information in the data. In this paper, we have taken the L. E., Mohler, P. P., et al. (2010). Survey methods in multina-
position that this is undesirable, and prefer to work with un- tional, multiregional, and multicultural contexts. Chicester, UK:
informative priors. Wiley.
Hox, J. J. (2010). Multilevel analysis. Techniques and applications.
Acknowledgements NY: Routledge.
Jöreskog, K. G. (1971). Simultaneous factor analysis in several
Rens van de Schoot received a grant from the Netherlands populations. Psychometrika, 36, 409-426.
Organisation for Scientific Research: NWO-VINI-451-11- Kaplan, D. (2009). Structural equation modeling (2nd ed.). Thou-
008. sand Oaks, CA: Sage.
Lee, S. (2007). Structural equation modeling: a Bayesian ap-
References proach. Chicester, UK: Wiley.
Lynch, S. M. (2007). Introduction to applied Bayesian statistics
Asparouhov, T., & Muthén, B. (2010). Bayesian anal-
and estimation for social scientists. Berlin: Springer.
ysis of latent variable models using Mplus. Version 4.
Unpublished manuscript, accessed October 13, 2011 on Maas, C. J. M., & Hox, J. J. (2005). Sufficient sample sizes for mul-
http://www.statmodel.com/download/BayesAdvantages18.pdf. tilevel modeling. Methodology: European Journal of Research
Barnett, V. (2008). Comparative statistical inference. Chicester, Methods for the Behavioral and Social Sciences, 1, 85-91.
UK: Wiley. Mehta, P. D., & Neale, M. C. (2005). People are variables too:
Boomsma, A., & Hoogland, J. J. (2001). The robustness of LIS- multilevel structural equations modeling. Psychological Meth-
REL modeling revisited. In R. Cudeck, S. du Toit, & D. Sörbom ods, 10, 259-284.
(Eds.), Structural equation modeling: Present and future. A Meuleman, B., & Billiet, J. (2009). A Monte Carlo sample size
Festschrift in honor of Karl Jöreskog (p. 139-168). Chicago: study : How many countries are needed for accurate multilevel
Scientific Software International. SEM? Survey Research Methods, 3, 45-58.
Cohen, J. (1988). Statistical power analysis for the behavioral Muthén, B. (1989). Latent variable modeling in heterogeneous
sciences. Mahwah, NJ: Lawrence Erlbaum Associates. populations. Psychometrika, 54, 557-585.
Gelman, A., Carlin, J. B., Stern, H. S., & Rubin, D. B. (2004). Muthén, L. K., & Muthén, B. O. (1998-2010). Mplus user’s guide
Bayesian data analysis (2nd ed.). Boca Raton, FL: Chapman & (6th ed.). Los Angeles, CA: Muthén & Muthén.
Hall/CRC. Raudenbush, S. W., & Bryk, A. S. (2002). Hierarchical linear
Gelman, A., & Rubin, D. B. (1992). Inference from iterative simu- models (2nd ed.). Thousand Oaks, CA: Sage.
lation using multiple sequences. Statistical Science, 7, 457-511. Van de Vijver, F. R., van Hemert, D. A., & Poortinga, Y. H. (Eds.).
Goldstein, H. (2011). Multilevel statistical models. Chicester, UK: (2008). Multilevel analysis of individuals and cultures. NY:
Wiley. Taylor & Francis.
Groves, R. M. (1989). Survey errors and survey costs. New York:
Wiley.

Multilevel Analysis Techniques and Applications Th... - (1. Introduction To Multilevel Analysis)
No ratings yet
Multilevel Analysis Techniques and Applications Th... - (1. Introduction To Multilevel Analysis)
7 pages
In The Social Sciences (Pp. 95-124) - New York: Routledge
No ratings yet
In The Social Sciences (Pp. 95-124) - New York: Routledge
45 pages
2007 03 Multilevel Modelling
No ratings yet
2007 03 Multilevel Modelling
55 pages
8) Multilevel Analysis
No ratings yet
8) Multilevel Analysis
41 pages
Structural Equation Modeling
No ratings yet
Structural Equation Modeling
8 pages
Structural Equation Modeling
100% (1)
Structural Equation Modeling
7 pages
Glossary Multilevel Analysis
No ratings yet
Glossary Multilevel Analysis
8 pages
Structural Equation Modeling (SEM) in Social Sciences & Medical Research: A Guide For Improved Analysis
No ratings yet
Structural Equation Modeling (SEM) in Social Sciences & Medical Research: A Guide For Improved Analysis
12 pages
A Model For Integrating Fixed Random and
No ratings yet
A Model For Integrating Fixed Random and
21 pages
A Tutorial For Analyzing Structural Equation Modelling
No ratings yet
A Tutorial For Analyzing Structural Equation Modelling
7 pages
Chapter 9: Analysis of Mulicountry Data
No ratings yet
Chapter 9: Analysis of Mulicountry Data
2 pages
Issues in The Structural Equation Modeling of Complex Survey Data
No ratings yet
Issues in The Structural Equation Modeling of Complex Survey Data
6 pages
Broc, Guillaume - Gana, Kamel - Structural Equation Modeling With Lavaan-Wiley-ISTE (2019) - 61-63
No ratings yet
Broc, Guillaume - Gana, Kamel - Structural Equation Modeling With Lavaan-Wiley-ISTE (2019) - 61-63
3 pages
SEM for MBA Weekend Students
100% (1)
SEM for MBA Weekend Students
9 pages
Random Fixed Effects Sem
No ratings yet
Random Fixed Effects Sem
21 pages
What Is Structural Equation Modeling?
No ratings yet
What Is Structural Equation Modeling?
5 pages
Multigroup SEM Analysis & Moderation
No ratings yet
Multigroup SEM Analysis & Moderation
5 pages
Stats Reporting Script
No ratings yet
Stats Reporting Script
4 pages
Random Fixed Effects Sem
No ratings yet
Random Fixed Effects Sem
20 pages
Multilevel Models Applications Using SAS® - (1 Introduction)
No ratings yet
Multilevel Models Applications Using SAS® - (1 Introduction)
12 pages
Multilevel Modeling for Educators
No ratings yet
Multilevel Modeling for Educators
28 pages
Multilevel SEM Bias Correction
No ratings yet
Multilevel SEM Bias Correction
23 pages
Deng Yuan SEM15
No ratings yet
Deng Yuan SEM15
18 pages
Comparative Research Methods
No ratings yet
Comparative Research Methods
7 pages
NCME12
No ratings yet
NCME12
53 pages
EJ787904
No ratings yet
EJ787904
26 pages
Bayesian Structural Equation Models: A Health Application: Stojanovski, E. and K. Mengersen
No ratings yet
Bayesian Structural Equation Models: A Health Application: Stojanovski, E. and K. Mengersen
7 pages
Quantitative Cross-National Research Methods
No ratings yet
Quantitative Cross-National Research Methods
14 pages
Explaining Fixed Effects Random Effects Modeling of Time Series Cross Sectional and Panel Data
No ratings yet
Explaining Fixed Effects Random Effects Modeling of Time Series Cross Sectional and Panel Data
21 pages
Comparative Politics and The Comparative Method AREND LIJPHART University of Leiden
No ratings yet
Comparative Politics and The Comparative Method AREND LIJPHART University of Leiden
20 pages
Best Practice Recommendations For Using Structural Equation Modelling in Psychological Research
No ratings yet
Best Practice Recommendations For Using Structural Equation Modelling in Psychological Research
16 pages
Part II Unit 1
No ratings yet
Part II Unit 1
17 pages
Multilevel Model Analysis Using R: Nicolae-Marius Jula
No ratings yet
Multilevel Model Analysis Using R: Nicolae-Marius Jula
12 pages
SEM Thesis Writing Challenges
100% (2)
SEM Thesis Writing Challenges
8 pages
Jorg Blasius, Michael J. Greenacre-Visualization of Categorical Data-Academic Press Inc (1998)
No ratings yet
Jorg Blasius, Michael J. Greenacre-Visualization of Categorical Data-Academic Press Inc (1998)
615 pages
SEM RandomfixedAndersen2021
No ratings yet
SEM RandomfixedAndersen2021
20 pages
Fullpaper Revision Farisca Susiani
No ratings yet
Fullpaper Revision Farisca Susiani
7 pages
Week 4 - Primary Research Design & Comparability & Equivalence CH 5 and CH 611
No ratings yet
Week 4 - Primary Research Design & Comparability & Equivalence CH 5 and CH 611
37 pages
Fifty Years of Structural Equation Modeling
No ratings yet
Fifty Years of Structural Equation Modeling
53 pages
SEM Notes
No ratings yet
SEM Notes
3 pages
Comparative Analysis Guide
No ratings yet
Comparative Analysis Guide
8 pages
Characteristics of SEM: Cov R SD SD
No ratings yet
Characteristics of SEM: Cov R SD SD
9 pages
SEM 2023 Garret
No ratings yet
SEM 2023 Garret
25 pages
Scale: Comparative and Non Comparative Scaling Composite Measures
No ratings yet
Scale: Comparative and Non Comparative Scaling Composite Measures
42 pages
SMDE - (US) Experts Sesion Multivariate Analysis
No ratings yet
SMDE - (US) Experts Sesion Multivariate Analysis
4 pages
Statistical Analysis of Survey Data
No ratings yet
Statistical Analysis of Survey Data
30 pages
Structural Equation Modeling Lecture Notes
100% (1)
Structural Equation Modeling Lecture Notes
40 pages
An Introduction To Hierarchical Linear Modeling
No ratings yet
An Introduction To Hierarchical Linear Modeling
18 pages
Lavaan Multilevel Zurich2017
100% (1)
Lavaan Multilevel Zurich2017
162 pages
SEM for Advanced Research Methods
No ratings yet
SEM for Advanced Research Methods
2 pages
SEM Stata Materials
100% (1)
SEM Stata Materials
13 pages
Chapter09 MDA 8e
No ratings yet
Chapter09 MDA 8e
23 pages
MG Sent R1
No ratings yet
MG Sent R1
19 pages
Thesis Format - Updated!
No ratings yet
Thesis Format - Updated!
17 pages
Homicide Studies 2011 Nivette 103 31
No ratings yet
Homicide Studies 2011 Nivette 103 31
30 pages
Morin Et Al Doubly Latent Multilevel Procedures For Organizational Assessment and Prediction
No ratings yet
Morin Et Al Doubly Latent Multilevel Procedures For Organizational Assessment and Prediction
26 pages
Fixed Random Effects
No ratings yet
Fixed Random Effects
3 pages
Determination of The Aluminium Content in Different Brands of Deodor
No ratings yet
Determination of The Aluminium Content in Different Brands of Deodor
14 pages
Stats Guide
No ratings yet
Stats Guide
2 pages
2007 CSR
No ratings yet
2007 CSR
9 pages
International+Legal+Framework+on+Human+Trafficking+and+Criminal+Liability+on+Traffickers
No ratings yet
International+Legal+Framework+on+Human+Trafficking+and+Criminal+Liability+on+Traffickers
15 pages
Unit (6) Articles - Class 6 English Grammar Book
No ratings yet
Unit (6) Articles - Class 6 English Grammar Book
11 pages
A Comparison of Foreign Policies Between China And
No ratings yet
A Comparison of Foreign Policies Between China And
5 pages
Grammarism Articles Test 5 1614399
No ratings yet
Grammarism Articles Test 5 1614399
1 page
6-3-56-468
No ratings yet
6-3-56-468
3 pages
41551
No ratings yet
41551
10 pages
Extended Theories of Local Government
No ratings yet
Extended Theories of Local Government
9 pages
Lwp30001 Ejsshvol8 2 Sept24
No ratings yet
Lwp30001 Ejsshvol8 2 Sept24
18 pages
IJAREM-D5055
No ratings yet
IJAREM-D5055
9 pages
08-58296_tool_2-4
No ratings yet
08-58296_tool_2-4
5 pages
COMPARATIVEMETHODASTHEFLAGSHIPOFPOLITICALSCIEENCERSEARCHMETHODS
No ratings yet
COMPARATIVEMETHODASTHEFLAGSHIPOFPOLITICALSCIEENCERSEARCHMETHODS
13 pages
Theories of Local Government
No ratings yet
Theories of Local Government
2 pages
Conditional Sentence
No ratings yet
Conditional Sentence
13 pages
2nd Mid 4th Sem. Shima Mam
No ratings yet
2nd Mid 4th Sem. Shima Mam
6 pages
Cover Page
No ratings yet
Cover Page
1 page
635 (1) - 11-20
No ratings yet
635 (1) - 11-20
10 pages
Group Assignment Cover Page JU
No ratings yet
Group Assignment Cover Page JU
1 page
Understanding Conflict and Resolution Processing Globalized World
No ratings yet
Understanding Conflict and Resolution Processing Globalized World
7 pages
Concept Nature and Scope of Political Economy
No ratings yet
Concept Nature and Scope of Political Economy
20 pages
Mass Upsurge 1969
No ratings yet
Mass Upsurge 1969
12 pages
CCC PP MassUpsurge1969!5!16
No ratings yet
CCC PP MassUpsurge1969!5!16
12 pages
FRRFFT (:.... ,...... ,...... : Ir ( (Fi TL
No ratings yet
FRRFFT (:.... ,...... ,...... : Ir ( (Fi TL
1 page
5th Semester Previous Year Question
No ratings yet
5th Semester Previous Year Question
13 pages
Phrase Identification
No ratings yet
Phrase Identification
9 pages
The Election of 1970
No ratings yet
The Election of 1970
3 pages
Robust Multiple Linear Backward Eliminationregression: Dhaka University Journal of Science October 2023
No ratings yet
Robust Multiple Linear Backward Eliminationregression: Dhaka University Journal of Science October 2023
9 pages
Military and Politics
No ratings yet
Military and Politics
1 page
Six Point Movement How It Became The Cha
100% (1)
Six Point Movement How It Became The Cha
4 pages
Village Leaders and Rural Development in
No ratings yet
Village Leaders and Rural Development in
6 pages
Iran
No ratings yet
Iran
6 pages
v2 Certificate of Instrument Validation
No ratings yet
v2 Certificate of Instrument Validation
2 pages
A Detailed Lesson Plan in Mathematics in The Modern World
No ratings yet
A Detailed Lesson Plan in Mathematics in The Modern World
11 pages
Statistics Cheat Sheet for Students
No ratings yet
Statistics Cheat Sheet for Students
3 pages
ISO9001 Practice Exam
100% (12)
ISO9001 Practice Exam
12 pages
Synopsis Swati Adde
No ratings yet
Synopsis Swati Adde
14 pages
PISA 2015 Technical Report Chapter 16 Procedures and Construct Validation of Context Questionnaire Data PDF
No ratings yet
PISA 2015 Technical Report Chapter 16 Procedures and Construct Validation of Context Questionnaire Data PDF
56 pages
Events Management Dissertation Topics PDF
100% (2)
Events Management Dissertation Topics PDF
6 pages
Table of The Student's Answers On The Try-Out Test Items
No ratings yet
Table of The Student's Answers On The Try-Out Test Items
4 pages
Joshua's Return On Investment of Competency Mapping 1.123456ppt
No ratings yet
Joshua's Return On Investment of Competency Mapping 1.123456ppt
31 pages
SMART Indicator Development Guide
No ratings yet
SMART Indicator Development Guide
84 pages
HTC - Sec. - Social Studies General
No ratings yet
HTC - Sec. - Social Studies General
6 pages
GATE Engineering Mathematics Material
No ratings yet
GATE Engineering Mathematics Material
17 pages
Guide in Embedding Ethical Considerations in The Manuscript
No ratings yet
Guide in Embedding Ethical Considerations in The Manuscript
1 page
Road Traffic Accident Risk Prediction and Key Factor Identification Framework Based On Explainable Deep Learning
No ratings yet
Road Traffic Accident Risk Prediction and Key Factor Identification Framework Based On Explainable Deep Learning
15 pages
50 - Ministerial Roll - Madras Diocese
No ratings yet
50 - Ministerial Roll - Madras Diocese
6 pages
SADCAS F134 (E) - Proficiency Testing - ISO 15189 - 2022 Clause 7.3.7.3 & SADCAS Requirements (Issue 1)
No ratings yet
SADCAS F134 (E) - Proficiency Testing - ISO 15189 - 2022 Clause 7.3.7.3 & SADCAS Requirements (Issue 1)
4 pages
1 s2.0 S0169814107001709 Main
No ratings yet
1 s2.0 S0169814107001709 Main
8 pages
RM 0 CourseOutline
No ratings yet
RM 0 CourseOutline
7 pages
Ijrm 13 061
No ratings yet
Ijrm 13 061
10 pages
Research Proposal
No ratings yet
Research Proposal
9 pages
Epm6 Slides Ch05 How To Plan A TPM Project
100% (1)
Epm6 Slides Ch05 How To Plan A TPM Project
68 pages
SOP Plankton
No ratings yet
SOP Plankton
20 pages
Piping Design Guide for Non-Engineers
No ratings yet
Piping Design Guide for Non-Engineers
20 pages
Practices, Usage and Perceived Effectiveness of AI Tools Among IT Students of CSU-A
0% (1)
Practices, Usage and Perceived Effectiveness of AI Tools Among IT Students of CSU-A
48 pages
Action Research and Methodology Quiz
No ratings yet
Action Research and Methodology Quiz
25 pages
UNIT-5: Procedure of T-Test
No ratings yet
UNIT-5: Procedure of T-Test
12 pages
Comparing The Use of Open and Closed Questions For Web-Based Measures of The Continued-Influence Effect
No ratings yet
Comparing The Use of Open and Closed Questions For Web-Based Measures of The Continued-Influence Effect
15 pages
44a Statistical Diagrams Box Plots - H - Question Paper
No ratings yet
44a Statistical Diagrams Box Plots - H - Question Paper
17 pages
GBD160NGM - Addendum - 2237 - Angelina Malik
No ratings yet
GBD160NGM - Addendum - 2237 - Angelina Malik
1 page
Personalization Versus Privacy An Empirical Examin
No ratings yet
Personalization Versus Privacy An Empirical Examin
23 pages

How Few Countries Will

Uploaded by

How Few Countries Will

Uploaded by

Survey Research Methods (2012)

Vol.6, No.2, pp. 87-93

How few countries will do? Comparative survey analysis from a

Rens van de Schoot Suzette Matthijsse

1 Introduction assumed, multigroup SEM can be used to investigate the de-

       

• The intraclass correlation of the observed indicators is

You might also like