Romppel 2013
Romppel 2013
com
Abstract
Since the dimensionality and the related psychometric properties of the 12-item General Health Questionnaire (GHQ-12) are still under
debate, the present study compares different factor solutions from the literature to determine which shows the best fit and to investigate
reliability and construct validity. The analyses are based on a German population based representative sample (N = 2,041), using face-to-face-
interviews. The confirmatory factor analysis indicates the best fit to the one-factor model including response bias on the negative worded
items according to Hankins. Thus, the importance of methodical aspects for the dimensionality was emphasized. Moreover, the correlations
of the different subscales of the two- and three-factor models with several external criteria (BDI, PHQ-2, SF-36, PHQ-Anxiety, SPIN) do not
substantially differ. The preferred unidimensional model shows good psychometric properties. According to its associations with the external
criteria under study, the GHQ-12 as a unidimensional measure seems to be a useful screening tool for the assessment of mental distress or a
minor psychiatric morbidity with a main focus on depressive symptomatology.
© 2013 Elsevier Inc. All rights reserved.
items influences the interpretation of these items and the support or discourage the future application of the instrument
response of the participants. In its consequence it has an in population based studies.
effect on the factorial structure. This effect of the wording
on the factorial structure has been confirmed in different
studies on the dimensionality of other psychometric 2. Methods
instruments like the Rosenberg Self-Esteem Scale and the
Life-Orientation-Test [19-21]. Marsh [19] preferred a 2.1. Subjects
unidimensional solution with response bias on the nega- A representative sample of the German general
tively phrased items for the Rosenberg Self-esteem scale population was selected with the assistance of a
[19]. Up to now studies have assumed that the GHQ-12 is demographic consulting company. The area of Germany
free of response bias. Following the finding of Marsh [19], was separated into 201 sample areas representing the
Hankins tested a unidimensional model including response different regions of the country. Households of the
bias for the negatively phrased items for the GHQ-12 in respective area and members of this household fulfilling
comparison with a simple unidimensional model and a the inclusion criteria (age at or above 14, able to read and
three-dimensional model [17]. This unidimensional model understand the German language) were selected randomly
including response bias showed superior fit compared to the by Kish-selection-grid technique. The Kish-selection-grid-
two other models under study. Hankins suggests that this technique targets individuals on the doorstep among
finding underpins that the previous findings for the factorial household residents. The system is devised so that all
structure of the GHQ-12 were based on methodical artefacts individuals in a household have an equal chance of
[22]. A study by Ye [18] replicated the results of Hankins selection. The sample is representative in terms of age,
[17,22] in a Chinese sample. The study of Wang and Lin gender, and education. A first attempt was made for 3,194
[23] showed that the GHQ-12 has a unidimensional addresses, of which 3,108 were valid. If not at home, a
structure after controlling for wording effects. maximum of four attempts were made to contact the
In summary, the factorial structure of the GHQ-12 selected person. 872 subjects (28.1%) refused participa-
remains under debate [15,18,22], and thus, a solid statement tion, 137 subjects (4.4%) were not reached after four
about the reliability of the GHQ-12 is still lacking, because it attempts, and 10 subjects (0.3%) refused participation
depends on the factorial structure. Nevertheless, Hankins because of severe health problems. All subjects were
[17] reported the impact of dimensionality and response bias visited by a study assistant, informed about the investi-
on reliability. As the calculation of reliability by alpha is gation, and self-rating questionnaires were presented. The
based on the assumption of no response bias and the best assistant waited until participants answered all question-
fitting model indicates response bias, the alpha may be over- naires and offered help if the meaning of questions was
estimated [17]. Finally, a comprehensive psychometric not clear. A total of 2,066 people between the ages of 14
evaluation needs statements about the validity of the and 93 years agreed to participate, completing the self-
instrument. A useful procedure is the inclusion of external rating questionnaires in November and December 2002
criteria to clarify what the GHQ-12 assesses. Most of the (participation rate: 66.5%). 25 subjects were excluded
psychometric studies lack this final step, and the designa- from the following analyses because of incomplete data.
tion of the different dimensions is based on a review of the Thus, a dataset of 2,041 people is included in this study.
related items. Table 1 gives an overview of the demographic character-
The two- and three-factor-models distinguish different istics of the sample.
domains like “depression” and “loss of confidence”. A
unidimensional solution refers to a more holistic concept 2.2. Instruments
like “mental distress” or “minor psychiatric morbidity”,
possibly measured by the GHQ-12. A reliable statement In our study the German Version of the 12-item General
about what the GHQ-12 assesses would be very useful Health Questionnaire (GHQ-12) [24] with a four-point
to guide the application of the instrument in research Likert-Scale (0-1-2-3) was used. Thus, the total score ranges
and practice. from 0–36, with higher scores representing higher levels of
Addressing the controversial debate about the dimen- mental distress. Supplementing the GHQ-12, several other
sionality, reliability and construct validity of the GHQ-12, instruments assessing mental distress and self-perceived
we [1] test and compare different dimensional models general health status were used in the study.
discussed in the previous literature, [2] quantifying psycho- The Beck Depression Inventory (BDI) [25,26] is one of
metric properties, and [3] studying associations of the GHQ- the most commonly used measures to assess the severity of
12 with depression, anxiety, social phobia, and self- depressive symptomatology. It is well established in research
perceived health status to determine what the GHQ-12 and clinical practice. Its reliability and validity were proven
assesses in a large scale German representative population in numerous studies [27].
sample. The primary aim of our study is to test psychometric To assess anxiety and depression, the Short form of the
properties of the GHQ-12 in the general population and to Patient Health Questionnaire (PHQ) [28,29] was used. This
408 M. Romppel et al. / Comprehensive Psychiatry 54 (2013) 406–413
Table 1
Demographic characteristics of the representative population sample.
Total (N = 2,041) Male (N = 959) Female (N = 1,082)
Age M 48.8 47.1 50.2
SD 18.1 17.5 18.5
Age groups 14–24 years 11.9% (242) 14.0% (134) 10.0% (108)
25–34 years 13.2% (269) 13.4% (128) 13.0% (141)
35–44 years 17.6% (360) 17.3% (166) 17.9% (194)
45–54 years 16.4% (334) 16.9% (162) 15.9% (172)
55–64 years 18.0% (368) 19.0% (182) 17.2% (186)
65–74 years 14.8% (302) 15.8% (151) 14.0% (151)
≥75 years 8.1% (166) 3.7 (36) 12.0% (130)
Urbanity Rural area 28.9% (590) 29.1% (279) 28.7% (311)
Urban area 71.1% (1,451) 70.9% (680) 71.3% (771)
Education No qualifications 2.0% (41) 1.2% (11) 2.8% (30)
Less than 10 years 46.3% (944) 45.0% (431) 47.4% (513)
10 years of education 35.3% (718) 36.5% (350) 34.0% (718)
More than 10 years 16.3% (338) 17.4% (167) 15.8% (171)
Net household income b 750 €/month 10.8% (211) 9.4% (87) 12.0% (124)
750 to 1249 €/month 28.4% (554) 25.2% (232) 31.2% (322)
1250 to 1999 €/month 36.2% (708) 38.6% (352) 34.2% (352)
≥2000 €/month 24.6% (480) 26.8% (247) 22.6% (233)
instrument for psychiatric case definition in primary care ally, it contains a single question that assesses change in
demonstrated good validity and reliability. Compared with health from 1 year ago.
structured clinical interviews, 98% sensitivity and 80%
specificity were shown [30]. Depressive symptoms were 2.3. Statistical analyses
screened with the PHQ-2 [31,32], a short form of the PHQ
depression subscale. This short depression screener con- The models described in the introduction section above
tains two items assessing anhedonia and depressed mood were tested via confirmatory factor analyses using Mplus
over the past two weeks, scoring from 0 (“not at all”) to 3 6.1 [36]. In a confirmatory factor analysis a theoretically
(“nearly every day”). The total score of the PHQ-2 ranges derived factor structure is defined, and the degree to which
from 0 to 6. Compared with structured clinical interviews, the covariance structure estimated by the model fits the
83% sensitivity and 93% specificity was shown [31]. The empirically observed covariance structure is assessed [37].
five items of the anxiety module of the PHQ assess panic Model 1 represents the three-dimensional conception of the
attacks. The first question assesses whether there was a GHQ-12, with three latent variables (social dysfunction,
panic attack within the last four weeks. If there was at least anxiety/depression, and loss of confidence) and six, four
one attack, four further items assess whether there were and two measured variables loading onto them. Since a
previous attacks, if the attacks are unexpected, impairing latent variable, that is represented using only two in-
and if there are typical physical symptoms like tachycardia dicators, is locally under-identified, an equality constraint
or dizziness. Response categories for these items are yes on the two loadings associated with the latent variable can
vs. no. Again, a total anxiety score ranging from 0 to 5 be placed, following the recommendation of Little and
was calculated. colleagues [38]. Model 2 depicts the two-factor model, with
The Social Phobia Inventory (SPIN) [33,34] contains 17 two latent variables (social dysfunction and anxiety/
items referring to fear, avoidance and physiological symp- depression) and six measured variables loading onto each.
toms of social phobia in the previous seven days. Response Model 3 also represents a three-dimensional conception,
categories for these items range from 0 (“not at all”) to 4 but with “cope”, “stress”, and “depression” as latent
(“extremely”). The total score ranges from 0–68. Good variables and four, three and five measured variables.
psychometric properties have been established [33,34]. Model 4 represents the one-dimensional conception of the
A popular measure of self-perceived general health GHQ, with all 12 items defined as indicators of a single
status is the Short Form 36 (SF-36) [35]. It consists of 36 factor. Finally, we tested the unidimensional model
items which cover eight domains: Physical functioning (10 described by Hankins [22] as Model 5. As described
items), Role Limitations – Physical (four items), Bodily above, in this model the GHQ-12 was modelled as a
Pain (two items), Social Functioning (two items), General measure of one construct but with correlated error terms on
Health (five items), Role Limitations – Emotional (three the negative formulated items, modelling response bias.
items), Vitality (four items), and Mental Health (five This model was therefore identical to model 4, but it
items). After transformation, each subscale ranges from 0 to contains correlations between the error terms on the
100; higher scores indicate better health status. Addition- negative items (items 4, 5, 6, 7, 8, 9).
M. Romppel et al. / Comprehensive Psychiatry 54 (2013) 406–413 409
In our analyses the maximum likelihood (ML) method of the Fornell-Larcker-ratio [40]. A Fornell-Larcker-ratio
estimation was used; error covariances were constrained to smaller than 1 indicates good discriminant validity. The
zero in all models except model 5, in order to avoid χ 2 value, the Bayesian Information Criterion (BIC), the
overfitting and capitalizing on chance associations in the comparative fit index (CFI), the Tucker Lewis index (TLI),
data [39]. In addition to the standardized factor loadings, the the root mean square error of approximation (RMSEA), and
construct reliability (the extent to which the indicators of a the standardized root mean square residual (SRMR) are
construct share common variance) and the average variance reported as fit indices. When comparing models, the model
extracted (the extent to which the variance in the indicators with the lowest BIC value is usually the one to be preferred.
is accounted for by the latent construct) were calculated. Although, strictly speaking, the comparison of BIC values
Values greater than .7 (reliability) and .5 (average variance provides only a ranking, a difference of 10 points or more
extracted) indicate good reliability and convergent validity. has been suggested as a nearly certain indicator of a
Discriminant validity (the extent to which a construct in the difference in fit between models [41]. Values larger than
model is distinct from another construct) was tested using 0.95 for TLI and CFI, and values smaller than 0.06 for
Table 2
Standardized factor loadings and goodness-of-fit statistics for five alternative confirmatory factor analysis models of the General Health Questionnaire (GHQ).
Model 1 Model 2 Model 3 Model 4 Model 5
Social Anxiety/ Loss of Social Anxiety/ Cope Stress Depression Global Global
Dysfunction Depression Confidence Dysfunction Depression
1. Able to concentrate .66 .65 .65 .70 .66
2. Capable of making .58 .58 .63 .70 .58
decisions
3. Face up to problems .61 .61 .60 .64 .61
4. Lost sleep over .66 .64 .73 .73 .60 a
worry
5. Constantly under .59 .56 .66 .66 .55 a
strain
6. Could not overcome .75 .74 .74 .81 .66 a
difficulties
7. Unhappy and .80 .81 .81 .85 .65 a
depressed
8. Loss of .88 .79 .79 .87 .56 a
self-confidence
9. Thinking of self .82 .75 .76 .84 .54 a
as worthless
10. Play useful part .52 .54 .52 .57 .52
in things
11. Enjoy day-to-day .64 .63 .59 .69 .63
activities
12. Reasonably happy .75 .74 .62 .78 .75
Construct reliability .79 .80 .84 .80 .86 .69 .70 .86 .89 .88
Average variance .40 .50 .72 .40 .52 .36 .44 .56 .41 .37
extracted
Fornell-Larcker-ratio 1.9 1.5 1.0 1.6 1.2 2.0 1.7 1.2 - -
RMSEA and smaller than 0.08 for SRMR are considered as 3.2. Convergent and discriminant validity
indicators of a good fit [42].
To analyse the convergent and discriminant validity of The average variance extracted is satisfactory only for
the GHQ-12, Pearson's correlations were computed to some of the factors. Due to the high intercorrelations
determine the relations between the scales of the GHQ-12 between the factors, all but one factor (“Loss of Confidence”
according to the best model identified and the BDI, the in model 1) miss the criterion for a good discriminant
PHQ, the SF-36, and the SPIN, measuring mental distress validity of the constructs, namely a Fornell-Larcker-ratio
and self-perceived health status. smaller than 1 (Table 2).
Means, standard deviations, item-total correlations, and Pearson's correlations between the scales of the GHQ and
response-probabilities were calculated as psychometric the scales of several other instruments measuring mental
properties of the items, Cronbach's α was calculated as a distress and self-perceived health status are presented in
measure of the internal consistency. Table 3. Because of the large sample size, the coefficients are
To predict the general GHQ-12-score, a total of 12 all significant at the level of p b .001.
measures (BDI, PHQ-2, PHQ-Anxiety, SPIN, subscales of As presented in Table 3, the highest correlations of
the SF-36) were entered as predictors in a stepwise linear the GHQ are found with the BDI and the SF-36-
regression analysis. Subscale “Mental Health”, followed by the PHQ-2 and
Statistical analyses were conducted with the SPSS 15.0 the SF-36-Subscales social functioning and vitality. The
statistical package and Mplus 6.1 [36]. other correlations show low ranges. From the perspective
of discriminant and convergent validity, the assumption
that only one global factor describes the GHQ-12 seems
to be justified.
3. Results
3.3. Psychometric properties of the GHQ
3.1. Confirmatory Factor Analysis (CFA)
Item and scale characteristics of the unidimensional
As shown in Table 2, all factor loadings are greater than GHQ-12-conception were evaluated on the basis of the
.50, with indicator reliabilities (squared factor loadings) total sample (N = 2.041). As shown in Table 4, item–total
ranging between .27 and.77. In all models item 10 (“Play correlations are in the upper range, and response–
useful part in things”) has the smallest factor loading. In all probabilities are in the medium range between p = 0.37
models the latent variables show good construct reliabilities and 0.52. Internal consistency as a measure of the
(range .69 to .89). When comparing the BIC values of the reliability of the scale can be considered to be very
models, the best model is model 5 followed by model 1, good (Cronbach's α = .89, α = .79, and α = .86 for the full
although all models demonstrate rather unsatisfactory fit scale, the positive worded items, and the negative worded
indices (Tucker-Lewis Index b .90 and root mean square error items, respectively).
of approximation N .08). The unidimensional model incorpo-
3.4. Predicting GHQ-score
rating response bias (model 5) reaches the best overall fit for
the data and a significantly better fit than model 4, the As shown in Table 5, the stepwise procedure results in a
unidimensional model with error covariances constrained to regression model with only six predictors successfully
zero (Δχ 2 = 1190.5, df = 15, p b .001). predicting the total score of the GHQ-12 with an accounted
Table 3
Table 4
Pearson correlations between the General Health Questionnaire (GHQ) and
Psychometric properties of the German adaptation of the General Health
other scales.
Questionnaire (GHQ).
Cronbach's Models 4 and 5
Items M SD rit p
alpha
Global-scale
1. Lost sleep over worry 0.95 0.50 0.44 0.49
BDI .93 .63 2. Constantly under strain 1.02 0.48 0.51 0.51
PHQ-Anxiety .65 .33 3. Able to concentrate 0.77 0.74 0.63 0.44
PHQ-2 (Depression) .78 .55 4. Play useful part in things 0.67 0.72 0.72 0.42
SF-36: Physical Functioning .93 -.26 5. Face up to problems 1.01 0.49 0.56 0.50
SF-36: Role Limitations (physical) .92 -.33 6. Capable of making decisions 0.79 0.73 0.56 0.45
SF-36: Bodily Pain .89 -.38 7. Could not overcome difficulties 0.67 0.65 0.59 0.42
SF-36: General Health. .79 -.43 8. Reasonably happy 0.46 0.63 0.64 0.37
SF-36: Vitality .81 -.52 9. Enjoy day-to-day activities 0.55 0.64 0.67 0.39
SF-36: Social Functioning .79 -.54 10. Unhappy and depressed 0.63 0.67 0.71 0.41
SF-36: Role Limitations (emotional) .92 -.43 11. Loss of self-confidence 1.05 0.53 0.64 0.51
SF-36: Mental Health .82 -.62 12. Thinking of self as worthless 1.10 0.48 0.55 0.53
SPIN .91 .28
M = mean; SD = standard deviation; rit = part-whole corrected item-total
All p's b .001. correlation (related to the total score); p = response probability.
M. Romppel et al. / Comprehensive Psychiatry 54 (2013) 406–413 411
[3] Vanheule S, Bogaerts S. Short communication: the factorial structure [24] Schmitz N, Kruse J, Tress W. Psychometric properties of the General
of the GHQ-12. Stress Health 2005;21(4):217-22. Health Questionnaire (GHQ-12) in a German primary care sample.
[4] Toyabe SI, Shioiri T, Kobayashi K, Kuwabara H, Koizumi M, Endo Acta Psychiatr Scand 1999;100(6):462-8.
T, et al. Factor structure of the General Health Questionnaire (GHQ- [25] Hautzinger M, Bailer M, Worall H, Keller F. Beck-Depressions-
12) in subjects who had suffered from the 2004 Niigata-Chuetsu Inventar2nd ed. . Goettingen: Hogrefe; 1995.
earthquake in Japan: a community-based study. BMC Public Health [26] Beck AT, Ward CH, Mendelson M, Mock J, Erbaugh J. An inventory
2007;7:175. for measuring depression. Arch Gen Psychiatry 1961;4:561-71.
[5] Schmitz N, Kruse J, Tress W. Improving screening for mental [27] Richter P, Werner J, Bastine R. Psychometrische Eigenschaften des
disorders in the primary care setting by combining the GHQ-12 and Beck-Depression Inventars (BDI). Eine Überblick. Zeitschrift für
SCL-90-R subscales. Compr Psychiatry 2001;42(2):166-73. Klinische Psychologie 1994;23:3-19.
[6] Politi PL, Piccinelli M, Wilkinson G. Reliability, validity and factor [28] Spitzer RL, Kroenke K, Williams JBW. Validation and utility of a self-
structure of the 12-Item General Health Questionnaire Among Young report version of PRIME-MD – the PHQ primary care study. JAMA
Males in Italy. Acta Psychiatr Scand 1994;90(6):432-7. 1999;282(18):1737-44.
[7] Picardi A, Abeni D, Pasquini P. Assessing psychological distress [29] Löwe B, Spitzer RL, Zipfel S, Herzog W. Gesundheitsfragebogen für
in patients with skin diseases: reliability, validity and factor Patienten (PHQ-D). Maunal und Testunterlagen. Karlsruhe: Pfizer;
structure of the GHQ-12. J Eur Acad Dermatol Venereol 2001; 2002.
15(5):410-7. [30] Lowe B, Spitzer RL, Grafe K, Kroenke K, Quenter A, Zipfel S, et al.
[8] Kalliath TJ, O'Driscoll MP, Brough P. A confirmatory factor analysis Comparative validity of three screening questionnaires for DSM-IV
of the General Health Questionnaire-12. Stress Health 2004;20(1): depressive disorders and physicians' diagnoses. J Affect Disord
11-20. 2004;78(2):131-40.
[9] Gureje O. Reliability and the factor structure of the Yoruba Version of [31] Kroenke K, Spitzer RL, Williams JBW. The Patient Health
the 12-Item General Health Questionnaire. Acta Psychiatr Scand Questionnaire-2 – Validity of a two-item depression screener. Med
1991;84(2):125-9. Care 2003;41(11):1284-92.
[10] Campbell A, Walker J, Farrell G. Confirmatory factor analysis of the [32] Lowe B, Kroenke K, Grafe K. Detecting and monitoring depression
GHQ-12: can I see that again? Aust N Z J Psychiatry 2003;37(4): with a two-item questionnaire (PHQ-2). J Psychosom Res 2005;58(2):
475-83. 163-71.
[11] French DJ, Tait RJ. Measurement invariance in the General Health [33] Sosic Z, Gieler U, Stangier U. Screening for social phobia in medical
Questionnaire-12 in young Australian adolescents. Eur Child Adolesc in- and outpatients with the German version of the Social Phobia
Psychiatry 2004;13(1):1-7. Inventory (SPIN). J Anxiety Disord 2008;22(5):849-59.
[12] Shevlin M, Adamson G. Alternative factor models and factorial [34] Connor KM, Davidson JRT, Churchill LE, Sherwood A, Foa E,
invariance of the GHQ-12: A large sample analysis using confirmatory Weisler RH. Psychometric properties of the Social Phobia
factor analysis. Psychol Assess 2005;17(2):231-6. Inventory (SPIN) – new self-rating scale. Br J Psychiatry 2000;
[13] Cheung YB. A confirmatory factor analysis of the 12-item General 176:379-86.
Health Questionnaire among older people. Int J Geriatr Psychiatry [35] Ware JE, Sherbourne CD. The Mos 36-Item Short-Form Health Survey
2002;17(8):739-44. (Sf-36).1. Conceptual-framework and item selection. Med Care
[14] Graetz B. Multidimensional properties of the General Health 1992;30(6):473-83.
Questionnaire. Soc Psychiatry Psychiatr Epidemiol 1991;26(3): [36] Muthén LK, Muthén BO. Mplus User's Guide. 6th ed. Los Angeles:
132-8. Muthén & Muthén; 2010.
[15] Makikangas A, Feldt T, Kinnunen U, Tolvanen A, Kinnunen ML, [37] Brown TE. Confirmatory factor analysis for applied research. New
Pulkkinen L. The factor structure and factorial invariance of the 12- York: Guilford University Press; 2006.
item General Health Questionnaire (GHQ-12) across time: evidence [38] Little TD, Lindenberger U, Nesselroade JR. On selecting indicators for
from two community-based samples. Psychol Assess 2006;18(4): multivariate measurement and modeling with latent variables: When
444-51. “good” indicators are bad and “bad” indicators are good. Psychol
[16] Campbell A, Knowles S. A confirmatory factor analysis of the Methods 1999;4(2):192-211.
GHQ12 using a large Australian sample. Eur J Psychol Assess 2007; [39] Maccallum RC, Roznowski M, Necowitz LB. Model modifications in
23(1):2-8. covariance structure-analysis – the problem of capitalization on
[17] Hankins M. The reliability of the twelve-item general health chance. Psychol Bull 1992;111(3):490-504.
questionnaire (GHQ-12) under realistic assumptions. BMC Public [40] Fornell C, Larcker DF. Evaluating structural equation models with
Health 2008;8:355. unobservable variables and measurement error. J Mark Res 1981;
[18] Ye SQ. Factor structure of the General Health Questionnaire (GHQ- 18(1):39-50.
12): The role of wording effects. Pers Individ Diff 2009;46(2): [41] Raftery AE. Bayesian model selection in social research. Sociological
197-201. Methodol 1995;25:111-63.
[19] Marsh HW. Positive and negative global self-esteem: a substantively [42] Hu YJ, Bentler PM. Cutoff criteria for fit indexes in covariance
meaningful distinction or artifactors? J Pers Soc Psychol 1996;70(4): structure analysis: Conventional criteria versus new alternatives. Struct
810-9. Equation Model 1999;6(1):1-55.
[20] Mook J, Kleijn WC, Vanderploeg HM. Positively and negatively [43] Wang WC, Cunningham EG. Comparison of alternative estimation
worded items in a self-report measure of dispositional optimism. methods in confirmatory factor analyses of the general health
Psychol Rep 1992;71(1):275-8. questionnaire. Psychol Rep 2005;97(1):3-10.
[21] Roth M, Decker O, Herzberg PY, Brahler E. Dimensionality and norms [44] Aguado J, Campbell A, Ascaso C, Navarro P, Garcia-Esteve L,
of the Rosenberg Self-esteem scale in a German general population Luciano JV. Examining the factor structure and discriminant validity of
sample. Eur J Psychol Assess 2008;24(3):190-7. the 12-item General Health Questionnaire among Spanish postpartum
[22] Hankins M. The factor structure of the twelve item General Health women. Assessment 2012;19(4):517-25.
Questionnaire (GHQ-12): results of negative phrasing? Clin Pract [45] Llabre MM, Fitzpatrick SL. Revisiting measurement models in
Epidemiol Ment Health 2008;4:10. psychosomatic medicine research: a latent variable approach. Psycho-
[23] Wang L, Lin WP. Wording effects and the dimensionality of the som Med 2012;74(2):169-77.
General Health Questionnaire (GHQ-12). Pers Individ Diff 2011; [46] Chen FF, Hayes A, Carver CS, Laurenceau JP, Zhang ZG. Modeling
50(7):1056-61. general and specific variance in multifaceted constructs: a
M. Romppel et al. / Comprehensive Psychiatry 54 (2013) 406–413 413
comparison of the bifactor model to other approaches. J Pers 2012; [49] Goldberg DP, Oldehinkel T, Ormel J. Why GHQ threshold varies from
80(1):219-51. one place to another. Psychol Med 1998;28(4):915-21.
[47] Chen FF, West SG, Sousa KH. A comparison of bifactor and second- [50] Donath S. The validity of the 12-item General Health Questionnaire in
order models of quality of life. Multivariate Behav Res 2006;41(2): Australia: a comparison between three scoring methods. Aust N Z J
189-225. Psychiatry 2001;35(2):231-5.
[48] Kulenovic M, Kusevic Z, Grba S. Factor analysis of the General Health [51] Clarke DM, Smith GC, Herrman HE. A comparative-study of
Questionnaire (GHQ 12) on the sample of the unemployed and screening instruments for mental-disorders in general-hospital patients.
students. Coll Antropol 1995;19(2):407-11. Int J Psychiatry Med 1993;23(4):323-37.