Model schematics. A, Schematic of the multivariate observed variable model. Only 3 observed lifetime time t internalizing disorders and externalizing disorders along with only 1 observed internalizing and 1 observed externalizing disorder at time t + 1 are shown to simplify the presentation, but there were 10 observed lifetime internalizing and 8 observed externalizing disorders in the actual survival model at each time point. First onset of each of these 18 disorders between times t and t + 1 was predicted by lifetime history of the other 17 disorders as of time t. Estimation was made in 18 separate survival equations, each with 17 predictors for history of the other disorders, for a total of 306 (18 × 17) pairwise time-lagged associations between earlier and later mental disorders. The 17 predictor disorders were treated as time-varying covariates in a discrete-time (person-year) survival framework. Controls were also included for respondent age at interview, sex, person-year, and country. B, Schematic of the multivariate latent variable model. Only 3 observed lifetime time t internalizing disorders and externalizing disorders and only 3 disorders of each set at time t + 1 are shown to simplify the presentation, but there were 10 observed lifetime internalizing and 8 observed externalizing disorders in the actual survival model. First onset of each of these 18 disorders between times t and t + 1 was predicted by latent internalizing or latent externalizing variables at time t + 1. These latent variables, in turn, were predicted by lifetime history of latent internalizing and externalizing variables as of time t. These time t latent variables, finally, were predicted by lifetime history of observed internalizing or externalizing variables as of time t. Estimation was performed using a 3-part iterative procedure. A total of 36 independent associations were estimated, 270 fewer than in the model for associations among observed disorders. As in the earlier observed variable model, the predictor disorders were treated as time-varying covariates in a discrete-time (person-year) survival framework, and controls were included for respondent age at interview, sex, person-year, and country. e indicates externalizing; i, internalizing. The Greek letters δ and ε represent measurement errors in observed i and e variables, respectively. Lowercase letters signify measured variables; and uppercase, latent variables.
Kessler RC, Ormel J, Petukhova M, McLaughlin KA, Green JG, Russo LJ, Stein DJ, Zaslavsky AM, Aguilar-Gaxiola S, Alonso J, Andrade L, Benjet C, de Girolamo G, de Graaf R, Demyttenaere K, Fayyad J, Haro JM, Hu CY, Karam A, Lee S, Lepine J, Matchsinger H, Mihaescu-Pintia C, Posada-Villa J, Sagar R, Üstün TB. Development of Lifetime Comorbidity in the World Health Organization World Mental Health Surveys. Arch Gen Psychiatry. 2011;68(1):90-100. doi:10.1001/archgenpsychiatry.2010.180
Although numerous studies have examined the role of latent variables in the structure of comorbidity among mental disorders, none has examined their role in the development of comorbidity.
To study the role of latent variables in the development of comorbidity among 18 lifetime DSM-IV disorders in the World Health Organization World Mental Health Surveys.
Nationally or regionally representative community surveys.
A total of 21 229 survey respondents.
Main Outcome Measures
First onset of 18 lifetime DSM-IV anxiety, mood, behavior, and substance disorders assessed retrospectively in the World Health Organization Composite International Diagnostic Interview.
Separate internalizing (anxiety and mood disorders) and externalizing (behavior and substance disorders) factors were found in exploratory factor analysis of lifetime disorders. Consistently significant positive time-lagged associations were found in survival analyses for virtually all temporally primary lifetime disorders predicting subsequent onset of other disorders. Within-domain (ie, internalizing or externalizing) associations were generally stronger than between-domain associations. Most time-lagged associations were explained by a model that assumed the existence of mediating latent internalizing and externalizing variables. Specific phobia and obsessive-compulsive disorder (internalizing) and hyperactivity and oppositional defiant disorders (externalizing) were the most important predictors. A small number of residual associations remained significant after controlling the latent variables.
The good fit of the latent variable model suggests that common causal pathways account for most of the comorbidity among the disorders considered herein. These common pathways should be the focus of future research on the development of comorbidity, although several important pairwise associations that cannot be accounted for by latent variables also exist that warrant further focused study.
Comorbidity is the norm among common mental disorders because more than 50% of people with a mental disorder in a given year meet criteria for multiple disorders.1,2 The structure of this comorbidity has been the subject of considerable interest. Beginning with an influential article by Krueger,3 numerous researchers have documented that bivariate associations among hierarchy-free anxiety, mood, behavior, and substance disorders can be accounted for by correlated latent predispositions to internalizing and externalizing disorders, with division of internalizing disorders into secondary dimensions of fear (eg, panic and phobia) and distress (eg, major depressive episode and generalized anxiety disorder).4- 9
These results have been used to argue for a reorganization of the classification of mental disorders in the DSM and International Classification of Disease diagnostic systems.10- 13 However, additional work is needed to evaluate the empirical support for such a reorganization because the framework has only recently been expanded to include additional forms of psychopathology, such as psychotic experiences.14 Moreover, additional research on the stability of the structure across sociodemographic variables (eg, age, sex, and education) would be informative.
These results have also been used occasionally to investigate whether risk factors for individual disorders are more accurately conceptualized as risk factors for the latent dimensions underlying these disorders. Kramer and colleagues,15 for example, found that observed sex differences in several internalizing and externalizing disorders became statistically insignificant when controls were included for latent internalizing-externalizing dimensions. Such evidence can be valuable in distinguishing between specific and nonspecific risk factors. The use of latent variable models in this way is only in its infancy. One obvious application is to the development of comorbidity itself. In particular, although the cross-sectional structure of comorbidity has been examined in a number of studies, we are unaware of attempts to investigate the role of latent dimensions in accounting for the development of comorbidity.
Although several studies used longitudinal data to determine whether the structure of internalizing and externalizing disorders is stable over time,9,16,17 none investigated whether this structure accounts for the associations between temporally primary disorders and the subsequent first onset of comorbid disorders. A number of other longitudinal studies examined temporal progression18,19 or sequencing20- 23 between earlier and later mental disorders, documenting strong persistence of individual disorders over time and significant predictive associations between some but not other temporally primary and later disorders. For example, Fergusson and colleagues18 found that childhood conduct disorder but not attention-deficit/hyperactivity disorder (ADHD) predicted subsequent substance disorders. None of these studies, however, investigated whether associations of earlier disorders with onset of later disorders were mediated by latent variables.
Analysis of the latter sort could be useful in identifying potentially modifiable risk pathways by focusing attention on subsets of disorders with especially strong predictive associations that could subsequently be examined in more focused analyses.24,25 For example, clinical studies finding childhood impulse-control problems in a subset of patients with early-onset obsessive-compulsive disorder (OCD)26,27 and finding that impulse-control disorders continue to feature prominently in some cases of adult OCD28,29 have created interest in the importance of inhibitory dyscontrol in the pathogenesis of OCD.30 However, the role of putative neurobiological markers of such dyscontrol in accounting for the associations of impulse-control disorders with subsequent OCD remains unstudied. The documentation in epidemiological data of special associations between a cluster of early-onset impulse-control disorders and subsequent OCD could help spur such research by suggesting that more focused prospective neurobiological studies of this cluster beginning in childhood might yield valuable information about an important OCD subtype.
The present report proposes a novel approach to investigate the role of latent variables in the development of comorbidity. We begin with a conventional survival analysis of epidemiological data collected in 14 countries in the World Health Organization (WHO) World Mental Health (WMH) Survey Initiative31 to study associations between earlier lifetime disorders and the subsequent first onset of later disorders. We then elaborate these survival models using a new latent modeling approach to examine the extent to which the associations among observed disorders can be accounted for by the mediating effects of latent internalizing and externalizing variables.
The 14 WMH countries include 7 classified by the World Bank as developed (Belgium, France, Germany, Italy, the Netherlands, Spain, and the United States) and 7 classified as developing (Brazil, Colombia, India, Lebanon, Mexico, the Peoples' Republic of China, and Romania) (Table 1). Surveys were administered in multistage, clustered area probability household samples representative of specific regions within countries (Brazil, Colombia, India, Mexico, and the Peoples' Republic of China) or entire countries (the remaining countries). Respondents were interviewed face-to-face by trained lay interviewers who explained the purposes of the survey, made clear that participation was voluntary and that responses would be treated as confidential, and obtained informed consent before beginning interviews. These recruitment and consent procedures were approved by the local human subjects committee that monitored the study in each country. A total of 61 292 respondents were interviewed. Country-specific response rates ranged from 45.9% (France) to 98.6% (India). The weighted (by sample size) average response rate was 71.1%.
The interview was divided into 2 parts. Part 1 assessed core disorders and was completed by all respondents. Part 2 assessed additional disorders and numerous correlates and was completed by 100% of respondents who met criteria for any part 1 disorder plus a probability subsample of other part 1 respondents. Based on a concern about recall bias, disorders defined as beginning in childhood (ADHD, conduct disorder, oppositional defiant disorder [ODD], and separation anxiety disorder) were assessed only among respondents aged 18 to 44 years. This part 2 subsample, which ranges in size from 486 respondents in Belgium to 6218 in the Peoples' Republic of China and totals 21 229 respondents across countries, is the sample used in the present report. The part 1 samples were weighted to adjust for differential probabilities of selection and residual discrepancies between sample and census on sociodemographic and geographic variables. In addition, the part 2 samples were weighted to adjust for undersampling of part 1 respondents without part 1 disorders. A more detailed discussion of WMH sampling and weighting is presented elsewhere.32
Diagnoses were based on version 3.0 of the WHO Composite International Diagnostic Interview (CIDI),33 a fully structured, lay-administered interview that generates diagnoses according to ICD-10 and DSM-IV criteria. The DSM-IV criteria are used herein. Translation and back-translation followed standard WHO procedures.34 The 7-day interviewer training program was standardized across countries. Training culminated in an examination that had to be passed before the interviewer could begin production data collection. A more detailed discussion of WMH training and quality control is presented elsewhere.35
The 18 lifetime diagnoses include mood disorders (bipolar I-II or subthreshold disorder [BPD] and major depressive episode/dysthymia), anxiety disorders (agoraphobia with or without panic disorder, generalized anxiety disorder [GAD], OCD, panic disorder with or without agoraphobia, posttraumatic stress
disorder [PTSD], separation anxiety disorder, social phobia, and specific phobia), behavior disorders (attention-deficit disorder [ADD], hyperactivity disorder [HD], conduct disorder with covert symptoms [CD1; eg, lying, shoplifting], conduct disorder with overt symptoms [CD2; eg, bullying, being physically cruel to people], intermittent explosive disorder [IED], and ODD), and substance disorders (alcohol and other drug abuse with or without dependence).
As detailed elsewhere,36 blinded clinical reappraisal interviews found generally good concordance between DSM-IV diagnoses based on the CIDI and those based on the Structured Clinical Interview for DSM-IV Axis I Disorders.37 Organic exclusions but not diagnostic hierarchy rules were used in making diagnoses. The CIDI included retrospective disorder age-at-onset reports based on a special question sequence that has been shown experimentally to improve recall accuracy.38 Respondents were asked to date their age when they first had the full syndrome for each disorder, not the first symptom of the disorder.
Exploratory principal axis tetrachoric factor analysis with promax rotation was used to examine bivariate comorbidity. Clear internalizing (10 disorders) and externalizing (8 disorders) dimensions were found but no evidence of a third factor that distinguished fear from distress disorders. Discrete-time survival analysis39 with person-year as the unit of analysis and a logistic link function40 was then used to study associations of temporally primary lifetime disorders with subsequent first onset of later disorders. Each model predicted first onset of 1 of the 18 DSM-IV /CIDI disorders from information about prior lifetime occurrence of the other 17 disorders (18 × 17 = 306 pairwise associations), which were treated as time-varying covariates, controlling for respondent age, sex, and country. Retrospective age-at-onset reports were used to define the predictor disorders as time varying and to define age at onset of the outcome disorders.
We then estimated a latent variable model that constrained the coefficients in the observed variable models to be mediated by hypothesized continuous time-varying latent internalizing and externalizing variables. The coefficients in the observed variable model were constrained in the sense that this model included 306 coefficients (Figure, A), whereas the latent variable model used only 36 independent coefficients to reproduce these same associations (Figure, B). These included 10 coefficients (1 of which was nonindependent because the 10 coefficients together perfectly predict the time t latent variable) for the time t lifetime internalizing disorders predicting the time t latent internalizing variable; 8 coefficients (1 of which was nonindependent because the 8 coefficients together perfectly predict the time t latent variable) for the time t lifetime externalizing disorders predicting the time t latent externalizing variable; 4 coefficients (2 of which were nonindependent because the pair of time t latent variables perfectly predict each of the 2 time t + 1 latent variables) for the time t latent internalizing and externalizing variables predicting the time t + 1 latent internalizing and externalizing variables; 10 coefficients for the time t + 1 latent internalizing variable predicting first onsets of the 10 time t + 1 internalizing disorders; and 8 coefficients for the time t + 1 latent externalizing variable predicting first onsets of the 8 time t + 1 externalizing disorders.
In interpreting the latent variable results, it is useful to note that the latent variables are actually weighted (by odds ratios [ORs] of disorders predicting latent variables) composites of all predictor disorders. The assumption that a single weighted composite can represent the effects of all predictor disorders is equivalent to assuming that the ratios of the ORs across predictors are constant across outcomes. These constraints are the key features of the model. The standard covariance structure analysis programs used in previous studies of the structure of comorbidity could not be used to impose these constraints because discrete-time survival analysis is based on a person-year data array that varies in size across the outcomes. An iterative maximum-likelihood method implemented in a SAS macro was consequently written to estimate the coefficients.41 This procedure sequentially estimated the
coefficients in the 3 parts of the model (time t observed variables predicting time t latent variables, time t latent variables predicting time t + 1 latent variables, and time t + 1 latent variables predicting time t + 1 observed variables), each time holding constant the coefficients in the other 2 parts of the model to their values in the previous iteration, until estimates converged.
Once estimated, a likelihood-ratio χ2 test was used to compare the fit of the latent variable model and the observed variable model with 270 (306 − 36) degrees of freedom. As described in the “Results” section, the latent variable model provided a better fit. We also investigated whether any of the 306 pairwise associations remained significant after controlling for the latent variables. Simple pairwise tests were inappropriate here because separate .05-level tests would generate more than a dozen false-positive results of 306 tests. The Bonferroni method and its extensions deal with this problem,42 but these are low-power tests that make it difficult to detect all but the largest true associations.43 We consequently used an internal subsampling strategy to pinpoint pairwise associations for
further investigation by estimating observed variable survival models controlling for the latent variables in subsets of person-years. Rather than use random subsamples, which would simply have produced patterns determined by the rules of random sampling, we focused on meaningfully different subsamples defined by 4 independent life course stages (childhood, ages 4-12 years; adolescence, ages 13-19 years; young adulthood, ages 20-29 years; and middle adulthood, ages 30-44 years) and considered a given pairwise association substantively significant only if it was statistically significant at the .05 level in the total sample and in at least 2 independent subsamples with consistent sign patterns and ORs (exponentiated survival coefficients) greater than or equal to 2.0 or less than or equal to 0.5. Because the WMH data are clustered and weighted, the design-based method of jackknife repeated replications44 was used to calculate standard errors and assess statistical significance. A SAS macro was used for this purpose.41
Exploratory factor analysis of the 18 lifetime DSM-IV /CIDI disorders was performed separately in developed and developing countries. Only 2 meaningful factors were found, with unrotated eigenvalues of 8.0 and 1.8 (in developed countries) and 6.6 and 2.3 (in developing countries). Promax-rotated factor loadings (standardized partial regression coefficients) showed that the factors correspond to internalizing and externalizing dimensions (Table 2). All the mood and anxiety disorders other than BPD loaded clearly on the internalizing factor (0.43-0.87 standardized regression coefficients). All the behavior and substance disorders other than IED loaded clearly on the externalizing factor (0.51-0.94 standardized regression coefficients). Bipolar disorder cross-loaded in developed countries (0.31-0.37) and IED in developing countries (0.30-0.48). However, country-level analyses found that BPD generally loaded more strongly on the internalizing factor and IED on the externalizing factor, leading us to classify them with these dimensions in further analyses. (Detailed results are not reported herein, but are available from the authors on request.) Separate factor analysis of only the internalizing disorders found no reliable secondary distinction between fear and other distress disorders (unrotated eigenvalues of 4.8 and 1.0 in developed countries and 4.3 and 1.3 in developing countries).
As noted, we estimated 306 bivariate survival equations, each with the first onset of 1 disorder predicted by the prior
occurrence of 1 of the other 17 disorders. Of the 306 survival coefficients, 98.0% were positive and 95.1% were also significant (Table 3). (Detailed results are not reported herein, but are available from the authors on request.) The median (interquartile range) ORs were 3.4 (2.7-4.3). None of the negative ORs was significant. Within-domain ORs were generally larger than between-domain ORs, with within-domain median ORs of 3.6 to 5.5 compared with between-domain medians of 3.0 to 3.2.
This largely positive pattern persisted in attenuated form in multivariate models, with 80.0% of ORs positive and 43.0% positive and statistically significant (Table 3). The median (1.4) and interquartile range (1.1-1.9) of ORs were considerably lower than in bivariate models because of strong intercorrelations among predictor disorders. Nine of the 306 multivariate ORs were negative and significant (GAD predicting agoraphobia and specific phobia; HD, PTSD, and CD1 predicting OCD; CD2 predicting GAD and PTSD; and alcohol abuse predicting ADD and HD). Of the within-domain ORs, 50.9% to 58.9% were positive and significant compared with 28.8% to 33.8% of between-domain ORs. The median OR was higher within (1.6) than between (1.3) domains.
The latent variable model fit the observed data better than the observed variable model, as indicated by a lower Bayesian information criterion45 of 7514.3 (latent) vs 7530.0 (observed) and an insignificant improvement in likelihood-ratio χ2 of the observed variable model (χ2270 = 107.8; P = .99). Latent variable coefficients were quite stable, as indicated by the Pearson correlations of model coefficients across 4 samples (the total sample and the 3 subsamples of all developed countries, all developing countries, and all countries excluding the 4 with survey response rates of less than 60% [Belgium, France, Germany, and the Netherlands]) of 0.88 to 0.97. (Detailed results are not reported herein, but are available from the authors on request.)
As noted, the latent variables are actually weighted (by ORs of disorders predicting latent variables) composites of all predictor disorders. The assumption that a single weighted composite can represent the effects of all the predictor disorders on all outcomes is equivalent to assuming that the ratios of the ORs across predictors are constant across outcomes. The good fit of the latent variable model shows that this assumption is generally consistent with the data, which means that the predictive effects of these disorders on each other can plausibly be assumed to be mediated by common internalizing and externalizing pathways.
The metric of the time t latent variables was set by fixing the slope of the latent variable on the strongest time t predictor to 1.0 and scaling other slopes relative to that value (Table 4). Within the internalizing domain, specific phobia was by far the most powerful predictor (1.00 by definition), followed by OCD (0.62) and other phobias (0.46-0.48) (Table 4). At the other extreme, GAD and panic disorder were insignificant predictors. The remaining internalizing disorders had ORs of intermediate strength (0.18-0.44). Within the externalizing domain, HD (1.00) and ODD (0.97) were the most powerful predictors. Alcohol and other drug abuse were insignificant, and the remaining externalizing disorders had ORs in the range of 0.43 to 0.77.
The ORs for the disorders as outcomes were much more consistent than for the disorders as predictors, with ranges of 0.68 to 1.00 (internalizing) and 0.44 to 1.00
(externalizing). Agoraphobia and social phobia were the most strongly predicted internalizing disorders, whereas ADD and HD were the most strongly predicted externalizing disorders. The relative importance of internalizing and externalizing disorders predicting each other was estimated in the set of 4 ORs between the latent variables at times t and t + 1 (Table 5). Weighted (by relative prevalence of disorders) within-domain ORs (1.6 for internalizing; 1.4 for externalizing) were higher than between-domain ORs (1.3 for time t internalizing predicting time t + 1 externalizing; 1.1 for time t externalizing predicting time t + 1 internalizing), but between-domain ORs were nonetheless statistically significant.
Only 13 of the 306 residual pairwise time-lagged associations between observed disorders passed our test of statistical significance. Nine of these were positive. Four involved within-disorder reciprocal ORs between CD1 and CD2 (3.2-4.8) and between ADD and HD (4.0-19.4). Two others involved asymmetrical associations between well-known disorder pairs (panic predicting agoraphobia [2.0-2.22] and depression predicting GAD [2.0-6.0], although the latter association was limited to child-adolescent onset). Two others might reflect diagnostic confusions, with agoraphobia predicting specific phobia (2.2-4.8) and HD (but, importantly, not ADD) predicting BPD (1.9-3.9). The final significant positive association was for IED predicting OCD (1.5-4.0). The 4 significant negative residual associations included PTSD predicting OCD (0.4-0.7), CD2 predicting BPD (0.2-0.5) and PTSD (0.2-0.4), and IED (0.5-0.6) predicting drug abuse.
Six limitations of this study are noteworthy. First, diagnoses were based on fully structured lay interviews. These typically produce more reliable (ie, reproducible across multiple interviewers) diagnoses than semistructured clinical interviews,46 and their prevalence estimates typically correspond well with those based on clinical interviews.47 However, fully structured interviews, unlike semistructured clinical interviews, are unable to clarify symptom responses or check questions across disorders to facilitate differential diagnosis, potentially leading to inflated estimates of comorbidity.
Second, disorders were assessed dichotomously rather than dimensionally, presumably reducing our ability to detect subtle aspects of structure in the data. This might help explain why we did not detect higher-order subfactors in the 2-factor exploratory factor analysis model.
Third, data were combined across countries with very different cultures and across surveys with very different response rates (which could have introduced variation in sample selection bias), different rates of sample exclusion (due to cross-national differences in rates of suicide, homelessness, and institutionalization), and different languages in which interviews were administered. Although every effort was made to make the translations as comparable as possible,34 residual variation in meaning almost certainly contributed to cross-national variation in results.
Fourth, lifetime diagnoses were based on retrospective reports rather than prospective assessments, probably leading to recall bias that underestimated prevalence48 and distorted age-at-onset estimates49 despite the use of special memory-priming methods.38 Bias in model coefficients might have varied across disorders as a function of age at onset and/or recency. Given the importance of this potential bias, it is noteworthy that the onset distributions based on these retrospective data are quite consistent with those based on prospective studies and studies performed at separate life course stages.50 In addition, model coefficients in subsamples defined by life course stage are very consistent, suggesting that variation in recall across the sample age range does not influence results in any important way.
Fifth, models were based on the simplifying assumptions that the time-lagged associations among mental disorders are constant across countries and sex, stable across the life course, and unrelated to age at onset or time since onset of the predictor disorders. Preliminary analyses showed that these assumptions are a reasonable first approximation, but the investigation of these specifications needs to be a focus of ongoing WMH analysis.
Sixth, we did not take history of treatment into consideration even though early treatment, which varies in frequency across countries,51 might interrupt the progression of comorbidity and thereby distort estimates of predictive associations.
In the context of these limitations, our finding of a 2-factor internalizing-externalizing structure among WMH disorders is consistent with previous research4- 9 but does not support the distinction in some previous studies between distress (eg, depression, GAD, and PTSD) and fear (eg, panic and phobias) disorders. As noted in the introduction, others also failed to find a distinction between distress and fear disorders.16,52 This less-differentiated structure in the WMH data might be due to our focus on lifetime disorders, whereas 12-month disorders were the focus of most studies that distinguished distress and fear disorders.
Our finding of significant time-lagged associations across virtually all pairs of the disorders considered herein is broadly consistent with evidence of associations between earlier and later disorders in previous longitudinal studies,18- 23 although most previous studies focused on prevalent cases, whereas we studied first onsets. We found, again consistent with previous studies, stronger and more consistent time-lagged associations within than between the internalizing and externalizing domains. However, again as in previous studies, we also found significant between-domain time-lagged associations.22,23
Our analysis went beyond previous studies to investigate the role of latent variables in the development of comorbidity. We showed that most of the 306 pairwise time-lagged associations among the 18 disorders considered herein can be explained by a model that assumes the existence of mediating latent internalizing and externalizing variables. This finding extends previous cross-cultural work on the structure of comorbidity.53 The fact that the predictive associations across this large number of disorders are mediated by 2 higher-order variables makes the internalizing and externalizing spectrum dimensions compelling targets for inquiry aimed at reducing the burden of mental
disorder around the world by interrupting the processes leading to the onset of comorbidity.
Specific phobia and OCD stood out as the most important internalizing predictors and HD and ODD as the most important externalizing predictors. The time-lagged associations involving these 4 predictors were largely mediated by the latent variables, with only 2 of the 13 significant residual associations involving these 4 strongest predictors (HD predicting ADD and BPD). Both of these were positive, showing that the comparatively high ORs of these 4 predictor disorders are relatively constant across the range of WMH outcomes. Although it is unclear why specific phobia and OCD should be the most important predictors among the internalizing disorders or HD and ODD among the externalizing disorders, the fact that all 4 are typically early-onset disorders50 means they might be useful markers of youth at high risk for progression to later disorders. Even this possibility requires further analysis, however, because we did not investigate nonproportional hazards that might include differential predictive associations related to age at onset or time since onset. These more in-depth analyses go beyond the scope of this first report but will be pursued in ongoing WMH analyses.
Our finding of 13 significant residual associations shows that the latent variables do not explain all the comorbidity among the disorders considered. The 4 negative residual associations are most plausibly interpreted as suggesting the existence of more differentiated dimensions underlying internalizing and externalizing disorders. The negative association of IED with subsequent drug abuse, for example, could be due to externalizing disorders being made up of multiple dimensions, 1 or more of which is significantly more strongly related to drug abuse than to IED. Evidence consistent with this possibility exists in the literature.54- 56 The negative residual association of CD2 (overt CD) with subsequent PTSD, in comparison, might be related to the findings that blunted psychophysiological and emotional reactivity to fear stimuli are predictors of CD,57,58 whereas heightened physiological reactivity to trauma-related stress cues is a predictor of PTSD.59,60 These observations suggest that a more differentiated latent variable model that includes internalizing and externalizing subdimensions might explain the significant negative associations found herein in the less differentiated WMH latent variable model.
The positive residual associations in the WMH data, in comparison, are most plausibly interpreted as disorder subtypes rather than comorbidities, including the reciprocal associations between CD1 (covert CD) and CD2 (overt CD) and between ADD and HD, or as a severity marker in the association between panic disorder and subsequent agoraphobia. At least 1 other association, between agoraphobia and subsequent specific phobia, could be due to diagnostic confusion in the CIDI. The same kind of confusion might account at least in part for the positive association between HD and subsequent BPD because differentiation between these 2 disorders can be difficult, especially within the constraints of a fully structured diagnostic interview,61,62 although a number of common neurobiological correlates have also been found for HD and BPD,63,64 arguing that ADHD might be a risk marker of BPD. The WMH finding that this association is specific to HD and does not apply to ADD has not, to our knowledge, been investigated previously.
It is important to caution that these few unique, significant residual pairwise associations should be treated as no more than preliminary owing to the problem of multiple testing and the limitations noted at the beginning of this section. Replication in other data sets, most importantly prospective data sets, is needed before these associations should be considered reliable. Furthermore, even if they are subsequently found to be reliable, their existence should not deflect attention from our main finding that the consistently significant comorbidities found among the 306 disorder pairs considered herein are likely due to common underlying processes that should be a major focus of future research on the development of comorbidity. One important implication of this finding is that future research on specific pairwise comorbid associations needs to guard against interpreting results as unique without first demonstrating, as we did herein, that they are specific rather than mere realizations of larger processes involving all internalizing and/or externalizing disorders. The fact that we found only 2 factors, finally, does not mean that only 2 underlying processes are at work because multiple processes could underlie each factor, and these diverse processes need to be studied to enrich our understanding of the causal influences leading to the higher-order structure found herein.
Correspondence: Ronald C. Kessler, PhD, Department of Health Care Policy, Harvard Medical School, 180 Longwood Ave, Boston, MA 02115 (firstname.lastname@example.org).
Submitted for Publication: February 3, 2010; final revision received July 13, 2010; accepted August 11, 2010.
Author Contributions: Dr Kessler had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.
Financial Disclosure: Support for the WMH Surveys was obtained from a variety of government agencies, academic institutions, foundations, and private companies. In addition, Dr Andrade reports receiving partial research support for the São Paulo Megacity Mental Health Survey from Eli Lilly & Company, Brazil. Dr Kessler reports having been a consultant for GlaxoSmithKline, Kaiser Permanente, Pfizer, Inc, Sanofi-Aventis, Shire Pharmaceuticals, and Wyeth-Ayerst; having served on advisory boards for Eli Lilly & Company, Johnson & Johnson Pharmaceuticals, and Wyeth-Ayerst; and having received research support for his epidemiological studies from Bristol-Myers Squibb, Eli Lilly & Company, GlaxoSmithKline, Johnson & Johnson Pharmaceuticals, Ortho-McNeil, Pfizer, Inc, and Sanofi-Aventis. Dr Lepine reports having served on advisory boards for Wyeth Ayerst, Sanofi-Aventis, and Pope TBI and having received research support from Eli Lilly & Company and Pfizer, Inc. Dr Russo is an employee of Shire Pharmaceuticals. Dr Stein reports having received research grants and/or consultancy honoraria from AstraZeneca, Eli Lilly & Company, GlaxoSmithKline, Johnson & Johnson Pharmaceuticals, Lundbeck, Orion, Pfizer, Inc, Pharmacia, Roche, Servier, Solvay, Sumitomo, Tikvah, and Wyeth-Ayerst.
Funding/Support: This study was supported by grant R01MH070884 from the US National Institute of Mental Health (NIMH); the John D. and Catherine T. MacArthur Foundation; the Pfizer Foundation; grants R13-MH066849, R01-MH069864, and R01 DA016558 from the US Public Health Service; grant FIRCA R03-TW006481 from the Fogarty International Center; the Pan American Health Organization; the Eli Lilly & Company Foundation; Ortho-McNeil Pharmaceutical, Inc; GlaxoSmithKline; Bristol-Myers Squibb; and Shire. The São Paulo Megacity Mental Health Survey is supported by thematic project grant 03/00204-3 from the State of São Paulo Research Foundation. The Colombian National Study of Mental Health is supported by the Ministry of Social Protection. The WMH Initiative is funded by the WHO (India) and helped by Ramamurthy Chandrasekaran, MD, Jawaharlal Institute of Postgraduate Medical Education and Research. The Lebanese National Mental Health Survey is supported by the Lebanese Ministry of Public Health; the WHO (Lebanon); Fogarty International; anonymous private donations to the Institute for Development, Research, Advocacy and Applied Care, Lebanon; and unrestricted grants from Janssen Cilag, Eli Lilly & Company, GlaxoSmithKline, Roche, and Novartis. The Mexican National Comorbidity Survey is supported by the National Institute of Psychiatry Ramon de la Fuente and by grant CONACyT-G30544- H from the National Council on Science and Technology, with supplemental support from the PanAmerican Health Organization. The Shenzhen Mental Health Survey is supported by the Shenzhen Bureau of Health and the Shenzhen Bureau of Science, Technology, and Information. The Romania WMH study projects “Policies in Mental Health Area” and “National Study Regarding Mental Health and Services Use” were performed by National School of Public Health and Health Services Management (formerly, the National Institute for Research and Development in Health), with technical support of Metro Media Transylvania, the National Institute of Statistics-National Centre for Training in Statistics, SC. Cheyenne Services SRL, Statistics Netherlands with funding by the Ministry of Public Health (formerly, the Ministry of Health) with supplemental support from Eli Lilly Romania SRL. The European Study of the Epidemiology of Mental Disorders (which includes surveys in Belgium, France, Germany, Italy, the Netherlands, and Spain) is funded by contracts QLG5-1999-01042 and SANCO 2004123 from the European Commission, the Piedmont Region (Italy); grant FIS 00/0028 from the Fondo de Investigación Sanitaria, Instituto de Salud Carlos III, Spain; grant SAF 2000-158-CE from the Ministerio de Ciencia y Tecnología, Spain; grants CIBER CB06/02/0046 and RETICS RD06/0011 REM-TAP from the Departament de Salut, Generalitat de Catalunya, Spain, Instituto de Salud Carlos III; by other local agencies; and by an unrestricted educational grant from GlaxoSmithKline. The US National Comorbidity Survey Replication is supported by grant U01-MH60220 from the NIMH with supplemental support from the National Institute of Drug Abuse, the Substance Abuse and Mental Health Services Administration, grant 044708 from the Robert Wood Johnson Foundation, and the John W. Alden Trust. Preparation of the present report was funded by a grant from Shire Pharmaceuticals to investigate the long-term effects of ADHD on the later onset of secondary comorbid disorders.
Role of the Sponsors: The sponsors had no role in the design and conduct of the study; in the collection, management, analysis, and interpretation of the data; or in the preparation, review, or approval of the manuscript. Neither Shire nor any of the other funders of the larger WMH Survey Initiative played any role in defining the research question, performing the analysis, interpreting the results, writing the paper, or approving the paper for submission.
Additional Information: A complete list of WMH publications can be found at http://www.hcp.med.harvard.edu/wmh/.
Additional Contributions: The WMH staff assisted with instrumentation, fieldwork, and data analysis.