Each symptom component is a spatial dimension on the x-axis, y-axis, and z-axis.
Cognitive control is measured by the go/no-go paradigm; working memory is measured by the digit span paradigm; emotion electroencephalographic (EEG) β is β power in the parietal/occipital region for the happy condition during the conscious emotion paradigm; frontal electroencephalographic β is mean β power in the frontal region during the eyes-open resting condition; social function is self-reported daily functional capacity in the domain of social skills; and resilience is daily functional capacity in the domain of emotional resilience. All values are expressed in standardized units to facilitate interpretation of profiles across measures. Error bars represent 1 SD from the mean.
eTable 1. Comorbidities in primary sample.
eTable 2. Comorbidities in validation sample.
eTable 3. Study measures.
eTable 4. Partial eta squared values for 6-cluster solution versus DSM diagnosis.
eTable 5. Tukey HSD post-hoc tests.
eTable 6. R-squared for ANCOVAs including comorbidity covariates for 6-cluster solution versus DSM diagnosis.
eTable 7. PCA component loadings for three negative mood components.
eTable 8. Sex distribution by cluster.
eTable 9. PCA component loadings for three negative mood components in validation sample.
eTable 10. Number and percentage of individuals with each diagnosis across subtypes.
eFigure 1. Gap Statistic for cluster number determination.
eFigure 2. Calinski-Harabasz index for cluster number determination.
eFigure 3. Within-cluster sum of squares for cluster number determination.
eFigure 4. Dendrogram of clustering solution.
eFigure 5. Cluster centers plotted from 10,000 repeated subsamples.
eFigure 6. Histogram of cluster centers calculated from repeated subsampling.
eFigure 7. Distribution of adjusted Rand scores.
eFigure 8. Tukey HSD post-hoc tests.
eFigure 9. Clusters in primary and validation samples.
eFigure 10. Symptom profiles in primary and validation samples.
eFigure 11. Distribution of each diagnostic category by subtype.
Customize your JAMA Network experience by selecting one or more topics from the list below.
Grisanzio KA, Goldstein-Piekarski AN, Wang MY, Rashed Ahmed AP, Samara Z, Williams LM. Transdiagnostic Symptom Clusters and Associations With Brain, Behavior, and Daily Function in Mood, Anxiety, and Trauma Disorders. JAMA Psychiatry. 2018;75(2):201–209. doi:10.1001/jamapsychiatry.2017.3951
Can we identify reproducible symptom subtypes that map onto distinct underlying components of neurocognitive behavior, brain activation, and daily function and/or cut across commonly comorbid mood, anxiety, and trauma disorders?
In this cross-sectional study of 420 individuals with major depression, panic disorder, and posttraumatic stress disorder, clinical symptoms and associated measurements of neurocognitive behavior, neurophysiological brain activation, and daily functional capacity were assessed, and 6 clusters of symptoms were found to be distributed equivalently across diagnostic categories. The subtypes were strongly expressed in distinct underlying components of neurocognitive performance and brain activation and differentiated clinically meaningful degrees of functional capacity.
Identification of transdiagnostic subtypes that are coherent across symptom, behavioral, and neural levels may help disentangle the symptom overlap in conventional psychiatric diagnoses, ultimately guiding tailored treatment choices.
The symptoms that define mood, anxiety, and trauma disorders are highly overlapping across disorders and heterogeneous within disorders. It is unknown whether coherent subtypes exist that span multiple diagnoses and are expressed functionally (in underlying cognition and brain function) and clinically (in daily function). The identification of cohesive subtypes would help disentangle the symptom overlap in our current diagnoses and serve as a tool for tailoring treatment choices.
To propose and demonstrate 1 approach for identifying subtypes within a transdiagnostic sample.
Design, Setting, and Participants
This cross-sectional study analyzed data from the Brain Research and Integrative Neuroscience Network Foundation Database that had been collected at the University of Sydney and University of Adelaide between 2006 and 2010 and replicated at Stanford University between 2013 and 2017. The study included 420 individuals with a primary diagnosis of major depressive disorder (n = 100), panic disorder (n = 53), posttraumatic stress disorder (n = 47), or no disorder (healthy control participants) (n = 220). Data were analyzed between October 2016 and October 2017.
Main Outcomes and Measures
We followed a data-driven approach to achieve the primary study outcome of identifying transdiagnostic subtypes. First, machine learning with a hierarchical clustering algorithm was implemented to classify participants based on self-reported negative mood, anxiety, and stress symptoms. Second, the robustness and generalizability of the subtypes were tested in an independent sample. Third, we assessed whether symptom subtypes were expressed at behavioral and physiological levels of functioning. Fourth, we evaluated the clinically meaningful differences in functional capacity of the subtypes. Findings were interpreted relative to a complementary diagnostic frame of reference.
Four hundred twenty participants with a mean (SD) age of 39.8 (14.1) years were included in the final analysis; 256 (61.0%) were female. We identified 6 distinct subtypes characterized by tension (n=81; 19%), anxious arousal (n=55; 13%), general anxiety (n=38; 9%), anhedonia (n=29; 7%), melancholia (n=37; 9%), and normative mood (n=180; 43%), and these subtypes were replicated in an independent sample. Subtypes were expressed through differences in cognitive control (F5,383 = 5.13, P < .001, ηp2 = 0.063), working memory (F5,401 = 3.29, P = .006, ηp2 = 0.039), electroencephalography-recorded β power in a resting paradigm (F5,357 = 3.84, P = .002, ηp2 = 0.051), electroencephalography-recorded β power in an emotional paradigm (F5,365 = 3.56, P = .004, ηp2 = 0.047), social functional capacity (F5,414 = 21.33, P < .001, ηp2 = 0.205), and emotional resilience (F5,376 = 15.10, P < .001, ηp2 = 0.171).
Conclusions and Relevance
These findings offer a data-driven framework for identifying robust subtypes that signify specific, coherent, meaningful associations between symptoms, behavior, brain function, and observable real-world function, and that cut across DSM-IV-defined diagnoses of major depressive disorder, panic disorder, and posttraumatic stress disorder.
Diagnostic criteria defined by the DSM-IV are heterogeneous within each disorder and overlap substantially between disorders,1 as demonstrated by at least 50% of individuals having concurrent diagnoses from more than 1 category of anxiety and mood disorder at a given time.2-5 Heterogeneity within each disorder manifests not only at the symptom level but also in underlying behavior and physiology, and this limits the opportunity for health care professionals to understand disease mechanisms and to identify valid biomarkers for disease progression and intervention targets. Identifying such biomarkers is an urgent task, given that depression and anxiety have become the leading cause of disability and lost productivity worldwide6 and that only one-third of people recover from treatment.7 In this study, we propose a complementary data-driven approach to uncovering symptom clusters that are coherent across behavioral, physiological, and daily function levels.
In previous data-driven approaches in psychiatry, the focus has typically been on stratification based on symptom type or severity and behavior within a single diagnostic category (eg, schizophrenia,8-10 psychotic disorders,11,12 depression,13-17 attention-deficit/hyperactivity disorder,18-20 and autism21-23). This focus is important for data-driven discovery of subtypes within diagnostic categories but cannot address the need to characterize the heterogeneity and overlap of symptoms across diagnostic categories. Of the available data-driven studies of symptoms across multiple diagnoses, the focus has been on youth transdiagnostic samples.24-26 To our knowledge, no study has documented valid symptom clusters in a cohort of adults spanning multiple mood, anxiety, and trauma disorders and integrated symptom cluster data with data from multiple levels of function.
To address these issues, our data-driven approach has 4 corresponding aims. Our first aim was to use unsupervised machine learning to identify naturally occurring transdiagnostic subgroups within representative samples spanning multiple mood, anxiety, and trauma diagnoses, including major depressive disorder (MDD), panic disorder, generalized anxiety disorder, obsessive-compulsive disorder, posttraumatic stress disorder (PTSD), and bipolar disorder type 2. Because of the large overlap in symptoms across these disorders,27-30 they were considered appropriate for our approach. In this context, the term subgroup refers to transdiagnostic rather than within-diagnostic classification. Second, we sought to assess the robustness and generalizability of the resulting subgroups in an independent validation sample. Third, we integrated multiple sources of data to assess how subgroups differ with respect to independent and external metrics for neurocognitive performance and brain activation. We selected domains that encompass behavior, brain physiology, and self-reported functioning31,32 and that assess broad aspects of neurocognitive and behavioral dysfunction implicated multiple mood, anxiety, and stress disorders.33 Fourth, we evaluated how daily functional capacity varied between the subtypes. Daily functioning was selected as an indicator of clinical meaning because it is considered a primary domain affected in individuals experiencing depression and anxiety symptoms.34 To provide a complementary, theoretically driven diagnostic frame of reference for interpretation, we mapped our data-driven subtypes onto the original categories found in the DSM-IV. We hypothesized that participants would be represented in transdiagnostic, reproducible symptom clusters that map onto specific profiles of neurocognition, brain activation, and daily functional capacity. We further hypothesized that subtypes would cut across diagnostic boundaries.
Participants were recruited systematically by advertisements placed in outpatient and community health settings in the communities surrounding the University of Sydney, Australia, and the University of Adelaide, Australia. These samples are diverse with respect to race/ethnicity, age, and sex, and are therefore representative of the surrounding communities.
Our protocol received independent institutional ethical review board approval of the Human Research Ethical Committees of the Sydney Medical School at the Western Sydney Area Health Service and the University of Sydney prior to recruitment of participants. All participants signed and dated an approved informed consent form. Data are from participants who consented to have their data made available to the Brain Research and Integrative Neuroscience Network Foundation Database,35 for open sharing and secondary analysis by the research community.
We recruited clinical participants who had 1 of 3 primary diagnoses (based on DSM-IV criteria): MDD, PTSD, and panic disorder. Diagnosis was made using the Mini-International Neuropsychiatric Interview36 and Hamilton Depression Rating Scale37 for MDD, the Clinician-Administered PTSD Scale38 and the Structured Clinical Interview for DSM-IV Axis 1 Disorders39 for PTSD, and the Mini-International Neuropsychiatric Interview and Composite International Diagnostic Interview40 for panic disorder.
Comorbid mood and anxiety disorders were present. Of participants with a primary diagnosis of MDD (n = 100), comorbid conditions included PTSD (n = 12; 12%), panic disorder (n = 14; 14%), generalized anxiety disorder (n = 36; 36%), and attention-deficit/hyperactivity disorder (n = 1; 1%). Of participants with PTSD (n = 47), comorbid conditions included MDD (n = 17; 36%) and generalized anxiety disorder (n = 4; 9%). Of participants with panic disorder (n = 53), comorbid conditions included MDD (n = 7; 13%), PTSD (n = 15; 28%), generalized anxiety disorder (n = 5; 9%), dysthymia (n = 3; 6%), obsessive-compulsive disorder (n = 13; 25%), and seasonal affective disorder (n = 31; 59%) (additional data appear in eTable 1 in the Supplement).
Participants were excluded for lifetime and current medical conditions that could affect testing procedures. Exclusion criteria also included lifetime diagnoses of a neurological disorder, brain injury, or any other disorder affecting cognitive, sensory, and/or motor function or the presence of an ongoing substance use disorder (as defined by the DSM-IV).
A total of 497 adults with a mood, anxiety, or trauma disorder (n = 248) or healthy control status (n = 249) were enrolled. Healthy participants, recruited from equivalent communities in each population center, were matched for age and sex to clinical participants on a casewise basis. Those with incomplete symptom data (n = 77) were excluded from reporting, reducing the sample size to 420 participants with a mean (SD) age of 39.8 (14.1) years (range, 18-83 years); 256 (61.0%) were women.
Consistent with our focus on transdiagnostic heterogeneity, individuals were treated as 1 large transdiagnostic sample for the purpose of analysis. Details of recruitment and screening have been published previously35,41 and are documented further in the eMethods in the Supplement.
Data for the independent validation sample were acquired from a sample of 381 adult participants, of whom 207 (54.3%) were female. The mean (SD) age was 36.7 (14.6) years (range, 18 to 86 years). They were also recruited from community sources and tested at an academic center.32 This transdiagnostic scope of this sample also encompassed MDD, PTSD, panic disorder, multiple comorbid disorders, and healthy control participants. (For comorbidity rates and details of inclusion and exclusion criteria, see the eMethods and eTable 2 in the Supplement.)
Negative mood was assessed by the Depression, Anxiety and Stress Scale, version 21 (DASS-21).42 The DASS-21 is a self-report scale for assessing 3 symptom areas common to mood, anxiety, and stress disorders, including subscales for depression (including low positive affect, low self-esteem, and sense of hopelessness), anxiety (encompassing fear, somatic features, and hyperarousal features) and stress (tension and irritability) (eTable 3 in the Supplement contains details on study measures). Because the DASS-21 is well normed and well established for use in representative community samples43 and racial/ethnic groups44 and because it correlates with other instruments used to measure multiple mood and anxiety symptoms,45 it was appropriate for use with our transdiagnostic samples.
A standardized behavioral test battery, “IntegNeuro,” was used to assess neurocognitive function. IntegNeuro has demonstrated reliability, validity, and cross-cultural consistency, has established norms for patients aged 6 years to 80 years and older,46-49 and has demonstrated usefulness for diagnostic groups in case-control studies.50-52 The cognitive constructs assessed by IntegNeuro include cognitive control (choice reaction time, switching of attention, verbal interference/Stroop, go/no-go, and maze), working memory (digit span and span of visual memory), language fluency (word generation for words starting with F, A, or S), and response speed (motor tapping).
Electroencephalography (EEG) served as our neurophysiological measure of brain activation. As in prior research on negative mood states,53-56 EEG data were recorded during resting conditions and during facial emotion-viewing paradigms. Two resting conditions were recorded: a 2-minute span with eyes open, followed by a 2-minute span with eyes closed.
We used well-established facial emotion paradigms for eliciting brain activation responses during the viewing of facial expressions under both conscious (unmasked) and nonconscious (masked) conditions. Facial emotion stimuli depicted expressions of fear, anger, sadness, and happiness, relative to a neutral expression.57 The stimuli included 8 different individuals selected from a standardized series.58 In each paradigm, the stimuli were grouped by the 8 individuals displaying the same emotion, with each grouping repeated 5 times. In the nonconscious condition, facial emotion stimuli were presented below the threshold for conscious sensory detection.
A Neuroscan Compumedics Nuamps system and an Quikcap Electrode System (Compumedics Ltd) were used to record EEG data according to the 10 to 20 electrode international system with 32 channels, including 4 electrooculography channels, an orbicularis oculus channel, and a masseter channel. The details of this well-established protocol have been published previously.41
Informed by prior EEG research on negative mood states, we quantified data for primary frequency bands of focus—α (8–13 Hz) and β (14.5–30 Hz)—and these values were averaged for the frontal region (Fp1, Fp2, F7, F3, Fz, F4, and F8), the central region (FC3, FCz, FC4, C3, Cz, C4, CP3, CPz, and CP4), the temporal regions (T3, T4, T5, and T6), and the parietal/occipital regions (P3, Pz, P4, O1, Oz and O2) bilaterally.41,53,59 We designated θ (4-7.5 Hz) and δ (1.5-3.5 Hz) frequencies as secondary bands of interest, and we clarified in exploratory analyses that there were no significant effects for these bands. Outliers beyond 3 SDs from the mean of power values at each electrode site were mean-replaced. Drawing on prior findings for negative mood states,60 we also quantified frontal α asymmetry according to a subtraction of the natural log transformation of α power for FC4 minus FC3.
Daily functional status was assessed using the Brief Risk-Resilience Index for Screening, a 45-question screening tool that includes subindexes assessing capacity for social skills and emotional resilience.34 The association between this instrument and other self-reported measures of functional capacity, including social and occupational functioning and quality of life, has been established by previous research.34
Statistical analyses were conducted using the stats, psych, cluster, and factoextra packages in R, version 3.3.2 (R Foundation for Statistical Computing), and NumPy, SciPy, IPython, Jupyter, matplotlib, and scikit-learn packages in Python, version 3.6.2 (Python Software Foundation). We used a principal component analysis on the DASS-21 item-level data for all participants (clinical and control) to reduce the data dimensionality while retaining significant variance across measures. The number of components was determined by interpretation of the scree plot and by the percentage of variance explained. Symptom component scores for all participants (clinical and control) were the inputs to the agglomerative hierarchical clustering with Ward error sum of squares algorithm in the R cluster package; this is a standard and reliable method to identify clusters. The optimal number of clusters was determined using 4 standard methods: the gap statistic,61 the Calinski-Harabasz index,62 the elbow method, and the dendrogram (eResults and eFigures 1-4 in the Supplement). Cluster centers were plotted from 10 000 repeated subsamples to assess robustness of the clustering solution (eFigures 5-7 in the Supplement).
To further evaluate robustness and reproducibility, we repeated our clustering methods in the independent validation sample. We performed a principal component analysis using item-level data from the DASS-21 and used the resulting component scores as inputs to the agglomerative hierarchical clustering created with the Ward error sum of squares algorithm, then evaluated how closely the resulting cluster centers matched the original cluster centers.
We evaluated the extent to which subtypes differentiated on the external measures of neurocognition, neurophysiological, and daily functional status. One-way analysis of variance (ANOVA) tests were run to identify significant differences on each measure between the mean scores of the 6 subtypes. Variance explained by the 6-cluster solution was compared with that explained by DSM-IV diagnosis (eTable 4 in the Supplement); post hoc tests were also completed (eFigure 8 and eTable 5 in the Supplement). Redundant variables within each measure were not analyzed to reduce comparisons and retain meaningful results. Multiple comparisons were addressed by using the Bonferroni correction (eMethods in the Supplement). Additional analysis of covariance tests that included comorbidity covariates were run, and variance explained by the 6-cluster solution was compared with variance explained by DSM-IV diagnosis (eTable 6 in the Supplement). For neurocognitive performance, ANOVA tests were run on each of the 9 tests, with a Bonferroni-corrected α level of P = .006. For neurophysiology measures, ANOVAs were run separately for electroencephalographic tests with eyes open and eyes closed and for conscious and nonconscious emotion conditions. In these ANOVAs, dependent variables were the 4 averaged regional power values for both α and β bands; thus, the corrected alpha level was P = .006. An ANOVA was run on the single measure of α asymmetry at P = .05. For self-reported daily function, ANOVAs were run on the 2 functioning domains of social skills and emotional resilience at the corrected alpha level of P = .03.
Principal component analysis of the DASS-21 items revealed 3 components with the orthogonal rotation converging in 6 iterations (eTable 7 in the Supplement). Together, these components accounted for 71.2% of the total variance. Based on the loadings (eResults in the Supplement), the 3 components were named anhedonia, anxious arousal, and tension.
The unsupervised machine learning algorithm identified a 6-cluster solution (Figure 1). Each cluster, or subtype, had a distinct symptom profile (eFigure 9 in the Supplement). Based on each subtype’s mean symptom component scores, they were interpreted as representing the following: normative mood (characterized by low symptom scores on all 3 components) (n = 180); tension (n = 81); anxious arousal (n = 55); general anxiety (n = 38); anhedonia (n = 29); and melancholia (n = 37).
Subtypes differed significantly in anhedonia, anxious arousal, and tension, as determined by 1-way analyses of variance (Table 1). Subtypes did not differ significantly with respect to age, sex, or years of education. (See eTable 8 in the Supplement for sex distribution across clusters.)
In an independent sample, principal component analysis identified the same 3-component solution of anhedonia, anxious arousal, and tension components (eResults and eTable 9 in the Supplement). Hierarchical clustering algorithms, using the 3 component scores as inputs, identified the same 6-cluster solution (eFigures 9 and 10 in the Supplement).
Subtypes were expressed in a differential profile of behavioral performance on tests of neurocognition (Figure 2). Subtypes differed significantly in their cognitive control, as measured by the go/no-go test (z scores were −0.038 for people with normative mood, −0.012 for people with tension, −0.491 for people with anxious arousal, −0.903 for people with general anxiety, 0.057 for people with anhedonia, and −0.163 for people with melancholia; F5,383 = 5.13; P < .001; ηp2 = 0.063). Subtypes also differed by working memory as measured by the digit span test (z scores were for 0.062 for people with normative mood, −0.116 for people with tension, −0.357 for people with anxious arousal, −0.209 for people with general anxiety, 0.158 for people with anhedonia, and −0.260 for people with melancholia; F5,401 = 3.29; P = .01; ηp2 = 0.039). This difference was owing to particularly poor cognitive control and working memory for the anxious arousal subtype.
Subtypes were also expressed in a differential profile of resting brain activation, as assessed by the neurophysiological measure EEG power (Figure 2). Subtypes differed in β power across the frontal region (z scores were for −0.160 for people with normative mood, 0.011 for people with tension, −0.047 for people with anxious arousal, −0.013 for people with general anxiety, 0.725 for people with anhedonia, and −0.165 for people with melancholia; F5,357= 3.84; P = .002; ηp2 = 0.051), particularly for the resting, eyes-open condition. This difference was most prominently related to the comparatively elevated β power of the anhedonia subtype (Figure 2). Subtypes were further differentiated by their emotion-evoked brain activation as assessed by EEG power. During the conscious emotion paradigm, subtypes differed in parietooccipital β power for facial expressions of happiness in particular (z scores were for −0.141 for people with normative mood, 0.052 for people with tension, −0.101 for people with anxious arousal, 0.590 for people with general anxiety, −0.127 for people with anhedonia, and 0.273 for people with melancholia; F5,365 = 3.56; P = .004; ηp2 = 0.047) (Figure 2). This profile of elevated β was primarily owing to the general anxiety subtype relative to other subtypes (Figure 2). Subtypes did not differ significantly in the measure of EEG α asymmetry.
Subtypes were expressed clinically through differential profiles of self-reported functioning, assessed by the Brief Risk-Resilience Index for Screening domains of capacity for social skills (z scores were for 0.287 for people with normative mood, 0.280 for people with tension, −0.617 for those with anxious arousal, 0.195 in persons with general anxiety, −0.529 in those with anhedonia, and −0.959 in people with melancholia; F5,414 = 21.33; P < .001; ηp2 = 0.205) and emotional resilience (z scores were for 0.348 for people with normative mood, 0.188 for people with tension, −0.661 for people with anxious arousal, 0.032 for people with general anxiety, −0.329 for persons with anhedonia, and −0.590 for people with melancholia; F5,376 = 15.10; P < .001; ηp2 = 0.171) (Figure 2). The profile of low emotional resilience, in particular, reflected the poorest functioning in the melancholia subtype.
Subtypes were mapped onto the original DSM-IV diagnostic categories to provide a complementary, theoretically driven frame of reference. Subtypes were shown to cut across diagnostic boundaries because frequency distributions showed subtypes were composed of participants from all diagnostic groups, revealing that cluster groups did not represent diagnosis. For instance, individuals with primary diagnoses of MDD and PTSD were distributed across all 6 subtypes (Table 2; eResults, eTable 6, and eFigure 11 in the Supplement).
This study demonstrates a novel approach to identifying subtypes defined by distinct profiles of symptoms that map onto unique patterns of neurocognition, brain activation, and clinically relevant daily functioning. A machine learning stratification algorithm, blind to diagnosis, identified 6 clusters of individuals based on specific symptom profiles. This 6-cluster solution was replicated in an independent sample, indicating the solution is reproducible. Each type demonstrated a unique profile across domains. Anxious arousal was distinguished by poor daily functioning and the greatest level of neurocognitive impairment compared with most subtypes, particularly in the cognitive control domain. General anxiety was characterized by an elevation in emotion-elicited parietooccipital β power as compared with the normative mood and anxious arousal subtypes, and intact daily functioning. Melancholia, in contrast, was distinguished by the poorest daily functioning, particularly social functioning, compared with the normative mood, tension, and general anxiety subtypes. Anhedonia was distinguished by specific elevations in resting frontal β power, and the tension subtype by average performance across domains despite severe symptoms of tension.
Although previous studies have not examined multidomain profiles across multiple diagnoses, specific aspects of these profiles align with prior findings. Our observation of the poorest neurocognition in the anxious arousal subtype accords with previous within-diagnosis relationships between anxiety and impaired cognition.63-66 Our results align with the view that functional disability, spanning social and (especially) emotional domains, is a hallmark of melancholia.67,68 The frontal β elevation observed for anhedonia is somewhat surprising, because β waves are often associated with active or anxious thinking.69,70 However, at least 1 previous study has shown a connection between increased β and depression specifically.71 A possible account of the elevated parietal β and somewhat poor working memory of people with general anxiety is a parietally mediated compensation for core working memory dysfunction72; however, in this study, the difference on working performance from other subtypes was not significant. By contrast, the relatively intact status of the tension subgroup may reflect more fundamental compensatory mechanisms or a state of stress that does not reach the level of disrupting daily life.
Our approach demonstrates 1 application of a proposed model for ultimately developing a taxonomy for mental disorders that maps specific symptom profiles onto underlying neurobehavioral dimensions. There are important ways in which this approach should be further refined and expanded. Future studies should use additional data-driven techniques and independent samples to further test the robustness of subtype structures. Longitudinal designs (such as the Minnesota Twin Family Study73) will help determine whether subtypes are stable over time, as has been shown within the diagnosis of major depression for adolescents and older adults.13 To further elucidate the functional anatomical basis of the subtypes, high-density EEG data could be acquired for source localization. With in vivo imaging, we could anchor transdiagnostic subtypes in increasingly proximal measures of underlying brain circuits.74,75 Imaging should also be considered as primary inputs for clustering and validating solutions, alongside symptom measures. The Bipolar and Schizophrenia Network for Intermediate Phenotypes has advanced the identification of biotypes for psychosis based on cognitive and neurophysiological measures,11 and validated with functional and imaging measures.76 It will be important to incorporate information at many different domain levels and examine cross-level interactions77 as a means to tap biological systems implicated in major psychiatric domains. Several foundational studies have used imaging of resting functional brain connectivity to identify neurophysiological biotypes within samples of depressed individuals.78-80 These novel approaches have had a profound effect on our understanding of depression and reflect state-of-the-art systems biology research needed to develop valid biotypes.
The clinical utility of new subtype models ultimately rests on their value in helping guide intervention decisions. Because our data-driven approach yields groups of individuals that share symptom, behavioral, and brain activation profiles, it offers one way forward for considering new targets for intervention studies. For example, in light of the general anxiety subtype profiles, 1 such hypothesis would be that behavioral interventions targeting working memory would have a specific association with general anxiety symptoms and would be visible on associated working memory and emotion-elicited EEG β power end points.
One limitation of our study is the relatively small number of individuals in each subtype and the overall small size of the sample. Over and beyond replication, larger samples are needed so that stratifications by sex, age, and symptom-defined subgroups retain large enough numbers in important strata. Data from a large number of participants, input into a data set for the machine learning processes used in this study, will allow additional latent constructs in the data to be uncovered.
Furthermore, this study used limited symptom data for subtype determination. These inputs were constrained by the need for common assessments across diagnoses and samples. Future research that expands the sample size and input features and uses complementary features for subtype determination will be essential in establishing a valid and clinically viable taxonomy for mood, anxiety, and trauma disorders.
Our analyses also focused on current Axis 1 diagnoses. Thus, we were unable to include data on the potential contributions of lifetime history of psychiatric disorder or history of substance use and other disorders. Future studies should include a lifetime history to ensure that all relevant domains underlying psychopathology are being tapped.
We demonstrate a data-driven approach for identifying transdiagnostic subtypes that are distinct, reproducible, and expressed across domains of symptoms, neurocognitive functioning, brain activation, and daily functioning. Because the symptom profiles map on to clinically relevant domains, they offer new targets for developing personalized treatments.
Corresponding Author: Leanne M. Williams, PhD, Department of Psychiatry and Behavioral Sciences, Stanford University, 401 Quarry Rd, Mail Code 5717, Palo Alto, CA 94305 (firstname.lastname@example.org).
Accepted for Publication: October 30, 2017.
Correction: This article was corrected on February 7, 2018, with respect to the 2 leftmost labels on the x-axis of panels A through F in Figure 2; these had been transposed and have been moved to correct positions. It was also corrected on October 24, 2018, to correct 2 typographical errors in Table 1; the numbers were −0.047 and −0.026 and are now corrected to −0.471 and −0.257.
Published Online: December 3, 2017. doi:10.1001/jamapsychiatry.2017.3951
Author Contributions: Ms Grisanzio and Dr Williams had full access to all the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis.
Study concept and design: Grisanzio, Goldstein-Piekarski, Williams.
Acquisition, analysis, or interpretation of data: All authors.
Drafting of the manuscript: Grisanzio, Goldstein-Piekarski, Williams.
Critical revision of the manuscript for important intellectual content: All authors.
Statistical analysis: Grisanzio, Goldstein-Piekarski, Wang, Rashed Ahmed, Williams.
Obtained funding: Williams.
Administrative, technical, or material support: Grisanzio, Wang.
Study supervision: Goldstein-Piekarski, Samara, Williams.
Conflict of Interest Disclosures: Dr Williams has previously received consultant fees from Brain Resource and Humana and is currently on the Scientific Advisory Board for Psyberguide, a project of the One Mind Institute. Dr Wang is an employee of Brain Resource. No other disclosures were reported.
Funding/Support: This research was supported in part by grants DP0773994 and LP0883621 from the Australian Research Council. Brain Resource was the industry partner on LP0883621. National Institute of Mental Health grant R01MH101496 supports Mss Grisanzio and Goldstein-Piekarski and Dr Williams, and National Institute of Mental Health grant F32MH10829 supports Ms Goldstein-Piekarski.
Role of the Funder/Sponsor: The Australian Research Council and National Institute of Mental Health had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication. The Brain Research and Integrative Neuroscience Network Foundation, a non-profit 501c(3), which manages the Brain Resource International Database for independent research purposes, provided access to summary scored data for the purposes of secondary analysis. The foundation had no role in interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.
Meeting Presentation: This paper was presented at the American College of Neuropsychopharmacology Annual Meeting; December 3, 2017; Palm Springs, California.
Additional Contributions: We thank the contribution of each of the clinical sites that provided data to the database. We acknowledge Scott Fleming, BS, of the Department of Psychiatry and Behavioral Sciences at Stanford University for his uncompensated contribution to this study in the form of advice on the approaches used to test the cluster analysis solutions.