Perlis RH, Uher R, Ostacher M, Goldberg JF, Trivedi MH, Rush AJ, Fava M. Association Between Bipolar Spectrum Features and Treatment Outcomes in Outpatients With Major Depressive Disorder. Arch Gen Psychiatry. 2011;68(4):351-360. doi:10.1001/archgenpsychiatry.2010.179
Copyright 2011 American Medical Association. All Rights Reserved. Applicable FARS/DFARS Restrictions Apply to Government Use.2011
The distinction between major depressive disorder (MDD) and bipolar disorder remains a challenging clinical problem when individuals present with a major depressive episode.1- 3 A number of illness features have been proposed to indicate risk of bipolar disorder in this setting, including earlier age of illness onset,4- 7 greater number of depressive recurrences or briefer episodes,4,6,8,9 family history of bipolar disorder,6,9- 11 and aspects of temperament such as hyperthymia and cyclothymia.12- 15 Symptomatic differences have also been proposed, among them presence of irritability or anger,16- 19 presence of psychotic symptoms,10,11,20- 26 suicidality,6,25 and atypical neurovegetative symptoms7,23,27- 29 including psychomotor agitation or slowing.10,30,31 Moreover, even in individuals who do not meet full syndromal criteria for bipolar I or II disorder, it has been suggested that these illness features may be markers for an underlying bipolar diathesis or bipolar spectrum illness.2,32
The identification of individuals at risk for bipolar disorder is of more than academic importance, as treatment may be markedly different; in particular, antidepressants have been suggested to exacerbate the illness course of at least a subset of bipolar individuals.33- 37 Indeed, review articles and continuing medical education programs frequently assert that unrecognized bipolarity is a substantial contributor to apparent treatment-resistant MDD. However, this hypothesis has rarely been tested directly.38- 40 In one of the few empirical investigations, in a cohort of 61 patients previously diagnosed with treatment-resistant MDD seen in a specialty clinic, nearly two-thirds were ultimately rediagnosed with bipolar disorder.41 On the other hand, after a decade of articles suggesting underrecognition of bipolar disorder, typically relying on screening tools42 with limited specificity, it has also been suggested that bipolar disorder may now be overdiagnosed or inappropriately diagnosed.43 Thus, an unanswered question of substantial clinical importance is whether illness features that may indicate bipolar liability have predictive validity in generalizable clinical populations that present with major depression.
The Sequential Treatment Alternatives to Relieve Depression study (STAR*D) presents a unique opportunity to address this question. The STAR*D study was an effectiveness study of MDD conducted in both primary and specialty care settings and designed to closely mimic clinical practice.44 The study's design, which emphasized generalizability to US clinical populations, includes several key features that allow it to estimate predictive validity for bipolar spectrum features. First, reflective of clinical practice, it did not include a structured assessment for bipolar disorder or psychosis. Thus, while individuals who described a prior manic episode or current psychotic symptoms at a screening interview would be excluded by the protocol, if bipolar disorder is as underrecognized as has been suggested,42 most such patients would not have been screened out. Second, for many features of illness course, the STAR*D study relied on unstructured assessments rather than research instruments such as the Structured Clinical Interview for DSM Disorders,45 again resembling standard clinical practice. Assessments included most, though not all, commonly described indicators of bipolar liability.2,46 Finally, and most importantly, the STAR*D study examined up to 4 prospective treatment trials, yielding the largest and most systematically defined treatment-resistance cohort to date.
Taken together, the STAR*D study provides an opportunity to examine in a large, ecologically valid cohort the potential effect of unrecognized bipolar disorder or bipolar spectrum features that are routinely assessed in outpatient practices rather than research settings. Specifically, this study examines the association between putative bipolar spectrum features, individually and combined, and antidepressant treatment outcomes.
The STAR*D multicenter trial determined prospectively which of several treatments are most effective in treating participants with nonpsychotic MDD who do not remit with or tolerate first-line treatment with the selective serotonin reuptake inhibitor citalopram. The methods of the STAR*D study are detailed elsewhere47 and are summarized below.
The STAR*D study was carried out by 14 regional centers across the United States, each of which oversaw implementation of the protocol at 2 to 4 clinical sites. Of the 41 clinical sites, 18 were primary care settings and 23 were psychiatric care settings.
Research outcomes were collected by telephone interviews conducted by a small team of trained research outcome assessors who were masked to treatment and by telephone-based interactive voice response. Research outcome assessors received extensive training in the administration of efficacy measures, with interrater reliability assessed periodically.
The STAR*D study used broad inclusion and minimal exclusion criteria to ensure a representative sample. The study enrolled male and female outpatients aged 18 to 75 years with a DSM-IV diagnosis of nonpsychotic MDD and a baseline score of 14 or greater on the 17-item Hamilton Rating Scale for Depression (HRSD-17)48 and for whom the treating clinician had determined that outpatient antidepressant treatment was safe and appropriate. Exclusion criteria included lifetime diagnosis of MDD with psychotic features, schizophrenia, schizoaffective disorder, or bipolar disorder I, II, or not otherwise specified based on clinical assessment and self-report (but not self-report questionnaires); a well-documented history of nonresponse or intolerability in the current major depressive episode to adequate doses49 of 1 or more medications given in the first 2 protocol treatment steps; a current primary diagnosis of eating disorder or obsessive-compulsive disorder; presence of severe, unstable concurrent psychiatric conditions likely to require hospitalization within 6 months (eg, severe alcohol dependence with recent detoxification admissions); presence of concurrent medical or psychiatric conditions or concomitant medications that contraindicated a protocol treatment; and pregnancy or intent to conceive within the 9 months subsequent to study entry.
By design, the study recruited only individuals who sought treatment at the clinical sites, so no advertising was permitted. Participants were informed of all risks, benefits, and adverse events associated with each study treatment and provided written informed consent prior to study entry. The study protocol was approved by institutional review boards at all participating sites and monitored by an National Institutes of Mental Health Data Safety Monitoring Board.
In addition to sociodemographic data, data collected at the initial study visit included number of prior episodes, current episode duration, age at first episode, history of suicide attempts, and family history of bipolar disorder in first-degree relatives. (For family history, subjects were asked individually about parents, siblings, and children; a positive response to any of these was considered a positive family history for the present analysis). To allow assessment to more closely mimic clinical practice, no structured interview such as the Structured Clinical Interview for DSM Disorders45 or Mini-International Neuropsychiatric Interview50 was used. To assess concurrent axis I diagnoses, study participants completed a modified version of the Psychiatric Diagnosis Screening Questionnaire (PDSQ)51,52; the psychoticlike symptom screen, which assesses the prior 2 weeks, and the maniclike symptom screen, which assess the prior 6 months, were used in the present analysis. The psychoticlike symptom screen includes 6 yes/no questions about beliefs of being spied on, plotted against, having special powers, being controlled, or events that others said did not occur, as well as hearing voices or seeing things. Mood congruence was not assessed. In a validation study of approximately 1300 outpatients, sensitivity was 74% and specificity was 74% for psychosis if at least 1 item was endorsed; specificity increased to 89% with 2 items and 96% with 3.53
The maniclike symptom screen comprises 6 yes/no questions about elevated mood, extreme self-confidence, decreased need for sleep, talkativeness, new projects, and impulsive activities. This subscale was shown to have only modest correlation (0.51) with another self-report inventory of mania54 and lower test-retest reliability than other subscales,52 so sensitivity and specificity were not estimated subsequently.53 Still, as the only systematic measure of maniclike symptoms and in light of the face validity of the individual items themselves, which resemble those incorporated in structured interviews, this measure was retained for analysis. Results of STAR*D analyses using the other PDSQ subscales have been described elsewhere.55
At the initial visit, the study site clinical research coordinator reviewed inclusion and exclusion criteria and completed the HRSD-17 and the 16-item Quick Inventory of Depressive Symptomatology (QIDS-C16).56 The clinical research coordinator was not required to review the PDSQ results.
Within 72 hours of the baseline visit, a research outcomes assessor blinded to participant treatment status conducted a telephone interview with study participants to complete a second baseline HRSD-17 score as well as the 30-item Inventory of Depressive Symptamology–Clinician Rated56,57; the HRSD-17 was repeated by a research outcomes assessor following each study visit. Subjects also completed the self-report form of the brief Inventory of Depressive Symptamology (QIDS-SR) at baseline and each subsequent study visit; this was used as the secondary outcome measure in the STAR*D study. Primary study outcomes defined by protocol included 2 definitions of remission based on self-report (QIDS-SR score ≤5) and assessment by a blinded rater (HRSD-17 score ≤7). As the former definition was available for more subjects and has been considered the primary outcome in prior STAR*D analyses, the present investigation focused on QIDS-SR–defined outcomes. As in prior articles, irritability58 was defined according to Inventory of Depressive Symptomatology item at study entry.
All patients were treated with citalopram at level 1, with a goal of achieving symptomatic remission.44 Dosing was directed by a treatment manual (www.star-d.org), which suggests a starting dose of 20 mg/d of citalopram, increased to 40 mg/d by weeks 2 to 4 and 60 mg/d by weeks 4 to 6. Adjustments were allowed to minimize adverse effects, maximize safety, and optimize the likelihood of therapeutic benefit for each patient.
For those individuals entering level 2, treatment was randomized to either citalopram augmentation with buspirone or bupropion, or switch to sertraline, venlafaxine, or bupropion. (A small subset of individuals received cognitive therapy in addition to or in lieu of citalopram following level 1.) Treatments at level 3 included switching to nortriptyline or mirtazapine or augmentation (of current antidepressant) with lithium or thyroid hormone. Treatments at level 4 included switching to tranylcypromine or the combination of mirtazapine and venlafaxine.
At each level, treatment visits occurred at 0, 2, 4, 6, 9, and 12 weeks; an optional 14-week visit could be added if needed. After up to 14 weeks of treatment, all subjects who did not achieve remission were encouraged to enter the subsequent level. Patients could also exit the current level prior to 12 weeks and enter the next level if they experienced intolerable adverse effects, could not reach an optimal antidepressant dose because of adverse effects, or continued to have significant symptoms defined as a QIDS-C16 score of 9 or greater after at least 9 weeks at the maximal tolerable dose. While both remitters and responders could enter the 12-month naturalistic follow-up, the latter group was also encouraged to enter next-step treatment with a goal of achieving remission.
The variables to be considered as putative bipolar spectrum features were selected by consensus among the authors by intersecting those implicated in recent systematic reviews46 with the available data in the STAR* D study. They include aspects of course: episode duration, age at onset, frequency of recurrence; family history of bipolar disorder; and symptoms at study entry: irritability, psychomotor symptoms, hypersomnia, and hyperphagia, as well as manialike and psychosislike subscale items from the PDSQ. To address nonnormal distributions and reflect published criteria for bipolar spectrum features, continuous variables including episode duration and recurrences were dichotomized to indicate episode duration of less than or equal to 3 months and 3 or more prior episodes. In the absence of agreement in the literature on the optimal cutoff for onset age, this variable was considered as a continuous measure, then dichotomized at 25 years of age for consistency with the criteria of Ghaemi and Goodwin.2 The count of PDSQ psychosislike and manialike subscale items was dichotomized in primary analyses to indicate presence or absence of at least 1 feature of each, as the distribution of scores was highly skewed to the right. That is, those in the group with psychosis endorsed at least 1 of the PDSQ items used to screen for possible psychotic symptoms within 2 weeks of study entry. For exploratory purposes, associations with scores of 2 or more and 3 or more on each subscale were also examined.
Three approaches were used to examine the hypothesis that bipolar spectrum features are associated with poorer treatment outcomes in individuals diagnosed with MDD. First, the features were examined individually for association with outcome. Second, they were examined in combination using the definition of bipolar spectrum disorder described by Ghaemi and colleagues.2 This definition prioritizes family history of bipolar disorder in a first-degree relative or history of antidepressant-induced mania or hypomania. Additional criteria include hyperthymic personality, more than 3 prior major depressive episodes, brief major depressive episodes (<3 months on average), atypical depressive symptoms by DSM-IV, psychotic symptoms, onset of depression before 25 years of age, postpartum depression, absence of sustained antidepressant effect, and treatment resistance. From these, we selected the criteria that do not reflect antidepressant response (as that represented the hypothesis being tested here) and which were not collected, omitting history of antidepressant-induced mania, lack of sustained antidepressant response, and treatment resistance; hyperthymia and postpartum depression were also omitted. Syndromal bipolar spectrum disorder was thus defined as family history of bipolar disorder with at least 1 other putative feature or 3 of 5 putative features in the absence of family history.
To examine the association between individual bipolar spectrum features, or syndromal bipolar spectrum (as well as potential sociodemographic and clinical confounding variables), and longitudinal outcomes, survival methodology was used with results right-censored at study exit or remission; the 369 subjects with no postbaseline visits were excluded, yielding a modified intent-to-treat cohort (n = 3672). Because the primary outcome of interest was number of trials required for remission rather than within-level time to remission, the time variable was the level number. In other words, primary analysis examined “levels (or interventions) to remit” rather than “days to remit,” considering subjects with at least 1 postbaseline visit (ie, a modified intent-to-treat approach). Sensitivity analysis examining overall days, rather than levels, to achieve remission yielded similar results and are presented in eTable 1. Cox regression was used to examine the association between bipolar spectrum features and outcome, first adjusted for baseline depression severity alone (as measured by HRSD-17) and then adjusted for all potential sociodemographic and clinical confounding variables. Alternate approaches to addressing confounding, including use of stepwise backward-elimination techniques, yielded essentially identical results in all cases. Given the modest sample sizes in individual treatment arms, particularly at level 3 and higher, treatment × predictor interactions were not considered. For putative proxy markers of poor outcome, including absence of postbaseline visit, loss to follow-up, and psychiatric significant adverse events including suicide attempt and hospitalization, multiple logistic regression was used, with adjustment for baseline depression severity and then for additional potential confounding variables.
Rather than arbitrarily weighting the individual features, a third set of analyses attempted to define weights of bipolar spectrum features empirically. This approach fitted a structural equation model (SEM) with bipolarity as a latent (ie, not directly observed) variable indicated indirectly by proxy measures including age at onset, recurrence, atypical neurovegetative and psychomotor symptoms, psychoticlike symptoms in the 2 weeks prior to study entry, maniclike symptoms in the 6 months prior to study entry, irritability, and family history of bipolar disorder (eFigure 1). Initially, a measurement model was fitted as a confirmatory factor analysis in which all indicators load on a single common factor reflecting latent bipolarity. Treatment resistance was then modeled as a discrete time survival of unremitting depression in subsequent levels of the treatment trial with censoring of missing values. To reflect the hypothesis that indicators contribute to treatment resistance as a function of bipolar disposition, treatment resistance was regressed on the latent bipolarity with no direct effects of indicators on treatment outcome. The SEM was fitted using maximum likelihood with the EM algorithm and up to 10 000 iterations. This method converges on a solution that maximizes the explanation of covariance between observed variables and allows different weighing of indicators on the latent bipolarity to explain treatment resistance. The SEM approach was also applied to examine the association between latent bipolarity and study discontinuation following baseline visit, subsequent loss to follow-up, or psychiatric serious adverse events.
Univariate analyses used Stata 10.0 (Statacorp, College Station, Texas). The SEM analyses used Mplus 5.21.59
Of 4173 subjects consented, 132 were excluded following consent based on the clinical research coordinator evaluation, including 23 diagnosed with bipolar disorder and 8 with a history of psychotic symptoms. The sociodemographic and general clinical features of the remaining 4041 have been described elsewhere.60Table 1 summarizes the distribution of putative bipolar spectrum features for all subjects. Notably, 1524 of 3999 participants (38.1%) endorsed at least 1 maniclike/hypomaniclike symptom on the PDSQ in the prior 6 months (PDSQ was not completed for 42 subjects). On the PDSQ psychosislike screen, 1198 of 3999 participants (30.0%) endorsed at least 1 symptom in the prior 2 weeks (eTable 2 lists the proportion of subjects who endorsed individual psychoticlike screen items). In all, 650 participants (16.2%) described having both maniclike and psychoticlike symptoms.
The first set of analyses examined the association between individual features, or sets of features, and treatment resistance. Possible confounding sociodemographic and clinical features were examined in univariate models for association with outcome (eTable 3); greater depression severity, self-reported race other than white, being unmarried, having public insurance, not graduating from high school, and comorbid panic disorder (by PDSQ) were found to be associated with treatment-resistant MDD, and analyses of bipolar spectrum features were adjusted for them.
Table 2 summarizes hazard ratios for remission of individual putative bipolar spectrum features. After adjustment for potential confounders from eTable 3, presence of psychoticlike symptoms, irritability, psychomotor symptoms, and hyperphagia were significantly associated with nonremission across subsequent therapeutic trials, while family history of bipolar disorder and presence of manic symptoms were not significantly associated with outcome. Brief episode duration, but not greater number of episodes, was associated with better outcome.
Bipolar spectrum criteria using the standards of Ghaemi and Goodwin2 could be evaluated in 3166 subjects; the remainder could not be scored because of indeterminate number of past episodes or insufficient data to define atypical depression. Of those who could be evaluated, 870 of 3166 participants (27.6%) met the a priori criteria for bipolar spectrum disorder. No significant association with nonremission was identified (Table 2) with or without adjustment for potential confounding variables.
We then used SEM models to examine the relationships between indicators of bipolarity and the effect of a latent bipolar spectrum variable on remission. The putative indicators of bipolarity were generally only weakly correlated (average ρ = 0.08; eTable 4). Reflecting the relatively weak correlations, a common factor explained a large proportion of covariance (0.74) but only a small proportion of variance (0.21) of the 8 indicators, and the measurement confirmatory analysis model had suboptimal fit (eTable 5). The measurement model suggested high loading of age at onset and recurrence but low loadings from more specific indicators of bipolar diathesis (eTable 4, eTable 6, and eTable 5 for details of model structure and fit). These results suggest that the concept of latent bipolarity defined by the 8 indicators has low internal consistency (Cronbach α = .14). That is, they suggest that the previously proposed collection of indicators is unlikely to reflect a single phenomenon. However, even if the commonality between the indicators is small, it may still be predictive of clinically relevant outcomes. Therefore, a survival SEM examined if latent bipolarity predicted resistance across the 4 courses of treatment in STAR*D (eFigure 1). While some individual clinical features might independently predict poorer outcome, these models allowed an examination of the effect of this putative category on outcome. The best fitting structural equation model (eTables 4-6 and eFigure 2) showed inconsistent loading of indicators on the latent bipolarity variable and no significant relationship between latent bipolarity and treatment resistance (z = 1.015; P = .31). Psychoticlike symptoms exhibited the strongest loading on latent bipolarity, but other indicators including family history of bipolar disorder loaded nonsignificantly, reflecting the weak and inconsistent correlations between the indicators. The nonsignificant relationship between latent bipolarity and treatment outcome further supports the lack of consistent association between the grouping of indicators and treatment outcome observed in the univariate analyses.
In addition to treatment resistance, unrecognized bipolar spectrum illness has been suggested to contribute to adverse effects among individuals exposed to antidepressants. Therefore, the second set of analyses examined adverse outcomes associated with exposure of bipolar patients to antidepressants. With citalopram treatment, 2 subjects developed manic episodes, 1 at day 36 and 1 at day 41 of treatment; 2 others developed hallucinations (one, command auditory hallucinations at or shortly after study entry; the other, unspecified hallucinations at day 26). Of these 4, 1 had a positive family history of bipolar disorder (and met criteria for bipolar spectrum disorder), 2 reported at least 1 manic symptom, and 1 reported at least 1 psychotic symptom. One of the 4 met Ghaemi and Goodwin2 criteria for bipolar spectrum illness.
Additional outcomes that might serve as proxies for antidepressant-associated adverse events included rapid study discontinuation (failure to return for a postbaseline visit), early discontinuation with loss to follow-up, and serious adverse events including psychiatric hospitalization or suicide attempt. For each of these outcomes, we first examined sociodemographic or clinical features that might confound associations with that outcome (eTable 7), then adjusted for these variables in subsequent analyses (Table 3). Only onset of illness at or after 25 years of age was associated with risk of failure to return for a postbaseline visit (Table 3), while other putative bipolar spectrum features and meeting criteria for bipolar spectrum illness were not. For study discontinuation with loss to follow-up, DSM-IV atypical depression and greater irritability and/or agitation on the Inventory of Depressive Symptamology were associated with greater risk, while briefer episode duration was associated with decreased risk. Again, no association with other individual illness features or bipolar spectrum illness was identified. Finally, greater risk of a significant adverse event was associated with briefer episode duration and 3 or more past episodes but not with any other variables examined.
When the latent bipolarity variable was examined for association with these putative proxy outcomes in level 1 of the STAR*D study in SEM models (eFigure 3, eFigure 4, and eFigure 5), no evidence of association was observed for failure to return after postbaseline visit (β = −.042; standard error [SE] = 0.038; P = .27) or early discontinuation (β = .055; SE = 0.066; P = .41). However, significant association was identified with psychiatric serious adverse effects (β = .297; SE = 0.077; P < .001).
Our results indicate that putative bipolar spectrum features are common in this general clinical population presenting for treatment of MDD. In particular, the substantial proportion of individuals with at least 1 self-reported recent psychoticlike or maniclike PDSQ screening item suggests that the STAR*D cohort reflects routine practice, where individual symptoms may not be routinely recognized or considered to be of sufficient clinical importance to guide treatment selection.
It must be emphasized that the presence of subjects who endorsed individual psychoticlike or maniclike symptoms does not imply that this study of nonpsychotic individuals with MDD enrolled persons with DSM-IV–defined bipolar disorder or psychotic depression. Rather, these symptoms may be better understood as features that, if recognized, might prompt consideration of a bipolar spectrum illness in some clinical contexts. For psychosis, a large validation study suggested specificity was 89% when 2 items were endorsed and 96% when 3 or more items were endorsed.53 The maniclike subscale is more problematic from a psychometric perspective.51,53 Still, it measures symptoms commonly assessed in clinical practice to screen for past or current manic/mixed episodes, and thus was retained for analysis based on face validity. Nonetheless, failure to identify association with outcome must be interpreted with this limitation in mind.
Analyses of remission suggest that the presence of even a single self-reported recent psychoticlike symptom is associated with poorer antidepressant response across multiple treatment levels. While the presence of a single symptom is not diagnostic for a psychotic disorder, such symptoms appear to have strong predictive validity, supporting prior studies of poorer outcomes with (syndromal) psychotic depression. In particular, our results are consistent with data from the Epidemiologic Catchment Area study that found more chronicity and recurrence risk in psychotic MDD61 and similar results in a 10-year follow-up from the National Institutes of Mental Health Collaborative Depression Study.62
On the other hand, several indicators consistently associated with bipolar disposition in the literature, including history of manic symptoms and family history of bipolar disorder,46 were not associated with outcome of treatment with antidepressants in the STAR*D study. Briefer episode duration, suggested to represent a risk marker for bipolarity, was associated with greater likelihood of remission. Moreover, even if some individual symptoms suggested to be indicative of a bipolar diathesis are associated with poorer outcome, it does not necessarily follow that the diathesis itself is associated with poorer outcome. (In the same fashion, even if fever is associated with poorer outcome in hospitalized patients and sometimes indicates pneumonia, it would not necessarily follow that pneumonia mediates most poor outcomes). We adopted 2 complementary approaches to examine this bipolar diathesis, one based on previously proposed criteria that arbitrarily weigh symptoms, the second based on empirically deriving a latent variable. Meeting criteria for bipolar spectrum illness was not associated with any differential outcome, whether in terms of remission or adverse events. In general, the results of SEM models also strongly suggest that individual symptoms or risk factors explain these poorer outcomes better than a latent bipolar spectrum factor.
The one exception was psychiatric serious adverse events, in which a significant association with risk was identified in the SEM model. Here, inspection of factor loading is informative, indicating strong effects of earlier onset and recurrence (eFigure 5). Thus, the most parsimonious interpretation of this finding, apart from type I error, is simply that recurrent and early-onset MDD may indicate greater risk of adverse outcomes.
The inconsistent prediction of outcomes of interest by the various previously proposed indicators of latent bipolarity raises the question of whether these indicators tap into a uniform concept. Our data suggest that this is unlikely. The latent bipolarity variable exhibited poor internal consistency, reflecting the fact that the putative bipolar spectrum variables are not highly correlated. Indeed, based on the consideration of factor loading, it might be argued that this variable poorly captures the core bipolar spectrum features such as family history. Of course, this argument could be made about any arbitrary weighting of bipolar spectrum features, given the lack of clear agreement on the relative importance of such features. This was the primary rationale for the complementary analysis examining a definition of bipolar spectrum illness based specifically on previously described criteria.
In addition to demonstrating the predictive validity of self-reported symptoms on psychosislike symptom screening, 3 broader conclusions follow from these analyses. First, they suggest that the prevalence of bipolar disorder in otherwise-unselected populations of individuals who had a major depressive episode may be well below the 25% to 50% estimated in studies that use a nonspecific measure of bipolarity such as the Mood Disorders Questionnaire.42 If a large subset of patients had actually been misdiagnosed, as these studies would suggest, we would have anticipated substantially poorer outcomes, either in terms of remission or adverse events, in this group at risk for bipolar disorder. We observed none of these, despite a large cohort with excellent statistical power to detect such effects.
Second, they contradict the often-cited notion that a large proportion of apparently treatment-resistant MDD is actually bipolar disorder or bipolar spectrum illness.41 Were this the case, bipolar loading (whether in terms of a latent variable or a categorical definition such as that proposed by Ghaemi and Goodwin2) should predict poorer outcome across levels, beyond the effect of individual clinical features such as irritability. This was not observed during up to 1 year of sequential treatment. Thus, while treatment-resistant MDD should certainly prompt reconsideration of the primary diagnosis and comorbidity, it does not appear to be the case that many or most of these patients actually have unrecognized bipolar disorder in a typical outpatient setting.
Third, our findings suggest that the bipolar spectrum, defined by a broad range of indicators derived from a comprehensive literature review, has modest predictive validity. This does not imply that the bipolar spectrum concept is not a useful one, nor that it might not be validated by other means such as genetic investigation.63 However, our results provide little support for the predictive validity of bipolar spectrum illness in a generalizable population of treatment-seeking adult patients diagnosed with MDD.
Several limitations bear consideration in interpreting these results. Despite intensive efforts to design a study that closely mimics clinical practice, it can be argued that the requirement for necessary aspects of human subjects research (informed consent, rating scales) yields a study population that might not generalize well. Supporting the greater generalizability of this cohort compared with most clinical trials, most patients in the STAR*D study would have been excluded from MDD efficacy studies.64
In addition, while the STAR*D study did not include systematic assessment for bipolar disorder, subjects with overt features of bipolar disorder were excluded per protocol. Such exclusion probably mimics what transpires with depressed patients in general practice: even clinicians who do not screen for bipolar disorder would be likely to consider the diagnosis when a patient reports a previous diagnosis of bipolar I disorder or a prior manic episode. In general, the subtler presentations of bipolar II disorder could be less likely to have been detected at entry into the STAR*D study. Supporting this assertion, many patients self-reported recent hypomaniclike or psychoticlike symptoms but were not excluded. Nonetheless, the deliberate exclusion of overtly bipolar patients could have limited our power to detect effects on outcome. In a similar fashion, as STAR*D did not include formal measures of manic symptomatology, it is possible that some patients who developed such symptoms would be unrecognized. However, if such symptoms did not meaningfully affect detectable clinical outcomes, their clinical importance is likely to be modest in general outpatient clinical populations.
Finally, some of the measures incorporated here, such as onset age, family history, and prior course, were assessed using simple instruments (ie, patient report) rather than the more detailed research measures used in, for example, some genetic investigations.65 While their lack of precision may bias results toward the null, we note that, for some longitudinal history measures, significant associations with outcome were still detected. More importantly, these assessments—as with many of the PDSQ questions—mimic real-world clinical practice, and therefore provide a useful measure of how standard clinical assessment may or may not be useful in predicting outcome.
We also note that the STAR*D study was not designed to investigate bipolar spectrum features and, in particular, does not include measures of temperament or personality, which others have suggested are indicative of a bipolar diathesis. These include, for example, hyperthymic and cyclothymic temperament12- 15 as well as borderline personality disorder66; in particular, rapid affective shift is not examined in the data available to us. Still, most of the other clinical features suggested in multiple studies cited as criteria for bipolar spectrum illness2 and highlighted by a consensus panel46 could be examined in the STAR*D data set. In light of these strengths and limitations, the present results are likely to generalize to routine psychiatric care and general practice but may not be entirely consistent with experience in highly specialized mood disorder clinics using more intensive assessment.
Considered as a whole, our results cast doubt on the frequent assertion that unrecognized bipolar disorder is widespread in clinical practice and particularly in treatment-resistant MDD. Screening for bipolar disorder among psychiatric patients remains important, as does considering individual risk factors such as family history or age at onset.46 Still, our findings indicate that, in most individuals presenting with a major depressive episode without a prior manic or hypomanic episode, unrecognized bipolarity does not appear to be a major determinant of treatment resistance.
Correspondence: Roy H. Perlis, MD, MSc, Massachusetts General Hospital, Simches Research Building, 185 Cambridge St, Sixth Floor, Boston, MA 02114 (email@example.com).
Submitted for Publication: April 21, 2010; final revision received September 23, 2010; accepted October 7, 2010.
Published Online: December 6, 2010. doi:10.1001/archgenpsychiatry.2010.179
Financial Disclosure: Dr Perlis reports receiving research support from Eli Lilly & Company and Elan/Eisai; advisory/consulting fees from AstraZeneca, Bristol-Myers Squibb, Eli Lilly & Company, and Pfizer Inc; consulting fees or honoraria from AstraZeneca, Bristol-Myers Squibb Company, Eli Lilly & Company, GlaxoSmithKline, Pfizer Inc, and Proteus Biomedical; equity holdings and patents from Concordant Rater Systems, LLC; and royalty/patents from Concordant Rater Systems, LLC. Dr Fava reports receiving research support from Abbott Laboratories, Alkermes, Aspect Medical Systems, AstraZeneca, Bristol-Myers Squibb Company, Cephalon, Eli Lilly & Company, Forest Pharmaceuticals Inc, GlaxoSmithKline, J & J Pharmaceuticals, Lichtwer Pharma GmbH, Lorex Pharmaceuticals, Novartis, Organon Inc, PamLab, LLC, Pfizer Inc, Pharmavite, Roche, Sanofi/Synthelabo, Solvay Pharmaceuticals, Inc, and Wyeth-Ayerst Laboratories; and advisory/consulting from Aspect Medical Systems, AstraZeneca, Bayer AG, Biovail Pharmaceuticals, Inc, BrainCells, Inc, Bristol-Myers Squibb Company, Cephalon, Compellis, Cypress Pharmaceuticals, Dov Pharmaceuticals, Eli Lilly & Company, EPIX Pharmaceuticals, Fabre-Kramer Pharmaceuticals, Inc, Forest Pharmaceuticals Inc, GlaxoSmithKline, Grunenthal GmBH, Janssen Pharmaceuticals, Jazz Pharmaceuticals, J & J Pharmaceuticals, Knoll Pharmaceutical Company, Lundbeck, MedAvante, Inc, Neuronetics, Novartis, Nutrition 21, Organon Inc, PamLab, LLC, Pfizer Inc, PharmaStar, Pharmavite, Roche, Sanofi/Synthelabo, Sepracor, Solvay Pharmaceuticals, Inc, Somaxon, Somerset Pharmaceuticals, and Wyeth-Ayerst Laboratories; and speaking fees from AstraZeneca, Boehringer-Ingelheim, Bristol-Myers Squibb Company, Cephalon, Eli Lilly & Company, Forest Pharmaceuticals Inc, GlaxoSmithkline, Novartis, Organon Inc, Pfizer Inc, PharmaStar, and Wyeth-Ayerst Laboratories. Dr Goldberg reports receiving compensation for speaker's bureaus or lectures from AstraZeneca, Janssen-Cilag, GlaxoSmithKline, Eli Lilly & Co, Merck, and Pfizer; and advisory board or consultant fees from Eli Lilly & Co. Dr Rush has served as a consultant for AstraZeneca, Bristol-Myers Squibb/Otsuka, Merck, and Otsuka Pharmaceuticals and has received royalties from Guilford Publications and Healthcare Technology Systems.
Funding/Support: The Sequenced Treatment Alternatives to Relieve Depression (STAR*D) study is supported by National Institutes of Mental Health grants N01 MH-90003 (University of Texas–Southwestern Medical Center at Dallas; A. J. Rush, principal investigator) and MH086026 (Dr Perlis).
Disclaimer: The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organizations imply endorsement by the US Government.
Additional Contributions: We would like to thank all of the STAR*D investigators for their help in making this large and complex multicenter study possible and for generating the data for this article, and acknowledge Stephen Wisniewski, PhD, for creating the STAR*D database.