Figure 1. Sample and estimated Hamilton Scale for Depression (HAM-D) mean scores. A, For trajectory responders (class 1) and trajectory nonresponders (class 2) over time. B, For patients receiving placebo. C, For patients receiving active drug.
Figure 2. Least squares Hamilton Scale for Depression (HAM-D) mean (95% CI) scores for patients receiving placebo, trajectory nonresponders receiving active drug, and trajectory responders receiving active drug.
Ralitza Gueorguieva, Craig Mallinckrodt, John H. Krystal. Trajectories of Depression Severity in Clinical Trials of DuloxetineInsights Into Antidepressant and Placebo Responses. Arch Gen Psychiatry. 2011;68(12):1227–1237. doi:10.1001/archgenpsychiatry.2011.132
Author Affiliations: Division of Biostatistics, School of Public Health (Dr Gueorguieva), and Department of Psychiatry (Dr Krystal), Yale University School of Medicine, New Haven, and Veterans Affairs National Center for Posttraumatic Stress Disorder, Veterans Affairs Connecticut Healthcare System, West Haven (Dr Krystal), Connecticut; and Lilly Research Laboratories, Eli Lilly and Company, Indianapolis, Indiana (Dr Mallinckrodt).
Context The high percentage of failed clinical trials in depression may be due to high placebo response rates and the failure of standard statistical approaches to capture heterogeneity in treatment response.
Objective To assess whether growth mixture modeling can provide insights into antidepressant and placebo responses in clinical trials of patients with major depression.
Design We reanalyzed clinical trials of duloxetine to identify distinct trajectories of Hamilton Scale for Depression (HAM-D) scores during treatment. We analyzed the trajectories in the entire sample and then separately in all active arms and in all placebo arms. Effects of duloxetine hydrochloride, selective serotonin reuptake inhibitor (SSRI), and covariates on the probability of following a particular trajectory were assessed. Outcomes in different trajectories were compared using mixed-effects models.
Setting Seven randomized double-blind clinical trials of duloxetine vs placebo and comparator SSRI.
Patients A total of 2515 patients with major depression.
Interventions Duloxetine and comparator SSRI.
Main Outcome Measure Total score on the HAM-D.
Results In the entire sample and in the antidepressant-treated subsample, we identified trajectories of responders (76.3% of the sample) and nonresponders (23.7% of the sample). However, placebo-treated patients were characterized by a single response trajectory. Duloxetine and SSRI did not differ in efficacy, and compared with placebo they significantly decreased the odds of following the nonresponder trajectory. Antidepressant responders had significantly better HAM-D scores over time than placebo-treated patients, but antidepressant nonresponders had significantly worse HAM-D scores over time than the placebo-treated patients.
Conclusions Most patients treated with serotonergic antidepressants showed a clinical trajectory over time that is superior to that of placebo-treated patients. However, some patients receiving these medications did more poorly than patients receiving placebo. These data highlight the importance of ongoing monitoring of medication risks and benefits during serotonergic antidepressant treatment. They should further stimulate the search for biomarkers or other predictors of responder status in guiding antidepressant treatment.
Trial Registration clinicaltrials.gov Identifier: NCT00073411
The high degree of heterogeneity in the therapeutic response to antidepressant medications among patients with major depressive disorder constitutes an important public health problem, with significant implications for the development of medications for the treatment of depression. The presence of distinct patterns of response to antidepressant medications might help to explain why medications that were prescribed to more than 10% of the American population in 2005 are associated with modest effect sizes,1 with only 51% of studies in the Food and Drug Administration antidepressant database showing positive results.2,3 Further heterogeneity in clinical response in the real world may be greater than that of patients in clinical trials, as many common features of clinical complexity lead patients to be excluded from clinical trials.4 This view is consistent with the observation that antidepressant medications, commonly prescribed and effective treatments for suicide risk among many adults,5 increase the risk of suicide in some patients.6
Similarly, heterogeneous responses to placebo have plagued antidepressant drug development. High rates of placebo response7 may contribute to the observation that approximately half of the studies of approved antidepressant medications produce negative or questionable results.3 These challenges motivate research to identify distinct trajectories of clinical response to antidepressant medications or placebo on an empirical basis.8,9 The identification of distinct trajectories of response might provide a foundation for the development of biomarkers or other predictors of treatment response, as well as refinements in the design of antidepressant clinical trials.
Trajectory-based methods address limitations of other approaches of longitudinal analysis of heterogeneous samples. For example, end-point analyses with last observation carried forward are often used, despite serious problems with bias and loss of temporal information.10 Mixed-effects models11 use all available data on patients, reduce bias, and improve signal detection over end-point analyses.10,12,13 However, they assume the same mean trajectory over time for all patients within the same treatment group. Therefore, they do not account for the fact that treatment responders and nonresponders have distinct patterns of response. In contrast, trajectory-based models (also known as latent class models14 and growth mixture models15- 17) allow identification of distinct classes of developmental trajectories and assessment of the effect of treatments on trajectory membership. These methods have been applied to treatment research studies in the areas of alcoholism research18,19 and depression.20- 22
Duloxetine hydrochloride is a Food and Drug Administration–approved antidepressant medication with demonstrated efficacy for major depressive disorder.23- 25 Pooled analyses of duloxetine studies11,25,26 have addressed important questions that required larger sample sizes than the typical clinical trial. Herein, we apply trajectory-based methods to the pooled data to explore whether they identify similar or different trajectory classes with active and placebo treatment, whether they provide new insights into the nature of antidepressant and placebo responses, and whether trajectory-based analyses improve signal detection over traditional analytic methods.
We analyzed all treatment arms from 7 randomized multicenter double-blind active and placebo comparator-controlled clinical trials of duloxetine for major depressive disorder. Earlier phase II trials conducted in the dose range of 5 to 20 mg/d that were not included in the integrated summary of efficacy were excluded. Table 1 lists the trials, protocols, arms, and sample size per arm. Four different protocols were used for these studies (HMAQ, HMAT, HMAY, and HMCR). Parts A and B reflect trials run in parallel following the same protocol. Pooling of data from these trials was anticipated during study design. All trials incorporated double-blind variable-duration placebo lead-in periods to blind patients and investigators to the start of active therapy. Safety and efficacy results from these studies have been published previously as individual study findings23,24,27,28 and summarized as pooled analyses of safety25 and efficacy.13
For this analysis, the following 3 levels of the drug factor were used: duloxetine (all duloxetine doses), SSRI (fluoxetine hydrochloride, paroxetine hydrochloride, and escitalopram oxalate), and placebo (all placebo groups). Because protocols for dose adjustment varied across trials, we could perform dose analyses on only 2 of the protocols (HMAT and HMAY). These secondary analyses revealed no significant dose effects, and the results are not presented herein.
The outcome variable was total score on the 17-item Hamilton Scale for Depression (HAM-D).29 We used growth mixture modeling and a commercially available computer program (MPlus; Muthen and Muthen, Los Angeles, California15- 17) to identify distinct trajectories of HAM-D scores during treatment. We first fitted models to the entire sample (duloxetine, SSRI, and placebo arms combined) and then fitted separate models to the active arms and to the placebo arms. The latter analyses were used to evaluate whether different classes would emerge for patients in the active arms and in the placebo arms. We considered linear, quadratic, and cubic trends over time, with between 1 and 4 trajectory classes. We also considered piecewise models with a change point at 2 weeks, linear change before week 2, and quadratic change after week 2.
The selection of the best model was based on the Schwartz-Bayesian information criterion and on the Lo-Mendell-Rubin (LMR) likelihood ratio test.30 The LMR statistic tests whether a model with 1 less class than the fitted model describes the data as accurately and is used to select the number of classes. We applied the restriction that we need to have at least 5% of the patients in a class for that class to be meaningful clinically and stable numerically. Classification accuracy was assessed using the entropy value ranging between 0 and 1, with values closer to 1 corresponding to better classification accuracy.
Once the best-fitting model for the entire sample was identified, patients were classified to the most likely trajectory class, and weighted logistic regression analysis was performed to assess the effects of treatment and baseline characteristics on membership in a particular class. Baseline characteristics included sex, atypical flag (yes or no), melancholia flag (yes or no), age, age at onset, Hamilton Scale for Anxiety (HAM-A) total score at baseline, duration of current episode (<8 weeks, 8-18 weeks, 18 weeks to 1 year, or >1 year), and number of previous episodes (0, 1-2, 3-4,≥5, or missing). We performed backward elimination at the P < .10 level. The weights were the posterior probabilities of membership in the assigned class. The association of each baseline characteristic with trajectory membership was also tested one at a time usingχ2 test, t test, or Wilcoxon rank sum test.
Because the separate analyses of the subsample receiving active drug and the subsample receiving placebo revealed 2 trajectory classes for patients receiving active drug (responders and nonresponders) but only 1 class for patients receiving placebo, mixed-model analyses were performed to assess whether patients receiving placebo had significantly different responses from patients receiving active drug who were classified in responder and nonresponder trajectories. To distinguish these 2 trajectory classes from the clinical definitions of responders and nonresponders, we refer to them as trajectory responders and trajectory nonresponders. The response variable in the mixed model was HAM-D total score during 8 weeks. The predictor variables were trajectory class membership (placebo, trajectory responder on duloxetine or SSRI, or trajectory nonresponder on duloxetine or SSRI), time (as a categorical variable), and their interaction.
An unstructured variance-covariance matrix was used for the errors. To control for the potential confounding of baseline covariates when comparing the randomized placebo group with the nonrandomized trajectory responder and trajectory nonresponder groups, we used a propensity scoring approach.31 We calculated predicted probability (propensity score) for each patient to be in the trajectory nonresponder class and used this probability as a covariate in the mixed model. The propensities were calculated based on the fitted logistic regression model for patients in the active arms using all available baseline covariates as predictors.
To assess the relationship between trajectory response and clinical response, we usedχ2 test and measures of agreement. Trajectory responders were patients who were classified in the responder trajectory. Clinical responders were patients with at least 50% improvement from baseline and a HAM-D total score of less than 10 using last-observation-carried-forward imputation for missing data.
Our primary analyses are valid under missing at random (MAR) assumptions. We assessed the effect of missing data on our results by performing a limited sensitivity analysis under missing not at random (MNAR) assumptions. We used the Muthén-Roy pattern mixture model that Muthén et al32 recommended as the most appropriate and flexible model of its class. It allows the pattern of dropout to influence the outcome of growth mixture modeling by defining 2 distinct latent class variables, one related to dropout and another related to the outcome trajectories. We identified trajectory classes based on the Muthén-Roy model and repeated all remaining analyses using the corresponding trajectory class definitions.
According to the Schwartz-Bayesian information criterion, the best-fitting set of models was the set of piece wise growth mixture models (Table 2). Among these, according to the LMR statistic, the model with 2 classes fit the data best. Figure 1A shows the estimated and sample means for the 2 trajectory classes over time based on all the data. The class on the bottom (class 1), with 76.3% probability of membership in this class, can be interpreted as the class of trajectory responders. The class on the top (class 2), with 23.7% probability of membership in this class, can be interpreted as the class of trajectory nonresponders.
Univariate associations between classifications in the 2 trajectory classes (trajectory responder and trajectory nonresponder) and treatment, protocol, and covariates are summarized in Table 3. Based on logistic regression analysis with backward elimination, the following variables were significantly related to trajectory membership: drug, protocol, HAM-A total score at baseline, and duration of current episode. Compared with patients receiving duloxetine, patients receiving placebo had significantly lower odds of being in the responders trajectory (odds ratio [OR], 0.56; 95% CI, 0.42-0.73). The results for patients receiving SSRI were similar and statistically significant (OR, 0.69; 95% CI, 0.51-0.93). Patients receiving duloxetine and patients receiving SSRI did not have statistically different odds of being in the responders trajectory (OR, 1.24; 95% CI, 0.93-1.64). Higher HAM-A total score at baseline was associated with significantly lower odds of being in the responders trajectory (OR, 0.90; 95% CI, 0.88-0.92). Longer duration of current episode seemed to be associated with lower odds of being in the responders trajectory (P = .03), but the post hoc pairwise comparisons of longer durations with the shortest duration were not statistically significant. Compared with the other protocols, protocol HMAY was associated with significantly higher odds of being in the responders trajectory. The 2 trials using this protocol (HMAY part A and HMAY part B) were different from the other trials because they were conducted in Eastern Europe, they included 6-month extensions rather than acute phase only, and the dropout rates were 9% and 19% compared with 30% to 40% for the other trials.
Because duloxetine and SSRI were not differentiated by odds of assignment to the responders trajectory, we combined these 2 groups for the separate analyses of the active and placebo arms. Analyses of both the active and placebo subsamples showed that piecewise linear models fit the data best (Table 4). The active arms identified classes similar to the ones based on the entire sample. Figure 1B and C show the best-fitting solutions for the placebo and active data, respectively. Notably, the placebo arms failed to identify more than 1 trajectory class based on the LMR statistic, suggesting that there was no categorical difference in patients' responses to placebo. Rather, continuous bell-shaped distributions were sufficient to describe the between-patient heterogeneity in means, rates of change, and curvature in the response over time in the placebo arms.
To assess whether this result might be affected by potential confounding factors (eg, the number of treatment arms, percentage of patients randomized to placebo,33 or different sample sizes for the active and placebo groups), we performed separate analyses for the 3-arm and 4-arm trials, separate analyses of trials with different placebo response rates (20%, 25%, and 40%), and separate analyses of the duloxetine and SSRI arms. The substantive conclusions were the same. Only 1 class was identified for the placebo group (P > .1 for all by LMR test). Separate analyses of the duloxetine and SSRI groups favored a 2-class solution over a single-class solution (P > .03 for duloxetine and P > .002 for comparator SSRI). Adding a third class did not improve the fit of these data (P = .07 for duloxetine and P = .15 for comparator SSRI), and the third class consisted of a very small percentage of patients with unstable trajectory estimation. Therefore, 2 classes were necessary to adequately describe heterogeneity in response trajectories with duloxetine and comparator SSRI.
The mixed models comparing HAM-D scores over time of trajectory responders receiving active drug, trajectory nonresponders receiving active drug, and patients receiving placebo (Figure 2) showed a significant interaction between trajectory class membership and time (F16,1801 = 73.3, P < .001). Not surprisingly, trajectory responders receiving active drug showed reduced HAM-D scores compared with patients receiving placebo at week 8 (t2081 = −4.08, P < .001). However, pairwise comparisons at week 8 also showed significantly higher HAM-D scores for trajectory nonresponders receiving active drug compared with patients receiving placebo (t2056 = 7.95, P < .001). When controlling for the propensity of a patient to be a trajectory responder, the differences between trajectory nonresponders receiving active drug vs patients receiving placebo and between trajectory responders receiving active drug vs patients receiving placebo at week 8 remained significant (t2062 = 21.23 and t2045 = −3.94, respectively; P < .001 for both). Therefore, it seems that the baseline characteristics we studied were insufficient to account for the differences observed between the nonresponders receiving active drug and the patients receiving placebo.
Using the definition of clinical response as at least 50% improvement from baseline and a HAM-D total score of less than 10 and a last-observation-carried-forward imputation method for missing data, the correspondence between trajectory responders and clinical responders indicated that 480 of 481 (99.8%) trajectory nonresponders were also clinical nonresponders. However, 1318 of 2034 (64.8%) trajectory responders were classified as clinical responders. The remaining trajectory responders (716 of 2034 [35.2%]) were classified as clinical nonresponders.
Overall, dropout proportions were 23.5% of patients receiving duloxetine, 23.0% receiving SSRI, and 26.7% receiving placebo. Among trajectory responders on active drug, the dropout proportion was 18.5%, and among trajectory nonresponders receiving active drug, the dropout proportion was 24.4%. In the entire sample, the Muthén-Roy pattern mixture model under MNAR assumptions identified 2 classes of trajectory response similar to those identified under MAR assumptions (63.7% trajectory responders and 36.3% trajectory nonresponders) and 2 classes of dropout patterns (22.6% with high probability of dropout and 77.4% with low probability of dropout). More patients were classified as trajectory nonresponders in the MNAR analysis than in the MAR analysis (36.3% vs 19.1%). Univariate differences in baseline characteristics between trajectory responders and trajectory nonresponders similar to those given in Table 2 emerged, with 2 exceptions. There was no longer a significant difference in class membership by melancholia depression type (P = .38), and the trajectory responders identified under MNAR assumptions had older mean (SD) age at onset than trajectory nonresponders identified under MNAR assumptions (32.1 [13.7] vs 31.0 [14.1] years, P = .04). Backward elimination with MNAR trajectory class definitions resulted in the same final model as that reported for MAR assumptions, with similar ORs.
In the placebo group, 1 trajectory class and 2 dropout classes (23.3% with high probability of dropout and 76.7% with low probability of dropout) were identified, while in the active group 2 trajectory classes (68.0% trajectory responders and 32.0% trajectory nonresponders) and 2 dropout classes (19.0% with high probability of dropout and 81.0% with low probability of dropout) were identified. According to the MNAR analysis, the mixed models comparing HAM-D scores over time of trajectory responders receiving active drug, trajectory nonresponders receiving active drug, and patients receiving placebo showed a significant interaction between trajectory class membership and time (F16,1826 = 82.6), significantly lower HAM-D scores for trajectory responders receiving active drug than for patients receiving placebo at week 8 (t2181 = 21.72), and significantly higher HAM-D scores for trajectory nonresponders receiving active drug than for patients receiving placebo at week 8 (t2047 = −17.99) (P < .001 for all). Therefore, the MNAR sensitivity analyses confirmed our results under MAR assumptions.
The overall trajectory-based analyses of the treatment of 2515 patients successfully classified them into response trajectories and confirmed that duloxetine and SSRI treatment increased the likelihood that most patients treated with these medications would be classified in the responders trajectory. The magnitude of the effects seemed as large as the magnitude of the effects from end-point or mixed-model analyses.12,13 Therefore, trajectory analyses allowed for strong signal detection, although treatment effects were not more significant than in end-point and mixed-model analyses. The added improvements of trajectory analyses were that patients were classified into response trajectories and that the trajectories were different for active drug and for placebo. As noted earlier,“responder” in this study refers to a favorable clinical trajectory rather than achievement of a priori criteria based on symptom thresholds, an approach that is commonly used in clinical trials.34
Separate analyses of the active and placebo groups revealed that distinct trajectories were identified in the groups treated with duloxetine or SSRI but not in the placebo group. Most patients (about three-quarters) receiving active drug were classified as trajectory responders, and fewer (about one-quarter) were classified as trajectory nonresponders. Patients receiving placebo could not be separated into distinct trajectories and on average showed gradual improvement over time. The failure to identify more than 1 trajectory classes over time in the placebo group is remarkable because growth mixture models are more prone to overestimate rather than underestimate the number of trajectory classes.35,36 The finding of a single trajectory for patients assigned to placebo may reflect limited statistical power to resolve subtle differences in response trajectories in this group. However, this reasoning assumes that there are categorically different outcomes in each group, and this may not be the case. The present data suggest that widely divergent trajectories of individual patients treated with placebo are best explained as variations within a single class (ie, placebo response differences may be a dimensional rather than categorical characteristic).
The present findings challenge the prevailing view37,38 that placebo response is associated predominantly with rapid and transient clinical improvement. In some investigations, researchers classified patients showing this response pattern as placebo responders, although the validity of this assumption was never demonstrated. The problem with this conclusion is that growth mixture models provide valid results only under the assumption that placebo responders and nonresponders are groups differentiated by a categorical distinction. If categorically different classes do not exist, spurious latent classes are likely to be identified.35,36 Therefore, whether one finds categorically different trajectories depends directly on whether one is willing to assume a priori that they exist. This problem is exacerbated in studies with small sample sizes. This study did not assume a priori that placebo response was constituted by multiple trajectory classes. Despite the fact that this study analyzed a much larger sample than prior studies of its kind, its analyses do not support the notion that there is a specific placebo response profile. Rather, we observed an average gradual improvement over time among patients taking placebo, with noticeable dimensional but not categorical heterogeneity. Our results are consistent with the view that placebo response is a continuous measure that is manifested to varying degrees across patients. This view is consistent with the approach by Tarpey and Petkova,39 who used continuous rather than categorical latent variables to model patient responses to antidepressant medications.
The findings that patients receiving active drug were classified as trajectory responders and trajectory nonresponders, while there was only 1 trajectory for patients receiving placebo, may be partially explained by the effect of the active compound. Active compounds generally have the potential to produce adverse responses that could contribute to categorically different responses. For example, medications with inverted U-shaped dose-response relationships may be beneficial for patients with low baseline values but may be harmful to patients at the opposite end of the spectrum.
Another possible explanation for the single trajectory class among patients receiving placebo is that all clinical trials in this analysis had a placebo treatment phase before treatment with randomized medication. This placebo lead-in may have acted as a filter that reduced response heterogeneity during subsequent placebo treatment because it eliminated patients with immediate placebo response from the sample. Placebo lead-in is a practice that is intended to improve signal detection by selecting out patients who are likely to respond to placebo40 and by selecting in patients whose poor placebo response portends superior medication responses.41 However, the published data42- 46 overwhelmingly indicate that the rate of clinical response during placebo lead-in is very low and that exclusion of placebo responders does not improve signal detection in clinical trials. Therefore, it is unlikely that the presence of placebo lead-in periods in the contributing studies influenced the present findings or compromised their generalizability.
Our analyses do not support the view that antidepressant treatment response was slower and less transient than placebo response. Rather, consistent with a recent meta-analysis47 confirming early signs of SSRI efficacy, the change in HAM-D scores as a result of antidepressant treatment among responders seemed to be faster and more sustained than the change in HAM-D scores among patients on placebo. Our analyses also support the notion that early improvement during the first 2 weeks of treatment is predictive of treatment outcome48 and that different antidepressant medications have similar treatment response profiles.49,50
In the present analysis, where there are 2 trajectories for patients treated with antidepressants and 1 trajectory for patients treated with placebo, some patients would seem to be more effectively treated with placebo than with a serotonergic antidepressant. The trajectory of nonresponse to antidepressants was not consistent with the timing of a transient increase in suicidality among antidepressant-treated patients with depression.6 Instead, the trajectories of responders and nonresponders diverged increasingly over time, suggesting that patients affected adversely by serotonin reuptake inhibitors might be better off if these medications were discontinued. At a minimum, they highlight the potential clinical importance of careful ongoing monitoring of the effect of prescribed antidepressants.
The clear separation of patients treated with antidepressants into responders and nonresponders and the observed homogeneity of treatment response on placebo, together with the significantly worse mean response of trajectory nonresponders taking active drug compared with the mean response of patients taking placebo, may help explain the failure of many clinical trials of antidepressant medication to demonstrate treatment efficacy. The main hypotheses in almost all clinical trials focus on demonstrating average treatment effects. However, when there is sizable heterogeneity in treatment response, with most patients benefiting from a treatment but with some patients demonstrating significantly worse outcomes than the average improvement while receiving placebo, it is likely that average treatment effects will be diminished and may lose statistical significance. Therefore, unless strategies are introduced to reduce this heterogeneity or study sample sizes are increased significantly to augment statistical power, it would seem that the status quo regarding failed trials of serotonin reuptake inhibitors is unlikely to change.
The fact that almost all trajectory nonresponders were clinical nonresponders but only about two-thirds of trajectory responders were clinical responders suggests that the clinical response definition is stricter than the trajectory response definition. It is possible that patients who do not meet the clinical response definition but are classified as trajectory responders may meet the clinical response definition with a longer follow-up period. This can be evaluated in follow-up analyses. Validation of our results in other studies is also necessary to assess how stable the trajectory class definitions are.
The inferences drawn from the present analyses are limited by several factors. First, we were unable to differentiate whether the poor responses to antidepressants arose from a negative effect of taking antidepressant medication or as a consequence of discontinuation of these medications. Although our limited sensitivity analysis under MNAR assumptions suggests that our results are not sensitive to the effects of missing data, further research is necessary to estimate the extent of the influence of missing data due to different model assumptions for missing data.
Second, although the reported studies included only patients with unipolar major depression determined using rigorous diagnostic methods, it is impossible to rule out the presence of latent bipolar disorder in the study population. Antidepressants are reported to have reduced efficacy in treating depression among patients with bipolar disorder, and they may increase mood cycling.51,52 Similarly, adverse effects of other forms of psychopathologic conditions, particularly personality disorders, or the social use of alcohol cannot be determined using the existing data set.
Third, the implications are limited by the brief duration (8 weeks) of the reported studies. In the present study, it was impossible to predict whether nonresponders may become responders with extended treatment or whether the negative consequences of antidepressant discontinuation in some nonresponders outweigh the apparent benefits.
Fourth, the present analysis is predicated on the assumption that heterogeneous classes of antidepressant medications exist. An alternative view is that the probability of nonresponse may vary continuously across the population, and if this is indeed the case, alternative models with latent trait rather than latent class variables might be more appropriate.39
Fifth, we considered only 1 type of linear parametric model. Alternative nonlinear models have been considered by other authors53,54 to predict treatment outcome from early treatment response. It is difficult to distinguish the fit of such models from the fit of simpler polynomial models with a limited number of time points. Such models have not been used in the context of growth mixture modeling and are unavailable in standard software, but they might provide an important tool for flexible modeling of treatment response for more frequently collected treatment outcome data.
Sixth, our analyses do not assess causal treatment effects because we consider treatment a predictor of class membership rather than a predictor of growth factors within a class, as suggested by Muthén and Brown.55 In the causal framework, latent classes are considered characteristics of the patients (eg, never responders, drug-only responders, placebo-only responders, and always responders), and a key assumption is that the“numbers of classes and the true, population proportions of patients in each class be the same across intervention conditions.”56(pS96) In our analysis, classes are considered empirically derived distinct trajectories of longitudinal response, and we have direct evidence that the number of classes and the proportion of classes vary by treatment. Although our propensity scoring approach allowed us to control for observed predictors of trajectory membership, it is possible that unmeasured confounders (eg, genotypes and environmental factors) can affect our results.
Identifying predictors of clinical trajectories might advance the personalized treatment of major depressive disorder (ie, to assist in the better matching of patients and treatments). In this regard, the present study replicated the finding from the large multicenter antidepressant trial Sequenced Treatment Alternatives to Relieve Depression (STAR*D) that high levels of anxiety predicted poorer antidepressant response.57 Predictors of membership in both favorable and poor response trajectories might be used to inform the selection of medications for particular patients. Despite a long history of the study of clinical,31,58 neurochemical,59 and genetic60 predictors of subtypes of depression and treatment response, there are still no objective bases for the personalized treatment of depression that are sufficiently explanatory and specific to guide the treatment of individual patients. Future research will be needed to determine whether trajectory-based analyses will be useful in advancing this objective.
Correspondence: Ralitza Gueorguieva, PhD, Division of Biostatistics, School of Public Health, Yale University School of Medicine, 60 College St, Room 215B, New Haven, CT 06520-8034 (email@example.com).
Submitted for Publication: May 27, 2011; accepted July 8, 2011.
Author Contributions: Dr Gueorguieva had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.
Financial Disclosure: Dr Krystal has been a consultant to the following companies: Aisling Capital LLC, AstraZeneca Pharmaceuticals, Biocortech, Brintnall& Nicolini Inc, Easton Associates, Gilead Sciences Inc, GlaxoSmithKline, Janssen Pharmaceuticals, Lundbeck Research USA, Medivation Inc, Merz Pharmaceuticals, MK Medical Communications, Pfizer Pharmaceuticals, F. Hoffmann-La Roche Ltd, SK Holdings Co Ltd, Sunovion Pharaceuticals Inc, Takeda Industries, Teva Pharmaceutical Industries Ltd, and Transcept Pharmaceuticals. Dr Krystal is on the scientific advisory board of Abbott Laboratories, Bristol-Myers Squibb, Eisasi Inc, Eli Lilly and Co, Forest Laboratories, Lohocla Research Corporation, Mnemosyne Pharaceuticals Inc, Naurex Inc, Pfizer Pharmaceuticals, and Shire Pharmaceuticals. Dr Krystal has the following patents and inventions: (1) patent 5 447 948 (September 5, 1995) with J. P. Seibyl and D. S. Charney on dopamine and noradrenergic reuptake inhibitors in the treatment of schizophrenia and (2) patent application PCTWO06108055A1 with G. Sanacora related to targeting the glutamatergic system for the treatment of neuropsychiatric disorders.
Funding/Support: This trial was supported by grants K05 AA 14906 and 2P50 AA 012870 from the National Institute on Alcohol Abuse and Alcoholism and the Veterans Affairs National Center for Posttraumatic Stress Disorder and by Clinical and Translational Science Award UL1 RR024139 from the National Center for Research Resources (a component of the National Institutes of Health) and the National Institutes of Health Roadmap for Medical Research.
Disclaimer: The contents herein are solely the responsibility of the authors and do not necessarily represent the official view of National Center for Research Resources or the National Institutes of Health.
Additional Contributions: Brian Pittman, MS, provided helpful comments on the manuscript.