Flow diagram of the trial. RCT indicates randomized controlled trial.
Differences in depression symptom rating scale scores across time between groups treated with selective serotonin reuptake inhibitors and placebo. A, Best-fit model (logarithmically increasing treatment response) for the difference in standardized effect size between groups (placebo: n = 2254 and selective serotonin reuptake inhibitor: n = 3618) Dotted lines represent 95% confidence intervals. B, Weighted mean difference in scores using the Hamilton Depression Rating Scale (left) (n = 1893-3433) and the Montgomery-Asberg Depression Rating Scale (right) (n = 133-3159). Error bars represent 95% confidence intervals.
Absolute difference in rate (A) and relative risk (B) of response between groups treated with selective serotonin reuptake inhibitors and placebo across time: 50% reduction in Hamilton Depression Rating Scale (HDRS) score (n = 688-1428) (left); Clinical Global Impression moderately or much improved (n = 166-203) (center); and HDRS score less than 7 or less than 8 (n = 684) (right). Error bars represent 95% confidence intervals.
Taylor MJ, Freemantle N, Geddes JR, Bhagwagar Z. Early Onset of Selective Serotonin Reuptake Inhibitor Antidepressant ActionSystematic Review and Meta-analysis. Arch Gen Psychiatry. 2006;63(11):1217-1223. doi:10.1001/archpsyc.63.11.1217
Selective serotonin reuptake inhibitors (SSRIs) are often described as having a delayed onset of effect in the treatment of depression. However, some trials have reported clinical improvement as early as the first week of treatment.
To test the alternative hypotheses of delayed vs early onset of antidepressant action with SSRIs in patients with unipolar depression.
Trials identified by searching CENTRAL, The Cochrane Collaboration database of controlled trials (2005), and the reference lists of identified trials and other systematic reviews.
Randomized controlled trials of SSRIs vs placebo for the treatment of unipolar depression in adults that reported outcomes for at least 2 time points in the first 4 weeks of treatment (50 trials from >500 citations identified). Trials were excluded if limited to participants older than 65 years or specific comorbidities.
Data were extracted on trial design, participant characteristics, and outcomes by a single reviewer.
Pooled estimates of treatment effect on depressive symptom rating scales were calculated for weeks 1 through 6 of treatment. In the primary analysis, the pattern of response seen was tested against alternative models of onset of response. The primary analysis incorporated data from 28 randomized controlled trials (n = 5872). A model of early treatment response best fit the experimental data. Treatment with SSRIs rather than placebo was associated with clinical improvement by the end of the first week of use. A secondary analysis indicated an increased chance of achieving a 50% reduction in Hamilton Depression Rating Scale scores by 1 week (relative risk, 1.64; 95% confidence interval, 1.2-2.25) with SSRI treatment compared with placebo.
Treatment with SSRIs is associated with symptomatic improvement in depression by the end of the first week of use, and the improvement continues at a decreasing rate for at least 6 weeks.
Conventional wisdom suggests that the therapeutic effects of antidepressant agents, such as selective serotonin reuptake inhibitors (SSRIs), take 2 to 3 weeks or more to become evident.1 There are a variety of elegant explanations for this delay, although there is no consensus.2- 4 Clinical experience shows that a proportion of patients report improvement in symptoms earlier than this, but this improvement is often ascribed to a placebo effect.5
Although the earliest studies6,7 of antidepressant drug action indicated rapid response, landmark studies5,8 in the 1980s found evidence that true drug response (as opposed to nonspecific or placebo response) was delayed for 2 weeks or more. Other research groups conversely found evidence of treatment effects as early as the first week of treatment,9,10 leading to ongoing debate as to the origin of these differences.11,12 Although many of the classic studies predate the widespread use of SSRIs, for these agents, the results are also inconsistent. Many trials13,14 do not report statistically significant benefits of SSRIs over placebo until after several weeks of treatment, but some analyses15,16 of randomized controlled trials (RCTs) comparing antidepressant agents with placebo report statistically significant benefits of SSRIs after as little as 1 week of use.
So although it is often held that significant treatment effects are not reliably demonstrated until after several weeks of treatment, there may, in fact, be early benefits.17,18 Apparent early response to active treatment may simply reflect chance, or there may be a true early effect of SSRI treatment that individual studies have usually lacked the statistical power to demonstrate consistently. If SSRIs truly have clinically important early effects, this finding will have considerable significance for physicians and patients and for our understanding of the pathogenesis of depressive disorders.
Meta-analysis can reduce uncertainty about estimates of effect by pooling the results of multiple studies.19 It is well suited to the question of early onset of SSRI effects because there are many RCTs comparing SSRIs with placebo in the treatment of depression, and these are typically of similar design, with repeated (often weekly) assessments using standardized rating scales. We conducted a systematic review and meta-analysis testing the alternative hypotheses of delayed vs early onset of antidepressant action with SSRIs.
Published randomized trials comparing all SSRIs licensed in the United Kingdom (fluoxetine hydrochloride, fluvoxamine maleate, citalopram hydrobromide or hydrochloride, escitalopram oxalate, sertraline hydrochloride, and paroxetine hydrochloride) with placebo in the short-term treatment of unipolar depression in adults were sought by a single reviewer (M.J.T.). Trials were identified by (1) searching CENTRAL, The Cochrane Collaboration database of controlled trials (in The Cochrane Library, issue 1, 2005; using the search terms SSRI or fluoxetine or fluvoxamine or citalopram or escitalopram or sertraline or paroxetine and the key words placebo and depression); (2) the reference lists of identified RCTs; and (3) the reference lists of other systematic reviews.1,15,20,21 The CENTRAL bibliographic database includes the results of group searches of the MEDLINE, EMBASE, CINAHL, PsycLIT, PSYNDEX, and LILACS databases and controlled trials from traditionally hard-to-access sources, such as conference proceedings.22 Identification of eligible trials was further maximized by consulting previous reviews, and searches were not restricted to English language studies. Trials were excluded if they were limited to older adults (age >65 years) or specific comorbidities along with depressive episodes (eg, comorbid substance dependence). In view of the study hypothesis, trials were included only if they reported outcome measures for at least 2 time points in the first 4 weeks of treatment.
Data were extracted for trial design, participant characteristics, and outcomes. Data for depression rating scale scores, or changes in scores, across multiple time points in the first 6 weeks of treatment were used when reported. Group numbers and means were obtained from the text or, when necessary, from figures using a computer program (Dexter; Harvard-Smithsonian Center for Astrophysics, Cambridge, Mass).23 Group standard deviations were extracted from the text or were estimated using standard formulas.24 When data from multiple depression rating scales were available for the same study, for example, the Hamilton Depression Rating Scale (HDRS)25 and the Montgomery-Asberg Depression Rating Scale (MADRS),26 then primary analyses were performed using the scale specified as the primary outcome measure. When rating scale outcomes were reported in a dichotomous manner, for example, the number of participants with a greater than 50% reduction in HDRS scores, these were also extracted. Data from individual patient meta-analyses were used when they provided information not available from primary trial reports.
For each study, at each weekly time point, we calculated the standardized effect size for the treatment and control groups for the treatment severity score. We used generalized estimating equations to examine the effects of trial, treatment, modeling different forms of the treatment effect, accounting for different periods within trials as repeated measures, and defining a new covariance structure for each trial by defining these as random effects. Each trial's specific estimate was weighted by the number of randomized patients in that trial. Treatment effects were described in the models as (1) a sudden-onset treatment response equating to a step function at week 4, (2) a step function at week 4 and a linearly increasing treatment effect, (3) a linearly increasing treatment effect, (4) a logarithmically increasing treatment effect defined as log(week), or (5) a treatment effect described as a function of week.1 An autoregressive variance function was used, and the best-fitting model was selected using the Akaike information criterion. Analyses were conducted using a statistical software program (SAS9.12 Proc Mixed; SAS Institute Inc, Cary, NC).
Secondary analyses were performed for groups of trials that reported total or change scores with standard deviations for the HDRS (17 or 21 items) and the MADRS. Analyses were also performed using data for dichotomous outcomes (response or remission defined by HDRS score or improvement on the Clinical Global Impression). Pooled estimates of weighted mean difference or relative risk (RR) and 95% confidence intervals (CIs) at each time point were calculated using a software program (RevMan version 4.2 for Windows; Cochrane Collaboration, Oxford, England).
The search strategy identified more than 500 citations, from which 107 trial reports were retrieved for more detailed study. From these 107 trials, 50 randomized placebo-controlled trials of SSRIs in the short-term treatment of unipolar depression in adults were identified as being suitable for inclusion (Figure 1).27 In these studies, 6153 participants had been randomized to receive SSRIs and 3968 to receive placebo. Not all of these identified trials reported sufficient data on the standard deviations of results for inclusion in the specific analyses reported herein, so for each analysis, the trials contributing data are identified. The characteristics of these studies are summarized in the Table. These randomized trials were of largely uniform methodological quality, all except one38 reporting the use of a double-blind design. Data were available for intention-to-treat populations using last observation carried forward in approximately half of the cases (15 of 28 RCTs in the primary analysis). The participants in these studies were mainly female, with only a few studies recruiting mostly men.42,45,46 Recruited individuals were largely from outpatient or primary care populations but with inclusion criteria requiring that at least a moderately severe depressive illness be present at baseline.
Overall, 20 articles13- 16,29,32,35- 39,42,43,45- 51 describing 28 trials (placebo: n = 2254 and SSRI: n = 3618) provided data for analysis across multiple periods for symptom score. The best-fitting model according to the Aikaike information criterion was for the logarithmically increasing treatment function (model 4), which was statistically better than the next best–fitting model, which assumed a treatment effect based on the square root of the week (model 5) (χ21 = 8.5; P = .004). In other words, the incremental treatment effect was greatest in the first week, with a gradual decline in the magnitude of incremental benefits week by week (Figure 2A). Model 4 was substantially better than model 3 (linearly increasing treatment effect) (χ21 = 11.1), model 2 (step function at week 4 plus linear effect) (χ21 = 13.4), and model 1 (step function at week 4) (χ21 = 37.8) (P<.001 for all). The estimate of treatment effect by log(week) from the final model was −0.17 (95% CI, −0.13 to −0.21; P<.001). The effect of treatment across time for the standardized effect size is described in Figure 2A.
In the secondary analyses of rating scale scores, 16 articles13- 15,28,32,34,35,38,41,43- 46,48- 50 reporting 25 RCTs (placebo: n = 1623 and SSRI: n = 2260) provided data for estimates of weighted mean difference on the 17- or 21-item HDRS and 10 articles16,28- 30,33,41,44,46,49,52 describing 16 trials (placebo: n = 1382 and SSRI: n = 2058) for the MADRS (Figure 2B). There was also evidence of a difference favoring SSRIs by week 1 and increasing thereafter, with a difference in mean HDRS score of −1.07 (95% CI, −1.69 to −0.44) at week 1 (n = 1893), increasing to −3.3 (95% CI, −4.14 to −2.45) at week 6 (n = 3432), and similarly a difference in mean MADRS score of −0.84 (95% CI, −1.36 to −0.32) at week 1 (n = 2062), increasing to −3.25 (95% CI, −4.11 to −2.39) at week 6 (n = 3159).
A few studies provided data on dichotomous outcomes at multiple time points (Figure 3). Five RCTs31,35,40,45,52 (placebo: n = 574 and SSRI: n = 791) provided data on achieving a 50% reduction in HDRS score (Figure 3). A small absolute difference in the event rates favoring SSRIs is seen by the end of week 1 (0.04; 95% CI, 0.01-0.07) (n = 1428), with a greater difference seen in later weeks (0.12; 95% CI, 0.07-0.18) (n = 1203 at week 6). This is equivalent to a number needed to treat with SSRIs rather than placebo for 1 additional person to achieve a 50% reduction in HDRS score of 25 (95% CI, 15 to 100) at week 1 and reducing to 9 (95% CI, 6 to 15) at week 6. However, when the difference between groups is expressed as an RR, the CIs for weeks 1 to 6 overlap (week 1: RR, 1.64; 95% CI, 1.2 to 2.25; and week 6: RR, 1.32; 95% CI, 1.18 to 1.47).
On another measure of treatment response, a Clinical Global Impression rating of moderately or much improved, only 2 RCTs43,50 (placebo: n = 170 and SSRI: n = 170) provided data (Figure 3). Consistent with the smaller numbers in the analyses, the CIs around estimates of effect are wider here than for the 50% reduction in HDRS. However, a qualitatively similar pattern can be observed with a greater absolute difference between groups later in the study but a largely constant RR throughout.
Two RCTs35,48 (placebo: n = 267 and SSRI: n = 429) provided data on numbers achieving remission, defined as an HDRS score of less than 7 or 8. The pattern of results appears different, with a trend in favor of placebo in week 1 (RR, 0.46; 95% CI, 0.15 to 1.41) (n = 684) but a statistically significant benefit of SSRIs from weeks 2 to 6 (week 6: RR, 1.79; 95% CI, 1.4 to 2.29) (n = 684).
This analysis supports the hypothesis that SSRIs begin to have observable beneficial effects in depression during the first week of treatment. The early treatment effect was seen on the primary outcome of differences in depressive symptom rating scale scores and on a secondary outcome of increased likelihood of achieving a 50% reduction in the HDRS score. The best-fitting model is described using a response variable in which the greatest absolute effect is observed in the first week and incremental responses by week diminish. This model was significantly better than all the other models, apparently excluding the possibility that treatment response from antidepressant drugs is subject to a period of delay.
The underlying placebo response was taken into account by analyzing the difference between the SSRI and placebo groups, and so the differences seen reflect drug-attributable effects. An early simple numerical difference in score on symptom rating scales may reflect an effect on particular symptoms rather than a true antidepressant drug effect. For example, an antidepressant agent with hypnotic effects could improve symptoms of insomnia before other effects were apparent. If this was the case, then it might be expected that a more global measure, such as the Clinical Global Impression ratings of improvement, would not show the same early benefits of treatment compared with placebo, and rating scale scores might show a stepped pattern of improvement after a period of delay. Neither of these patterns are apparent in this analysis, indicating that the benefits of SSRI treatment seen by the end of week 1 are true antidepressant drug effects.
A key question for physicians and patients is whether the early effects of SSRIs are clinically observable. This may indeed be the case. Of the improvement in symptom ratings attributable to treatment, that is, that seen in addition to placebo, approximately one third of the total effect after 6 weeks of treatment is seen in the first week (eg, HDRS: −1.07/−3.30 = 0.32). One week of treatment is also associated with an increased probability of achieving treatment response (RR, 1.64; 95% CI, 1.2 to 2.25). The absolute benefits of treatment for individuals depend on baseline risk; however, these benefits increase further across time, with, at a best estimate, a number needed to treat for 1 additional person to respond to SSRIs rather than placebo by week 6 of approximately one third of that by week 1. Therefore, individuals involved in treatment decisions will continue to need to wait several weeks for key treatment goals, such as remission,53 to be met.
Next, one has to consider the reliability of the estimates of effect calculated. Some sources of potential bias related to incomplete data may be of particular note in meta-analyses, including publication bias, failure to identify all available trials, and incomplete reporting of trial data.54 The search strategy used herein was comprehensive, identifying 6153 patients randomized to receive SSRIs and including data from 3618 patients compared with only 1549 in a recent analysis.55 However, there are likely to be eligible unpublished trials that could not be included in the analysis. Not all studies that took repeated outcome measurements presented the results for all time points. The effect of all these factors may be to overestimate the true treatment effect.56 However, there does not seem to be any compelling reason to expect the pattern of the response across time to differ in a systematic way between included and excluded trials.
A key issue is whether it is biologically plausible for SSRIs to produce early symptomatic response. Inhibition of serotonin transporters occurs rapidly in vitro and in vivo. A delayed response to antidepressant drug treatment is often linked with the time taken for a variety of adaptive neurobiologic changes to occur, for example, desensitization of serotonin 1A receptors and expression of neurotrophic factors such as brain-derived neurotrophic factor.57,58 However, there is evidence in human volunteers of downstream neurochemical59 and psychological60,61 effects of SSRI treatment in the first week of use that are associated in clinical populations with improvement.62,63
It is possible that the shape of the response curve for SSRIs across time is in some way determined by the design of the trials that contributed data. In particular, the use of last-observation-carried-forward analysis to account for missing data, which has become standard in trials of pharmaceutical treatments for depression and is used by approximately half of the trials analyzed herein, could make a constant effect appear log linear. This is because participants who leave a trial early would otherwise on average have been expected to improve regardless of their treatment allocation. Thus, last observation carried forward could add a negative bias to the results across time. An alternative and attractive hypothesis is that the effect of treatment may be constant on a relative scale, and as improvement occurs across time the absolute benefits achieved from each additional week decrease proportionately. Thus, the changes in rating scale scores (Figure 2) would be analogous to the results found for improvement in Clinical Global Impression score (Figure 3), with the absolute benefit changing week to week while the relative benefit is constant across time. However, if either hypothesized effect applies to the present data, neither would indicate that the effects of treatment are delayed for a period, and, thus, they do not undermine the overall conclusion.
Why have many previous analyses not found evidence of antidepressant drug effects as early as the end of week 1? It is not surprising that statistically significant early differences are infrequently seen in individual RCTs powered to demonstrate treatment effects at a trial end point. There is an approximately inverse square law relationship between sample size and effect size such that to have equal power to find an effect of one third the magnitude requires a 9-fold increase in sample size.64,65 A variety of approaches can be taken in trial design to maximize sensitivity to early antidepressant drug effects, including more frequent assessments early in treatment and the use of pattern analysis or survival analytic approaches.8,18,66 The classic studies of Quitkin et al5,8 establishing the delayed-onset hypothesis differed in several respects from those analyzed herein. The participants included some with depressive illnesses of milder severity than those included herein; for example, in the 1987 study, the minimum score on the 21-item HDRS required for inclusion was 10. They also used a range of non-SSRI antidepressant agents, and as is often seen in clinical practice, these were titrated up to therapeutic doses during the first 2 weeks of treatment. This dose titration period, which is much less typical with SSRIs, may be the key difference, particularly because some other analyses using non-SSRIs with more rapid dose titration have found that effects begin to emerge more rapidly.9,12,45 One of these studies45 included a paroxetine treatment arm, finding a time of onset of its effects during the second week of treatment; one issue here may have been its relatively small size, with 82 patients randomized to 3 treatments.
The present analysis does not readily provide answers to some important related questions, such as whether there are particular patient characteristics, such as sex or disease severity at baseline, or features early in treatment that predict better eventual outcome. Studies66 specifically designed to address these questions can provide some answers; however, it may be that new, more sensitive measures of treatment response will have to be developed before we can reliably identify early response in smaller samples. It is intriguing to speculate that the types of investigation that have already revealed subtle early effects of SSRIs in the laboratory60 might be adapted in due course to the clinic. In summary, treatment with SSRIs is associated with symptomatic improvement in depression by the end of the first week of use. An early response is not necessarily a placebo response.
Correspondence: Matthew J. Taylor, MRCPsych, Department of Psychiatry, University of Oxford, Warneford Hospital, Oxford OX3 7JX, England (email@example.com).
Submitted for Publication: November 14, 2005; final revision received February 23, 2006; accepted February 24, 2006.
Author Contributions: Dr Taylor had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.
Financial Disclosure: Dr Taylor has received honoraria from AstraZeneca and Recip for educational services and has attended meetings sponsored by Janssen-Cilag and AstraZeneca. Dr Freemantle has received funding for research and consulting, speakers fees, and travel expenses from a variety of companies that manufacture antidepressant drugs and has also received funding for research from the Department of Health in England, the Medical Research Council, and other research charities. Dr Geddes has received research funding and support from GlaxoSmithKline, Sanofi-Aventis, the UK Government Department of Health, the Medical Research Council, and the Stanley Medical Research Institute. Dr Bhagwagar is on the speaker's panel for Bristol-Myers Squibb and AstraZeneca USA; has been on advisory boards for Bristol-Myers Squibb, Janssen, and Lilly; and has also received funding from the Medical Research Council, the Stanley Medical Research Institute, and the National Alliance for Research on Schizophrenia and Depression.
Funding/Support: This study was supported by a Wellcome Trust Research Training Fellowship (Dr Taylor) and a Young Investigator Award from the National Alliance for Research on Schizophrenia and Depression (Dr Bhagawar).