Ham-D indicates Hamilton Depression Rating Scale; TICS, Telephone Interview for Cognitive Status; PHQ-9, Patient Health Questionnaire–9; MINI, Mini International Neuropsychiatric Interview.
David C. Mohr, Joyce Ho, Jenna Duffecy, Douglas Reifler, Leslie Sokol, Michelle Nicole Burns, Ling Jin, Juned Siddique. Effect of Telephone-Administered vs Face-to-face Cognitive Behavioral Therapy on Adherence to Therapy and Depression Outcomes Among Primary Care PatientsA Randomized Trial. JAMA. 2012;307(21):2278–2285. doi:10.1001/jama.2012.5588
Author Affiliations: Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois (Drs Mohr, Ho, Duffecy, Burns, and Siddique and Ms Jin); Department of Medicine Division of General Internal Medicine, Northwestern University (Dr Reifler); and Beck Institute for Cognitive Therapy, Bala Cynwyd, Pennsylvania (Dr Sokol). Dr Reifler is now with Rosalind Franklin University of Medicine and Science, North Chicago, Illinois.
Context Primary care is the most common site for the treatment of depression. Most depressed patients prefer psychotherapy over antidepressant medications, but access barriers are believed to prevent engagement in and completion of treatment. The telephone has been investigated as a treatment delivery medium to overcome access barriers, but little is known about its efficacy compared with face-to-face treatment delivery.
Objective To examine whether telephone-administered cognitive behavioral therapy (T-CBT) reduces attrition and is not inferior to face-to-face CBT in treating depression among primary care patients.
Design, Setting, and Participants A randomized controlled trial of 325 Chicago-area primary care patients with major depressive disorder, recruited from November 2007 to December 2010.
Interventions Eighteen sessions of T-CBT or face-to-face CBT.
Main Outcome Measures The primary outcome was attrition (completion vs noncompletion) at posttreatment (week 18). Secondary outcomes included masked interviewer-rated depression with the Hamilton Depression Rating Scale (Ham-D) and self-reported depression with the Patient Health Questionnaire–9 (PHQ-9).
Results Significantly fewer participants discontinued T-CBT (n = 34; 20.9%) compared with face-to-face CBT (n = 53; 32.7%; P = .02). Patients showed significant improvement in depression across both treatments (P < .001). There were no significant treatment differences at posttreatment between T-CBT and face-to-face CBT on the Ham-D (P = .22) or the PHQ-9 (P = .89). The intention-to-treat posttreatment effect size on the Ham-D was d = 0.14 (90% CI, −0.05 to 0.33), and for the PHQ-9 it was d = −0.02 (90% CI, −0.20 to 0.17). Both results were within the inferiority margin of d = 0.41, indicating that T-CBT was not inferior to face-to-face CBT. Although participants remained significantly less depressed at 6-month follow-up relative to baseline (P < .001), participants receiving face-to-face CBT were significantly less depressed than those receiving T-CBT on the Ham-D (difference, 2.91; 95% CI, 1.20-4.63; P < .001) and the PHQ-9 (difference, 2.12; 95% CI, 0.68-3.56; P = .004).
Conclusions Among primary care patients with depression, providing CBT over the telephone compared with face-to-face resulted in lower attrition and close to equivalent improvement in depression at posttreatment. At 6-month follow-up, patients remained less depressed relative to baseline; however, those receiving face-to-face CBT were less depressed than those receiving T-CBT. These results indicate that T-CBT improves adherence compared with face-to-face delivery, but at the cost of some increased risk of poorer maintenance of gains after treatment cessation.
Trial Registration clinicaltrials.gov Identifier: NCT00498706
Depression is common, with the 1-year prevalence rate of major depressive disorder estimated at between 6.6% and 10.3% in the general population1,2 and roughly 25% of all primary care visits involving patients with clinically significant levels of depression.3 Psychotherapy is effective at treating depression,4 and most primary care patients prefer psychotherapy to antidepressant medication.5 When referred for psychotherapy, however, only a small percentage of patients follow through.6 Attrition from psychotherapy in randomized controlled trials is often 30% or greater7 and can exceed 50% in clinical practice.8
The discrepancy between patients' preference for psychotherapy and the low rates of initiation and adherence is likely due to access barriers. Approximately 75% of depressed primary care patients report barriers that make it extremely difficult or impossible to attend regular psychotherapy sessions.9,10 These barriers are largely structural and include time constraints, lack of available and accessible services, transportation problems, and cost.
A meta-analysis of trials of telephone-administered psychotherapy found a mean attrition rate of 7.6%, suggesting that telephone delivery may reduce attrition.7 Telephone care has also been incorporated into aspects of collaborative care models that integrate mental health specialists into primary care settings.11 Although telephone-administered psychotherapy for depression has been tested as a tool to deliver care and overcome access barriers within primary care, the underlying assumptions that it is as effective as face-to-face care and that it reduces attrition have not been examined.
We describe a randomized trial comparing standard face-to-face cognitive behavioral therapy (CBT) vs a telephone-administered cognitive behavioral therapy (T-CBT) for the treatment of depression in primary care. It was hypothesized that T-CBT would produce lower levels of attrition and secondarily that it would not be inferior in efficacy to face-to-face CBT.
Participants were recruited from November 2007 to December 2010 from general internal medicine clinics in the Northwestern Medical Faculty Foundation and Northwestern Memorial Physician's Group and from 4 primary care clinic members of Northwestern's Practice-Based Research Network in the Chicago area.
Participants were included if they met criteria for major depressive disorder, had a Hamilton Depression Rating Scale (Ham-D) score greater than or equal to 16, were aged 18 years or older, could speak and read English, and were able to participate in face-to-face or telephone therapy. Participants were excluded if they had visual or hearing impairments that would prevent participation; met diagnostic criteria for a severe psychiatric disorder (eg, bipolar disorder, psychotic disorders) or depression of organic etiology (eg, hypothyroidism) for which psychotherapy would be inappropriate; reported alcohol or substance abuse severe enough that 2 psychologists (D.C.M. and J.H.) agreed psychotherapy would be inappropriate; met criteria for dementia by scoring less than 25 on the Telephone Interview for Cognitive Status12; exhibited severe suicidality, including a plan and intent or a suicide attempt in the past 5 years; were receiving or planning to receive individual psychotherapy; or had initiated antidepressant pharmacotherapy in the previous 10 days. Race and ethnicity were measured by self-report to characterize the sample.
This trial was approved by the Northwestern University institutional review board and was monitored by an independent data and safety monitoring board. In accordance with the Northwestern IRB-approved protocol, participants were sent a consent form, which was reviewed over the telephone with research staff prior to the eligibility interview. Patients signed and returned the consent form prior to randomization.
An independent statistician used computer-generated randomization with a 1:1 ratio, stratified by antidepressant status and therapist, with block size of 4 within each stratum. To prevent allocation bias, randomization was conducted after entry criteria were confirmed. Clinical evaluators, who were masked to treatment assignment, enrolled and evaluated participants; if they became unmasked, participants were reassigned to another masked evaluator.
Face-to-face CBT and T-CBT used the same CBT protocol,13 with treatment delivery medium being the only factor that varied between conditions. This treatment model has been adapted and validated for telephone administration.14,15 Participants received 18 45-minute sessions: 2 sessions weekly for the first 2 weeks, followed by 12 weekly sessions, with 2 final booster sessions during 4 weeks. All participants received a patient workbook that included 8 chapters covering CBT concepts, including behavioral activation, cognitive restructuring, and social support, along with 5 optional modules that covered common comorbidities and treatment content, including anxiety and worry, relaxation training, communication and assertiveness training, anger management, and insomnia.14,15
T-CBT telephone calls were initiated by the therapist. Nine therapists, all PhD-level psychologists, provided both face-to-face CBT and T-CBT to eliminate therapist effects. Face-to-face CBT was provided in the Preventive Medicine clinic at Northwestern University, located in the same medical center as the primary recruitment clinics. T-CBT was provided entirely over the telephone. Specific rules to ensure privacy and safety were discussed in the first session, such as being in a private place during telephone calls and not engaging in therapy while driving. Protocols were in place to ensure safety, which could include calling local emergency personnel to conduct a health and safety check in the event of severe suicidality.
All therapists received 2 days of initial training, followed by weekly supervised training from the Beck Institute Director of Education (L.S.) until the therapists reached the competence criterion defined as consistent scores of greater than or equal to 40 on the Cognitive Therapy Scale,16 at which point they began treating study participants. Once trained, therapists received weekly supervision by the Beck Education Director or a Beck-certified psychologist for at least 6 months, which could then be reduced to once every 2 to 3 weeks, as determined by the supervisor. All sessions were audiorecorded and 8% were rated on the Cognitive Therapy Scale for fidelity. Fidelity ratings were used in the supervision of the therapists.
This trial is focused on the treatment delivery medium. To prevent confounding through differences in the management of nonadherent patients across treatment arms, the therapist protocol included specific instructions for handling missed sessions and cancellations. A session was considered missed if less than 24 hours' notice was given. If patients missed a session, they received 2 therapist telephone calls followed by a letter, after which, if still nonresponsive, the patients were determined to have discontinued treatment. Participants did not pay for treatment.
The primary outcome was adherence to treatment, defined as attending therapy sessions. Participants were permitted to reschedule sessions if they notified their therapist 24 hours before a cancellation. The primary outcome was dichotomized as completion vs noncompletion of 18 sessions. Secondarily, we examined failure to engage in treatment (completion of ≤4 sessions), failure to complete (>4 sessions but <18 sessions), and number of sessions completed.
The secondary outcome, depression severity, was measured with the interviewer-rated 17-item Ham-D17 and the self-reported Patient Health Questionnaire–9 (PHQ-9).18 Psychiatric diagnoses, including major depressive disorder, were evaluated with the Mini International Neuropsychiatric Interview.19 Remission used the Ham-D abbreviated 7-item scale criterion, whereas response was a 50% decrease in Ham-D symptoms.20 To eliminate potential loss to follow-up because of access barriers, all interview assessments were conducted by telephone, and self-reports were administered online or by mail. Antidepressant use was assessed by interview. Before randomization, participants reported their treatment preference (face-to-face, telephone, or no preference). To identify any potential effects of systematic therapist expectation bias that might occur as a result of crossing therapist by treatment arm, therapists rated their expectations for patient outcome on a 7-point Likert scale after the second session.
Clinical evaluators, who had at least a bachelor's degree, received no fewer than 16 hours of training on the Mini International Neuropsychiatric Interview and Ham-D, including receiving didactic instruction, role playing, and performing ratings on a library of existing taped interviews. All study interviews were audiotaped and were supervised by a psychologist until reliable proficiency was established. Supervision continued thereafter every 1 to 2 weeks. One audiotape was randomly selected every 1 to 2 weeks for calibration ratings to ensure interrater reliability. The mean interclass correlations were 0.96. All training and supervision were performed by a licensed PhD-level psychologist (J.H.).
The study was designed to enroll 322 participants, resulting in 90% power for a 2-sided test at α = .05 to detect a difference in nonadherence rates of 15% vs 30%.21 Although a meta-analysis found an attrition rate of 7.4% in telephone psychotherapy, 15% was used because heterogeneity was high and many trials were small.7 Attrition of 30% and greater is commonly observed in trials of face-to-face psychotherapy.7 Differences in baseline characteristics by treatment group, nonadherence rates, treatment preference, and posttreatment major depressive disorder were analyzed with t tests for continuous variables and χ2 tests for categorical variables.
Although the rate of missing depression outcomes was low at each postbaseline point, ranging from 9% to 22%, we multiply imputed missing depression scores and generated 20 imputations for each missing value, using the R package MICE,22 in which incomplete variables are imputed one at a time according to a set of conditional densities.23 Imputations were conducted separately by treatment group, and every imputation model was conditioned on a large number of relevant variables, including depression scores, demographics, and total number of CBT sessions attended. Using an imputation model that includes many auxiliary variables preserves relationships among variables and provides more precise and accurate imputations.24 In particular, by including the number of CBT sessions attended, we were able to preserve the relationship between amount of treatment received and depression symptoms among patients with missing depression scores.
Longitudinal depression scores were modeled with repeated-measures linear regression models as implemented in the SAS procedure PROC MIXED (version 9.02). Time was treated as a categorical variable to account for nonlinear effects of time, and an unstructured covariance matrix was assumed. Differences by treatment group in major depressive disorder and remission at 3- and 6-month follow-up were assessed with logistic regression, as was treatment response. These analyses were performed on each of the 20 imputed data sets and results were combined by using the rules of Rubin.25
During the trial, the data and safety monitoring board recommended changes in the planned analyses to replace the originally proposed analyses with a noninferiority analysis for depression outcomes. The noninferiority margin was not determined before the initiation of the trial, but it was determined before any analyses of outcome data and with the full knowledge and approval of oversight bodies. Noninferiority is established by showing that the true difference between 2 treatment arms is likely to be smaller than a prespecified noninferiority margin that separates clinically important from clinically negligible (acceptable) differences.26,27
The clinical community has generally accepted 30% to 50% of the difference between treatment and control conditions as an acceptable definition for the noninferiority margin,27,28 and noninferiority trials of pharmacologic treatments have used the 50% criterion.29- 31 A recent meta-analysis of CBT found an overall effect size of d = 0.82.32 Accordingly, we used d = 0.41 as the noninferiority criterion. A 1-sided test at α = .05 of whether the difference in treatment groups is less than the noninferiority margin is equivalent to testing whether a 2-sided 90% CI around the treatment difference falls within the noninferiority margin. Accordingly, we calculated 90% CIs and rejected the null hypothesis of inferiority (in favor of noninferiority) if the upper bound of the CI was less than d = 0.41.
To assess variable antidepressant use across treatment arms, a repeated-measures analysis of the binary antidepressant use outcomes over time33 was performed. To evaluate whether antidepressant use had differential effects by treatment group, antidepressant use and its interaction with treatment was included in our repeated-measures regression models for Ham-D and PHQ-9.
The flow of patients through the study is depicted in the Figure. Table 1 summarizes the baseline demographics and psychiatric characteristics of the participants. There were no significant differences in these baseline variables across treatment arms. There was no significant difference in therapist expectations of participant outcomes across treatments (P = .83).
Significantly fewer participants discontinued T-CBT (n = 34; 20.9%) before session 18 compared with face-to-face CBT (n = 53; 32.7%; P = .02). Attrition before week 5 was significantly lower in T-CBT (n = 7; 4.3%) than in face-to-face CBT (n = 21; 13.0%; P = .006), but there was no significant difference in attrition between sessions 5 and 18 (P = .31). T-CBT patients attended significantly more sessions (mean, 15.5; median, 17; SD, 4.4; interquartile range, 16-18) than those receiving face-to-face CBT (mean [SD], 13.7 [6.1]; median [IQR], 17 [11-18]; P = .003).
Table 2 shows the intention-to-treat depression outcomes on the Ham-D and the PHQ-9 according to multiply imputed values. In terms of changes from baseline, patients demonstrated significant improvements at posttreatment in both face-to-face (Ham-D Δ = −10.32; PHQ-9 Δ = −10.03; P < .001) and T-CBT (Ham-D Δ = −9.25; PHQ-9 Δ = −10.12; P < .001). At 6-month follow-up, changes from baseline remained significant in face-to-face CBT (Ham-D Δ = −10.69; PHQ-9 Δ = −10.46; P < .001) and T-CBT (Ham-D Δ = −7.78; PHQ-9 Δ = −8.35; P < .001). There were no significant posttreatment differences between T-CBT and face-to-face CBT on the Ham-D (difference = 1.07; P = .22) or PHQ-9 (difference = −0.09; P = .89), although this difference was significant at 6-month follow-up on both the Ham-D (difference = 2.91; P < .001) and PHQ-9 (difference = 2.12; P = .004).
Among T-CBT patients, 23% met criteria for major depressive disorder at posttreatment compared with 25% in face-to-face CBT (P = .69). At 6-month follow-up, major depressive disorder rates were 29% and 26% in the T-CBT and face-to-face CBT groups, respectively (P = .57). At posttreatment, 27% of both T-CBT and face-to-face CBT participants met the Ham-D abbreviated 7-item scale criterion for full remission (P = .95). By 6-month follow-up, 19% of T-CBT vs 32% of face-to-face CBT participants were fully remitted (P = .009). At posttreatment, 44% of T-CBT and 49% of face-to-face CBT participants met the response to treatment criterion of a 50% decrease on the HAM-D (P = .40).
The posttreatment effect size was d = 0.14 (90% CI −0.05 to 0.33) on the Ham-D and −0.02 (90% CI −0.20 to 0.17) on the PHQ-9. Both of these values were within the inferiority margin of d = 0.41, indicating that T-CBT was not inferior to face-to-face CBT at the end of treatment. The 6-month follow-up effect size was d = 0.37 (90% CI 0.19-0.55) on the Ham-D and 0.33 (90% CI, 0.14-0.52) on the PHQ-9. Both of these CIs were outside the inferiority margin, indicating that T-CBT was inferior to face-to-face CBT at 6-month follow-up.
At baseline, 52 (32%) face-to-face CBT patients and 54 (33%) T-CBT patients were receiving antidepressants. During the course of the study, antidepressant use did not change significantly (P = .41), was not different across treatment arms (P = .70), and was not associated with depression outcomes in either the face-to-face CBT (P = .92) or T-CBT (P = .83) patients. Baseline antidepressant use was also not associated with discontinuation of treatment in either face-to-face CBT (P = .29) or T-CBT (P = .91).
Before randomization, 117 (36.6%) participants indicated they would prefer face-to-face CBT, 89 (27.8%) preferred T-CBT, 114 (35.6%) indicated no preference, and 5 did not answer (P = .60). Receiving or not receiving one's preferred treatment was not statistically associated with adherence (P = .39) or depression outcomes (P = .76 for Ham-D; P = .18 for PHQ-9).
There were no significant 2-way (demographic × treatment) or 3-way (demographic × treatment × time) effects for age, sex, race, education, or marital status on depression. Age, sex, race, marital status, and antidepressant status at baseline were unrelated to attrition. Education was significantly related to attrition (P = .02); participants with advanced degrees were more likely to complete treatment than those with some college education (P < .05), but there were no other significant differences across education groups.
There were no adverse events (eg, suicide, suicide attempt, psychiatric hospitalization) for either treatment condition.
This study confirmed that T-CBT produces significantly lower attrition rates compared with face-to-face CBT among depressed primary care patients, suggesting that telephone delivery can overcome barriers to adhering to face-to-face treatment. The effect of telephone administration on adherence appears to occur during the initial engagement period. These effects may be due to the capacity of telephone delivery to overcome barriers and patient ambivalence toward treatment. Access barriers likely exert their effects early in treatment, and thus the effect of the telephone on overcoming those barriers is most prominent in the first sessions. Patients who continue in face-to-face treatment for 5 or more sessions likely have fewer access barriers or are more motivated, and thus use of the telephone likely reduces attrition less during that period.
This trial found that T-CBT was as effective in reducing depressive symptoms as traditional face-to-face CBT at posttreatment, supporting our hypothesis. However, face-to-face treatment was significantly superior to T-CBT during the 6-month follow-up period. The size of these differences in group analyses did not reach the PHQ-9 criterion of 5 or more points for clinical significance35 but was close to the Ham-D criterion of 3 points.34 However, it is likely that these effects are driven by subgroups who show greater risk of failure to maintain therapeutic gains. This effect may be an artifact of T-CBT's capacity to differentially retain patients with characteristics that leave them at greater risk for posttreatment deterioration.
If the finding that face-to-face treatment produces better maintenance of gains after treatment cessation is not an artifact, it suggests that longer-term follow-up is critical in research examining the effects of tele–mental health interventions, and telemedicine more broadly. There are at least 2 possible reasons that some patients may have poorer posttreatment outcomes in T-CBT. One is that the requirement that patients in face-to-face therapy physically attend sessions may serve as a form of behavioral activation. That is, that act of physically attending treatment may be therapeutic in a manner that promotes maintenance of gains in some patients. The other possibility is that the physical presence of the therapist, although not having an effect during treatment, contributes to the maintenance of gains, which suggests that human contact may have unique qualities that exert their effects and contribute to resilience after contact has ceased.
The patient-clinician interaction can be conceptualized as a variety of cues and information transmitted through different verbal and nonverbal channels, each of which carries some unique information. Various telemedicine media (eg, telephone, videoconference, e-mail) limit the effectiveness of specific cues,36 which may have both disadvantages and benefits. For example, in the context of a positive relationship, individuals are likely to make positive attributions in the absence of cues (for example in the absence of visual cues, patients would likely imagine a provider as being more like themselves and more sympathetic than the provider actually is).37 However, if difficulties or suspicions arise, attributions regarding missing cues can become overly negative (eg, imagining the clinician to be more uncaring than a complete set of cues would suggest), which may reduce the patient's commitment to treatment. Thus, future research should not only examine overall effects of the use of treatment delivery media on patient-clinician relationships and clinical outcomes but also identify the circumstances and patients for which specific media are most advantageous.
The findings of this study suggest that telephone-delivered care has both advantages and disadvantages. The acceptability of delivering care over the telephone is growing, increasing the potential for individuals to continue with treatment. A survey of primary care patients found that nearly 19% of patients who desired behavioral and psychological care wanted telephone treatment, and an additional 44% would consider it.38 The data from this trial suggest that preferences for delivery medium do not affect adherence or outcome. The telephone offers the opportunity to extend care to populations that are difficult to reach, such as rural populations, patients with chronic illnesses and disabilities, and individuals who otherwise have barriers to treatment.14,39 Telephone psychotherapy would also meet at least 1 of the key attributes of the advanced medical home, namely, to “provide enhanced and convenient access to care not only through face-to-face visits but also via telephone, e-mail, and other modes of communication.”40 However, the increased risk of posttreatment deterioration in telephone-delivered treatment relative to face-to-face treatment underscores the importance of continued monitoring of depressive symptoms even after successful treatment.
Several limitations and caveats exist in interpreting these data. First, this efficacy trial used CBT for depression. Although we are unaware of reasons why these results cannot be generalized to other forms of psychotherapy and other common mental health problems such as anxiety disorders, we cannot rule out the possibility that these findings are treatment or disorder specific. Second, this sample was fairly well educated, potentially limiting generalizability to lower socioeconomic groups. Third, it was not possible to mask patients to treatment arm.
Our findings demonstrate that T-CBT can reduce attrition and is as effective as face-to-face CBT at posttreatment for depression among primary care patients. However, the increased adherence associated with T-CBT may come at the cost of some increased risk of poorer outcomes after treatment cessation.
Corresponding Author: David C. Mohr, PhD, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, 680 N Lakeshore Dr, Ste 1400, Chicago, IL 60611 (firstname.lastname@example.org).
Author Contributions: Dr Mohr had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.
Study concept and design: Mohr, Duffecy, Sokol.
Acquisition of data: Mohr, Ho, Duffecy, Reifler, Burns.
Analysis and interpretation of data: Mohr, Ho, Duffecy, Jin, Siddique.
Drafting of the manuscript: Mohr, Ho, Duffecy, Jin, Siddique.
Critical revision of the manuscript for important intellectual content: Mohr, Ho, Duffecy, Reifler, Sokol, Burns, Siddique.
Statistical analysis: Ho, Jin, Siddique.
Obtained funding: Mohr.
Administrative, technical, or material support: Mohr, Ho, Duffecy, Reifler, Burns.
Study supervision: Mohr, Ho, Duffecy, Reifler, Sokol.
Conflict of Interest Disclosures: All authors have completed and submitted the ICMJE Form for Disclosure of Potential Conflicts of Interest and none were reported.
Funding/Support: This study was funded by research grant NIMH R01-MH059708 to Dr Mohr.
Role of the Sponsor: The funding agency had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; and preparation, review, or approval of the manuscript.
Online-Only Material: The Author Video Interview is available
Additional Contributions: We thank Greg Simon, MD, Group Health Research Institute; Amber Bauer-Gambla, private practice; Kelly Glazer-Baron, PhD, Northwestern University; Sarah Kinsinger, PhD, Northwestern University; Kenneth Lehman, PhD, Northwestern University (now at Birmingham, Alabama Veterans Administration Medical Center); Katherine Schaefer-Berg, PhD, Depression and Anxiety Specialty Clinic of Chicago; Vicky Singh, PhD, Northwestern University; and Paula Young, PhD, The Family Institute at Northwestern University. All therapists were compensated by the study for treatment provided. Dr Simon was also compensated for his role as chair of the data and safety monitoring board.