Background
Previous studies have demonstrated the short-term efficacy of pharmacotherapy and behavior therapy for attention-deficit/hyperactivity disorder (ADHD), but no longer-term (ie, >4 months) investigations have compared these 2 treatments or their combination.
Methods
A group of 579 children with ADHD Combined Type, aged 7 to 9.9 years, were assigned to 14 months of medication management (titration followed by monthly visits); intensive behavioral treatment (parent, school, and child components, with therapist involvement gradually reduced over time); the two combined; or standard community care (treatments by community providers). Outcomes were assessed in multiple domains before and during treatment and at treatment end point (with the combined treatment and medication management groups continuing medication at all assessment points). Data were analyzed through intent-to-treat random-effects regression procedures.
Results
All 4 groups showed sizable reductions in symptoms over time, with significant differences among them in degrees of change. For most ADHD symptoms, children in the combined treatment and medication management groups showed significantly greater improvement than those given intensive behavioral treatment and community care. Combined and medication management treatments did not differ significantly on any direct comparisons, but in several instances (oppositional/aggressive symptoms, internalizing symptoms, teacher-rated social skills, parent-child relations, and reading achievement) combined treatment proved superior to intensive behavioral treatment and/or community care while medication management did not. Study medication strategies were superior to community care treatments, despite the fact that two thirds of community-treated subjects received medication during the study period.
Conclusions
For ADHD symptoms, our carefully crafted medication management was superior to behavioral treatment and to routine community care that included medication. Our combined treatment did not yield significantly greater benefits than medication management for core ADHD symptoms, but may have provided modest advantages for non-ADHD symptom and positive functioning outcomes.
ATTENTION-deficit/hyper-activity disorder (ADHD) occurs in 3% to 5% of school-aged children, accounts for as many as 30% to 50% of child referrals to mental health services,1,2 and results in substantial impairment in peer, family, and academic functioning.2,3 Although benefits of short-term treatments (principally stimulants, behavior therapy, and both combined) have been well documented,4-8 few controlled studies have examined effectiveness beyond 3 months. Two exceptions9,10 suggested that stimulant effects can persist 1 to 2 years when taken faithfully. However, the generalizability and usefulness of these 2 studies are constrained by sample sociodemographic homogeneity, exclusion of stimulant nonresponders, lack of unimodal comparisons (medication management vs behavioral interventions), and inadequate statistical power to examine subject factors that predict treatment response.9,10
Given public concerns regarding stimulant treatment,11,12 wide variations in treatment practices,13 and lack of evidence to guide long-term treatments of this chronic disorder, in 1992 the National Institute of Mental Health and Department of Education cosponsored a randomized clinical trial, the Multimodal Treatment Study of Children With ADHD (MTA). Its rationale14 and methods15-19 have been detailed previously.
The MTA Cooperative Group posed 3 questions: How do long-term medication and behavioral treatments compare with one another? Are there additional benefits when they are used together? What is the effectiveness of systematic, carefully delivered treatments vs routine community care? This report constitutes the first-ever description of the relative effectiveness of these treatments through 14 months, using an intent-to-treat analytic strategy with random-effects regression techniques.20,21
Participants, materials, and methods
Recruitment procedures and sample characteristics
Recruitment, screening, and selection procedures aimed to collect a carefully diagnosed sample of impaired children with ADHD and a wide range of comorbid conditions and demographic characteristics representative of patients seen in clinical practice. To minimize site-specific referral biases, sites' referral sources necessarily included mental health settings, pediatricians, advertisements, and school notices. For eligibility, children (of either sex) were between ages 7 and 9.9 years, in grades 1 through 4, and in residence with the same primary caretaker(s) for the last 6 months or longer. All met the DSM-IV criteria for ADHD Combined Type (the most common subtype in this age group22), using the Diagnostic Interview Schedule for Children (DISC), parent report, version 3.0,23 supplemented with up to 2 symptoms identified by children's teachers for cases falling just below the DISC diagnostic threshold. Exclusion criteria were limited to situations that would prevent families' full participation in assessments or treatment, or that might require additional treatments incompatible with study treatments (Table 1).16-18 The presence of comorbid conditions, such as oppositional-defiant disorder (ODD), conduct disorder (CD), internalizing disorders, or specific learning disabilities, did not lead to exclusions per se; an important aim of the study was to examine their interactions with treatment outcomes.
A 4-phase entry procedure (Table 2) screened potential participants, determined ADHD diagnostic status, and assessed each recruit prior to randomization.17,18 Ninety-five percent of subjects entering the fourth-phase baseline assessment were randomized; they did not differ from initial telephone screen subjects in parental education, ethnicity, or sex. Informed consent (parental permission and child assent) was obtained for all participating families, using forms approved by both local institutional review boards and the National Institutes of Health Office for Protection From Research Risks, Bethesda, Md.
In a 4-group parallel design, children were assigned randomly to medication management, behavioral treatment, combined treatment, or community care for 14 months. (In the method articles for this study, treatment assignments were referred to as medication, psychosocial treatment, combined treatment, and community-treatment/assessment and referral. To reflect more accurately the actual treatments, we have changed the terminology for all outcome articles to medication management, behavioral treatment, combined treatment, and community care.)Rather than testing fixed single treatments, we designed each MTA treatment arm as a management strategy, such that each was sufficiently robust and flexible to stand on its own and to respond to individual patients' clinical needs throughout the study.
Power analyses indicated 24 subjects per treatment condition per site (96 families at each of 6 sites) for comparisons of core ADHD symptoms between any 2 treatment arms (critical effect size, 0.4; power, 0.81, with a 5% 2-tailed test).17,24,25 Accordingly, 576 subjects were required; 579 were recruited. Sample demographics, mean scores on standardized Conners teacher and parent scales,26 comorbidity profile, and impairment ratings (Columbia Impairment Scale) are typical of other ADHD samples this age26-30 (Table 3).
Randomization was done centrally by the National Institute of Mental Health Data Center, Rockville, Md, stratified by site in blocks of 16 (4 to each group). Sealed, ordered envelopes were sent to sites for successive entries. Treatment assignment was concealed until the family confirmed agreement to accept randomization.
Principal components analyses narrowed the selection of outcome domains from the comprehensive assessment battery,28 eliminating redundant measures. Six major outcome domains were identified. Within domains, the measures loading highest from each informant were selected: (1) ADHD symptoms were measured with inattention and hyperactivity-impulsivity subscales of parent- teacher-completed SNAP ratings (an acronym denoting the names of the instrument's developers)31; (2) oppositional/aggressive symptoms were measured with a parent and teacher SNAP oppositional-defiant disorder subscale; (3) social skills were measured with a parent- and teacher-completed subscale from the Social Skills Rating System (SSRS)32; (4) internalizing symptoms (anxiety and depression) were measured with an internalizing subscale from parent- and teacher-completed SSRS,32 and children's self-ratings on the Multidimensional Anxiety Scale for Children (MASC)33; (5) parent-child relations were measured with 2 composite scales from a parent-child relationship questionnaire; and (6) academic achievement was measured with 3 subscales from the Wechsler Individual Achievement Test34 (reading, math, and spelling). These measures show acceptable psychometric properties, and are reviewed extensively elsewhere.31-34 Subjects were assessed at baseline, and at 3, 9, and 14 months (treatment end point). Baseline and end-point values for the outcome domains are presented in Table 4.
The open parent, teacher, and child ratings for domains 1 through 5 were augmented by blinded ratings of school-based ADHD and oppositional/aggressive symptoms using the Abikoff Classroom Observational System35 and social skills and peer relations using peer sociometric procedures. Finally, we videotaped parent-child interactions during standardized laboratory tasks performed by raters blind to treatment condition. Videotaped interactions, parental characteristics, family functioning, cognitive ability, general impairment, and services use, attitudes, and barriers18 will be reported in later articles.
Treatment conditions and protocols
The 3 MTA-delivered treatment strategies were chosen for well-established efficacy (at least in the short term), portability, sufficient intensity to stand alone, and distinction from each other. For all 3 arms, subjects had up to 8 additional sessions provided when needed to address clinical emergencies or instances of possible study attrition.
Behavioral treatment included parent training, child-focused treatment, and a school-based intervention organized and integrated with the school year. The parent training, based on work by Barkley36 and Forehand and MacMahon,37 involved 27 group (6 families per group) and 8 individual sessions per family. It began weekly on randomization, concurrent with biweekly teacher consultation; both were tapered over time. The same therapist-consultant conducted parent training and teacher consultation, with each therapist-consultant having a caseload of 12 families
The child-focused treatment was a summer treatment program (STP) developed by Pelham38 as a therapeutic summer camp. The 8-week, 5-days-per-week, 9-hours per-day STP employed intensive behavioral interventions administered by counselors/aides, supervised by the same teacher-consultants who performed parent training and teacher consultation. Behavioral interventions were delivered in group-based recreational settings, and included a point system tied to specific rewards, time out, social reinforcement, modeling, group problem-solving, sports skills, and social skills training. Summer treatment program classrooms provided individualized academic skills practice and reinforcement of appropriate classroom behavior.
The school-based treatment had 2 components: 10 to 16 sessions of biweekly teacher consultation focused on classroom behavior management strategies8 and 12 weeks (60 school days) of a part-time, behaviorally trained, paraprofessional aide working directly with the child (methods adapted from Swanson31). The aides had been STP counselors, and the program continued in the fall classroom, which helped to generalize STP gains to classrooms. Throughout the school year, a daily report card linked home and school. The daily report card8,39 was a 1-page teacher-completed checklist of the child's successes on specific preselected behaviors, and was brought home daily by the child to be reinforced by the parent with home-based rewards (eg, television time, snacks).
The treatments outlined above constitute the maximum "dose" of behavioral treatment children/families could receive, given perfect attendance and compliance. In practice, families (given both behavioral and combined treatment) attended an average of 77.8% of parent training sessions and 36.2 of 40 possible STP days. The school component averaged 10.7 teacher consultation visits and 47.6 days (of 60 possible) of work with classroom aides. Sites differed significantly in the extent of attendance for 2 components (parent training and classroom aides), but there were no significant differences between behavioral and combined treatment, either within or across sites, in degree of attendance/implemenation for any behavioral components (tables available from the authors on request). As described in our companion report, a summary measure of attendance/compliance for all behavioral treatment components was unrelated to treatment outcomes; likewise, attendance did not mediate any site × treatment interactions on outcomes.40
Consistent with the time-limited involvement of providers in clinical practice, the involvement of our personnel in the delivery of the behavioral treatments was gradually tapered, with the goal that parents would increasingly manage the child's behavioral treatment. In most cases, therapist contact with parents had been reduced to once-monthly sessions or stopped altogether prior to end-point assessment.
Medication management15 started with a 28-day, double-blind, daily-switch titration of methylphenidate hydrochloride, using 5 randomly ordered repeats each of placebo, 5 mg, 10 mg, and 15 or 20 mg (higher doses for children >25 kg). Each of the doses listed was given at breakfast and lunch, with a half-dose (rounded to the nearest 5 mg) in the afternoon. Cross-site teams of experienced clinicians blindly reviewed graphs portraying parent and teacher ratings of responses to each of the 4 doses and by consensus selected each child's best dose. After agreement on best dose, the blind was broken, and the agreed-on dose (if not placebo) became the subject's initial maintenance dose. For subjects not obtaining an adequate response to methylphenidate during titration, alternate medications were titrated openly in the following order until a satisfactory one was found: dextroamphetamine, pemoline, imipramine, and, if necessary, others approved by a cross-site panel. Of 289 subjects assigned to medication management (n = 144) and combined treatment (n = 145) for initial titration, 18 had no titration: 17 because they refused the entire medication component and 1 who moved away. An additional 15 subjects started but did not complete titration: 4 because of side effects, 7 because of difficulties tolerating the titration procedures, and 4 who supplied inadequate data. Thus, 256 subjects (88.6%) successfully completed titration; of these, 198 (68.5%) of 289 subjects were assigned to an individually titrated best dose of methylphenidate, with average initial doses of 30.5 mg/d. The remaining titration completers were either openly titrated to dextroamphetamine (n = 26) because of unsatisfactory methylphenidate response or initially given no medication (n = 32) because of a robust placebo response (2 of these placebo responders failed to cooperate further after titration).
During half-hour monthly medication maintenance visits, pharmacotherapists provided support, encouragement, and practical advice (but not behavioral treatment). When deemed necessary by the clinician or requested by the parent, readings from an approved list were supplied. After careful review of parent- and teacher-provided information, pharmacotherapists could make algorithm-guided dose adjustments of ±10 mg/d of methylphenidate (or an equipotent amount if the subject was taking another drug). Additional adjustments beyond ±10 mg/d could be authorized by a cross-site panel of experienced pharmacotherapists. In general, dose reductions were allowed only to address dose-related side effects.
By study end, 212 (73.4%) of the 289 subjects given medication management and combined treatment were being successfully maintained on methylphenidate, 30 (10.4%) on dextroamphetamine, 4 (1.4%) on pemoline, 3 (1.0%) on imipramine, 1 (0.3%) on buproprion, 1 (0.3%) on haloperidol, and 18 (3.1%) on no medication (combined treatment [14 subjects] and medication management [4 subjects]) (3.1%), with 20 persistently unmedicated (18 since study outset, 2 during maintenance). There were no differences between the medication management and combined treatment groups in the proportion of subjects maintained on the various medications.
Side effects were monitored monthly—not present, mild, moderate, or severe—using the parent-completed 13-item Pittsburgh Side Effects Rating Scale,41 reviewed by the pharmacotherapist. At end point, 245 combined treatment/medication management families provided information on side effects, with some reporting more than one: 88 (35.9) reported no side effects, 122 (49.8%) reported mild side effects only, 28 (11.4%) reported moderate side effects, and 7 (2.9%) reported severe side effects. These figures may overestimate side effects, because 6 of 11 reported severe side effects (depression, worrying, or irritability) could have been due to nonmedication factors.
Combined treatment provided all treatments outlined above for medication management and behavioral treatment; namely, titration followed by monthly medication maintenance, parent group and individual sessions, teacher consultation, STP, and the classroom aide. However, to approximate clinical practice, we integrated the 2 treatment modalities; information was regularly shared between the teacher-consultant and pharmacotherapist and used to guide overall decisions. Manualized guidelines determined if and when an adjustment in one treatment should be made, vs intervening first with the other.16,17 Consequently, the multimodal combination was not the simple addition of the two unimodal treatments. Consistent with the literature,42 by treatment end point combined treatment subjects received lower total daily doses of methylphenidate (31.2 mg) than medication management subjects (37.7 mg). Though sites differed significantly in total daily methylphenidate doses (range of mean doses per site, 30.2-41.3 mg), there were no site × treatment interactions in total daily doses (treatment group: F = 14.6, P = .01; site: F = 3.2, P = .002; site × treatment group: F11,200 = 1.2, P = .24).
Community care participants received none of our treatments, but were provided a report of their initial study assessments, along with a list of community mental health resources. They were subsequently reassessed in parallel with participants in our 3 treatment arms. At each assessment point, the types of treatments they obtained in the community were documented. Most community care subjects (n = 97 [67.4%]) received ADHD medications (principally one of the stimulants) from their own provider during the 14 months: methylphenidate (n = 84), pemoline (n = 7), amphetamine (n = 6), tricyclics (n = 6), clonidine/guanfacine (n = 4), and/or buproprion (n = 1) (10 subjects received more than 1 medication). In addition, 16 of these 97 children were treated by their physician with another antidepressant (not counting tricyclics or buproprion). For those treated with methylphenidate, the mean total daily dose at study completion was 22.6 mg, averaging 2.3 doses per day (vs 3.0 doses per day for MTA-treated subjects). Information concerning community care psychotherapeutic treatments has not yet been coded and will not be presented in this article.
The MTA study achieved a high degree of adherence to protocol by cross-arm emphasis on subject rapport, manualization of all treatments, regular supervision of pharmacotherapists and psychotherapists by skilled clinician investigators, cross-site weekly treatment panels, and audiotaping of all sessions. Good compliance (reflected by acceptance and attendance at treatment sessions) by patients with the protocol was facilitated by monthly pill counts, intermittent saliva measurements to monitor taking of methylphenidate, and encouragement of families to make up missed visits. Only 13 (9.0%) of 144 medication management subjects and 5 (3.4%) of 145 combined treatment subjects failed to start medication. More remarkably, none of 144 behavioral treatment subjects and only 1 (0.7) of 145 combined treatment subjects refused behavioral treatment. There was no difference between medication management or combined treatment in medication session attendance, nor between behavioral treatment and combined treatment in the degree of behavioral treatment acceptance or attendance. These factors did not affect the overall findings.40
Subjects/families refusing their respective treatments were encouraged to reconsider their decision throughout the study, as well as to continue to complete all assessments. Thus, including all subjects who continued to participate in assessments (despite refusing part of all of their assigned treatments), the absolute attrition rate over the course of the study was 3.5%, with only 20 complete dropouts by 14 months (6 in the community care group, 3 in behavioral treatment, 8 in medication management, 3 in combined treatment).
Given the well-described advantages of random-effects regression (RR) techniques over traditional analyses of variance for clinical trials data,43-46 we used RR whenever possible for our primary intent-to-treat analyses.20,21 Rather than define a single outcome, we specified multiple outcome variables, anticipating differential impacts of the treatment modalities on various outcome domains.5-7,14 Based on our data reduction procedures outlined above, the 6 domains were represented by 19 measures (Table 4). For each outcome variable, tests for site, time, time × treatment (the treatment effect over time), and site × treatment × time were conducted within the intent-to-treat RR analyses. When omnibus RR analyses comparing all 4 groups were significant, 3 sets of pairwise comparisons were performed, each set addressing 1 of the principal study questions: (1) medication management vs behavioral treatment (2 tailed); (2) combined treatment vs medication management and combined treatment vs behavioral treatment (1 tailed, assuming the superiority of combined treatment); and (3) community care vs medication management, community care vs behavioral treatment, and community care vs combined treatment (1 tailed, assuming the superiority of MTA treatments).
Bonferonni corrections were applied to all omnibus tests, based on the number of measures in the respective domain. Thus, for each of 5 measures within the ADHD domain, standard definitions of significance (P<.05) were corrected by dividing by 5, requiring P<.01 for significance. The 6 pairwise contrasts were likewise adjusted by dividing omnibus-corrected significance levels further by 6 (P<.01 ÷ 6 = .0017 in this example). Given our a priori hypotheses about the superiority of multimodal treatment (combined treatment vs medication management and behavioral treatment), as well as the superiority of all 3 of our treatments over community care (community care vs combined treatment, medication management, behavioral treatment), we used 1-tailed tests for these specific contrasts. This approach was taken as a means of striking a balance between the dangers of committing type II errors vs overinterpreting significant findings that occurred simply by chance (type I errors). Readers are encouraged to exercise caution when interpreting 1-tailed findings and may choose to double 1-tailed P values.
Because initial RR analyses revealed both quadratic and linear effects of time on treatment outcomes, we computed the log of the number of days since randomization for each assessment point and used these log values in all RR analyses.
Despite high compliance, we checked whether compliance with assessments (ie, missing data) could have changed our findings. Random-effects regression analyses were completed 2 ways: once with inclusion of all subjects, and then with only those subjects who provided data over multiple time points during the study. No differences emerged among these 2 sets of analyses, lending confidence to the overall findings.
For 10 of 19 variables, omnibus tests revealed significant treatment effects over time. For these analyses, we describe the results of the paired comparisons, in order of our original hypotheses.
Do medication and behavioral treatments result in comparable levels of improvement in pertinent outcomes at the end of treatment? Robust differences were found according to 2 different data sources, indicating the superiority of medication management over behavioral treatment for ADHD symptoms (Table 5); namely, parents' and teachers' ratings of inattention and teachers' ratings of hyperactivity-impulsivity. Medication management and behavioral treatment did not differ significantly on any other outcomes.
Do participants assigned to combined treatment show higher levels of improvement in overall functioning in pertinent outcome domains than those assigned to either medication management or behavioral treatment at the end of treatment (1-tailed hypotheses)? These analyses indicate that combined treatment and medication management did not differ significantly across any domain.
Compared with behavioral treatment, combined treatment was superior in benefitting ADHD symptoms, according to parents' and teachers' ratings of inattention and parent-rated hyperactivity-impulsivity. Combined treatment also significantly outperformed behavioral treatment on parents' SNAP oppositional/aggressive behaviors, parent-rated internalizing symptoms (Table 5), and Weschler Individual Achievement Test reading achievement score (Table 4).
Do participants assigned to each of the 3 MTA treatments (medication management, behavioral treatment, and combined treatment) show greater improvement over 14 months than those assigned to community care (1 tailed)? These analyses reveal that combined treatment and medication management were generally superior to community care for parent- and teacher-reported ADHD symptoms, whereas behavioral treatment was not (Table 5). In non-ADHD domains, medication management and behavioral treatment were superior to community care on 1 domain only (teacher-reported social skills and 1 measure of parent-child relations, respectively). In contrast, combined treatment was significantly superior to community care on all 5 non-ADHD domains of functioning (parent-reported oppositional/aggressive behaviors, internalizing symptoms, teacher-reported social skillls, parent-child relations, and Weschler Individual Achievement Test reading achievement scores).
Because our RR intent-to-treat analyses included all subjects' data points through 14 months, it is possible that some treatment groups (especially behavioral treatment, where 38 crossovers to medication occurred) may have fared better because of the number who had received additional treatments over the course of the study. To address this issue, we conducted additional RR analyses, censoring any observations obtained after crossover subjects had received the additional treatments. These analyses yielded no differences from the findings noted in Table 5 (analyses available from the authors on request).
Figure 1 shows the 4 RR analyses selected to highlight findings from different domains, as well as from raters who were likely to witness the target behaviors: hyperactivity-impulsivity (a core ADHD symptom)(teacher report), internalizing symptoms (parent report), social skills (teacher report), and parent-child arguing (power assertion, parent report). The remaining RR graphs are available from the authors on request.
All 4 groups showed marked reductions in symptoms over time, with significant differences among them in degrees of change. Combined treatment and medication management treatments were clinically and statistically superior to behavioral treatment and community care in reducing children's ADHD symptoms. Combined behavioral intervention and stimulant medication—multimodal treatment, the current criterion standard for ADHD interventions—yielded no significantly greater benefits than medication management for core ADHD symptoms; this parallels findings reported by others.10,42 Also consistent with previous reports, our combined treatment outcomes were achieved with significantly lower medication doses than used in medication management.42,47
For other areas of function (oppositional/aggressive behaviors, internalizing symptoms, social skills, parent-child relations, and academic achievement), few differences among our treatments were noted, and when found, were generally of smaller magnitude. In fact, combined treatment, medication management, and behavioral treatment never differed significantly among themselves, with 3 exceptions (combined treatment > behavioral treatment for parent-reported internalizing problems and oppositional/aggressive symptoms, and Weschler Individual Achievement Test reading achievement score).
With respect to comparisons of MTA treatments with community care, combined treatment and medication management fared substantially better than community care on most ADHD outcome measures, while behavioral treatment did not. According to least 1 informant, combined treatment also fared significantly better than community care for all 5 non-ADHD domains: parent-reported oppositional/aggressive symptoms, parent-reported internalizing problems, teacher-reported social skills, parent-child relations, and reading achievement. In contrast, medication management and behavioral treatment each fared better than community care in 1 non-ADHD domain only (teacher-rated social skills and parent-child relations).
Our finding that MTA treatments (most notably combined treatment) offered greater benefits than community care for oppositional/aggressive behaviors, internalizing symptoms, peer interactions, parent-child relations, and reading achievement has not been previously reported in long-term studies.3,5,14 However, the differential benefits in these non-ADHD domains are consistent with the theoretical aims of multimodal approaches.14 For example, medication is known to reduce negative peer interactions dramatically, but increases in positive social behavior are far less robust.48 Such changes might require intensive and long-term application of the behavioral components of combined treatments, such as those found in our STP and school-based interventions.38 Similarly, parent training, which includes positive parental attention and rewards for the child's appropriate behavior, when combined with medication, might be expected to decrease oppositionality and enhance parent-child relations more than medication alone. For internalizing symptoms, the relatively greater improvements for subjects given combined treatment are particularly noteworthy, as none of our treatments were designed to address this domain specifically.
The MTA study extends the findings of previous studies that demonstrated short-term, robust efficacy of medication management, showing that these benefits persist during treatment up to 14 months. In contrast to frequently expressed concerns, children given combined treatment and medication management tolerated medication well, including a third dose given in the afternoon. The relative improvements attributed to medication management also parallel findings from other, longer-duration stimulant trials.9,10,49 Given the MTA's size and scope, however, we saw effects across diverse settings, patient groups, provider characteristics, and outcome domains. These findings were further strengthened by the absence of any site×treatment interactions.
Although combined treatment and medication management were generally superior to community care, community treatments usually included medication; hence, it is unclear which components of the 2 MTA medication treatments may have rendered them more effective than community care. Further analyses of these findings are presented in our companion report,40 but reviewing the apparent differences is instructive. We used a manualized medication titration procedure and "thrice-daily" dosing, as well as higher, carefully monitored daily doses to maximize positive effects and minimize side effects.15 We met with parents monthly and obtained systematic feedback, from both them and the children's teachers. Parent guidance and selected readings were provided as needed; this is reported to provide benefits over simple pill dispensing alone.50 These components, particularly the systematic and regular feedback from teachers, do not seem to be part of routine pediatric ADHD treatment practices,13 and may have enhanced the effectiveness of our medication management.
The modest benefits for some non-ADHD domains obtained by multimodal treatments have been reported previously5,51,52 after 3 to 4 months of treatment. In contrast, the study by Hechtman and Abikoff10 failed to demonstrate these effects after 12 months of active treatment, perhaps because of their smaller sample (103 subjects distributed across 3 groups). In addition to the MTA's sample size advantages, its behavioral component design required that behavioral treatment interventions be delivered across multiple settings and caretakers (home, school, and STP), augmented with further strategies to facilitate the generalization of effects across settings and over time17,19—all enhancements not found in previous studies.8
Whether there is greater value for multimodal treatments for ADHD depends on which intervention is considered as the comparison. If one assumes that a behavioral intervention should always be used as the first-line ADHD treatment (often the preference for many parents, and the practice in many European countries), and that the possibly greater benefits of combined treatment should be determined, then combined treatment seems to offer a great deal of benefit over behavioral treatment alone. But if one provides carefully monitored medication treatment similar to that used in this study as the first line of treatment, our results suggest that many treated children may not require intensive behavioral interventions.47
The significantly lower total daily doses of methylphenidate in the combined treatment arm are noteworthy but not unforeseen.42,47 The importance of this finding is unclear, and a rigorous test of this question would likely require a different design. Nonetheless, this issue remains an ongoing source of concern of many parents and clinicians and should not be dismissed, particularly since side effects are usually related to dosage. If equivalent, sometimes better results can be obtained by a combined treatment that uses lower doses, such findings may have public health importance.
Concerning the relative benefits of our behavioral treatment alone, results must be understood within the context of the limitations of our study design. Most importantly, our design did not include a no-treatment or placebo group (an ethically unacceptable option for an ADHD study of this length), and most subjects given community care received medication for most of the study period—a treatment of known efficacy. Substantial improvement occurred over time across all groups (including community care), regardless of rating source or method. While some of this change could be regression to the mean, this explanation cannot account for the substantial group differences reported here. More than three fourths of subjects given behavioral treatment were successfully maintained without medication throughout the study. Consequently, it should not be concluded that behavioral treatment interventions did not work.
One caveat concerns our choice and number of outcome measures. We felt that the treatment effects in different domains and from different respondents might vary, and that these variations were necessary in interpreting the results of the study. Consequently, despite the loss of power caused by Bonferonni corrections, we chose 19 primary outcome measures rather than a single summary outcome score. Power calculations17 that underlay the setting of sample size were based on setting as the lower limit of clinical significance an effect size of 0.4 ("small to moderate"53) and on the requirement of 80% power (5% level of confidence) to detect effects of that magnitude. Thus, the chance is high of declaring effect sizes lower than 0.4 not statistically significant, even though some clinicians might regard such effects as clinically significant.
Hence, for purposes of assessing clinical significance, the overall pattern of results may be more instructive than any single test statistic or P value reported in Table 4 and Table 5. The general ordering of treatment groups' benefits is confirmed by simple inspection of the number of times each group placed first compared with all others on the 19 outcome measures: combined treatment, 12; medication management, 4; behavioral treatment, 2; and community care, 1. While combined treatment scored numerically "best" on most outcome measures, we did not have statistical power to detect small effects, such as those that might exist between combined treatment and medication management.25 Statistical significance, of course, cannot be interpreted as necessarily indicative of clinical or practical significance, and lack of significance is never proof of the equivalency of treatments.
By way of caution, we note that subgroup analyses may yield different treatment effects for specific patient groups than the main intent-to-treat analyses presented herein. In our companion article40 we report analyses indicating that children with ADHD who have co-occurring disorders, as well as those with fewer family resources, are more likely to benefit specifically from combined and behavioral treatments for some outcome domains. In addition, other unidentified individual factors may yield excellent responses to one of the several treatments.47 Response patterns and characteristics of such "excellent responders" will likely be obscured by large-group analyses. Future studies will examine these issues in more detail.
Such considerations highlight the need for caution in interpreting our results and argue against a "one size fits all" approach to treatment. Moreover, our results cannot necessarily be generalized beyond ADHD Combined Type; other ADHD subtypes (eg, inattentive subtype) may warrant somewhat different treatments.
Previous research has shown that the medication benefits persist only so long as treatment is continued.6,54 Whenever possible, our subjects given combined treatment and medication management received medication throughout the 14 months, including all assessment points. By contrast, and consistent with clinical practice, for subjects given behavioral treatment, the frequency of contact with therapists was gradually reduced to once-monthly contacts 3 to 6 months prior to posttreatment assessment; in general, they were assessed up to 1 month after their last treatment contact. Procedures for maintenance and generalization were incorporated throughout behavioral treatment's implementation; the goal was that benefits would persist as parents and children learned and consolidated their skills. However, our behavioral treatment procedures, increasingly nonintensive during the once-monthly contacts, were insufficient to produce overall effects comparable or additive with 14 months of ongoing medication management for core ADHD symptoms.
Unanswered in our study are important questions concerning behavioral and combined treatments for ADHD. Are there some children for whom medication management is no longer necessary,9 and if so, why does this occur? Might the behavioral component of combined treatment allow some children to be successfully tapered off medication? Can and should behavioral and pharmacologic treatments be tapered, and how can that be accomplished while maintaining effects? Will findings differ as children age, such that those who have learned increased skills via behavioral interventions eventually function better than those receiving only medication? Follow-up study of our subjects past 14 months (currently under way) will address some of these critical questions.
Since ADHD is now regarded by most experts as a chronic disorder,14 ongoing treatment often seems necessary. As with other chronic conditions, such as diabetes and asthma, the need for active treatment may wax and wane. Just as exercise, diet, and pollen load may affect these illnesses, persons' learning or work environments and intercurrent stressors may affect the need for, type, and intensity of ADHD treatments over the life course.55 Under such conditions, behavioral treatments may help families actively cope with their child's disorder and make the necessary life accommodations to optimize family functioning, even when such treatments are not as effective as medication in reducing children's ADHD symptoms. Indeed, 14-month end-point analyses indicated that parent satisfaction ratings differed significantly by treatment group; pairwise contrasts showed that treatment satisfaction scores for combined treatment and behavioral treatment parents were significantly superior to medication management parents' ratings (though not differing between themselves), suggesting that the behavioral treatment components benefitted this area of family-relevant outcomes.
The absence of any site×treatment×time interactions suggests that both our pharmacological and behavioral treatments could be delivered with fidelity across 6 very different clinical settings. However, the use of these treatments will ultimately be determined by the degree to which they are feasible, transportable, and affordable in real-world settings—topics for future research. In the interim, however, the MTA study, by virtue of its size; scope; length; parallel group design; explicit use of manualized, evidence-based treatments; high degree of compliance across arms and over the course of the study; and comprehensive range of outcome assessments, sets an important benchmark for future trials testing new treatments for childhood ADHD.
Accepted for publication July 28, 1999.
This study was supported by grants UO1 MH50461 (Drs Hinshaw and Elliot), UO1 MH50447 (Drs Conners, Wells, and March), UO1 MH50440 (Drs Swanson, Cantwell, and Wigal), UO1 MH50453 (Drs Abikoff and Hechtman), UO1MH50454 (Drs Greenhill and Newcorn), and UO1 MH50467 (Drs Pelham and Hoza), from the National Institute of Mental Health, Bethesda, Md.
The opinions and assertions contained in this study are the private views of the authors and are not to be construed as official or reflecting the views of the Department of Health and Human Services, Washington, DC, or the National Institute of Mental Health, Bethesda, Md.
Principal Collaborators
National Institute of Mental Health, Rockville, Md: Peter S. Jensen, MD (Office of the Director); L. Eugene Arnold, MD (Department of Psychiatry, Ohio State University); John E. Richters, PhD (Developmental Psychopathology and Prevention Research Branch); Joanne B. Severe, MS (Research Projects and Publications Branch); Donald Vereen, MD (Office of Drug Control Policy); and Benedetto Vitiello, MD (Child and Adolescent Treatment and Preventive Interventions Research Branch).
Office of Special Education Programs, US Department of Education, Washington, DC: Ellen Schiller, PhD.
Principal Investigators and Coinvestigators
University of California, Berkeley/University of California, San Francisco: Stephen P. Hinshaw, PhD (Department of Psychology, University of California, Berkeley); Glen R. Elliott, MD, PhD (Department of Psychiatry, University of California, San Francisco).
Duke University, Durham, NC: C. Keith Conners, PhD, Karen C. Wells, PhD, and John March, MD (Department of Psychiatry and Behavioral Sciences).
University of California, Irvine/University of California at Los Angeles: James Swanson, PhD, and Timothy Wigal, PhD (Department of Pediatrics and Cognitive Science, University of California, Irvine); Dennis P. Cantwell, MD (deceased) (Department of Psychiatry, Neuropsychiatric Institute, University of California at Los Angeles).
Long Island Jewish Medical Center, New York, NY/Montreal Children's Hospital, Montreal, Quebec: Howard B. Abikoff, PhD (Department of Psychiatry, New York University School of Medicine); Lily Hechtman, MD (Department of Psychiatry, McGill University, Montreal).
New York State Psychiatric Institute/Columbia University/Mount Sinai Medical Center, New York, NY: Laurence L. Greenhill, MD (Department of Psychiatry, Columbia University); Jeffrey H. Newcorn, MD (Department of Psychiatry, Mount Sinai School of Medicine).
University of Pittsburgh, Pittsburgh, Pa: William E. Pelham, PhD (Department of Psychology, State University of New York at Buffalo); Betsy Hoza, PhD (Department of Psychological Sciences, Purdue University, West Lafayette, Ind).
Statistical and Design Consultation
Helena C. Kraemer, PhD (Department of Psychiatry and Behavioral Science, Stanford University, Palo Alto, Calif).
Corresponding author: Peter S. Jensen, MD, Department of Child Psychiatry, Unit 78, Center for the Advancement of Children's Mental Health, New York State Psychiatric Institute/Columbia University, 1051 Riverside Dr, New York, NY 10032 (e-mail: jensenp@child.cpmc.columbia.edu).
1.Popper
CW Disorders usually first evident in infancy, childhood, or adolescence. Talbott
JAHales
REYudofsky
SCeds.
Textbook of Psychiatry Washington, DC American Psychiatric Press1988;649- 735
Google Scholar 2.Barkley
RA Attention-deficit hyperactivity disorder. Mash
EJBarkley
RAeds.
Child Psychopathology New York, NY Guilford Press1996;63- 112
Google Scholar 3.Hinshaw
SP Externalizing behavior problems and academic underachievement in childhood and adolescence: causal relationships and underlying mechanisms.
Psychol Bull. 1992;111127- 155
Google ScholarCrossref 4.Spencer
TBiederman
JWilens
THarding
MO'Donnell
DGriffin
S Pharmacotherapy of attention-deficit hyperactivity disorder across the life cycle.
J Am Acad Child Adolesc Psychiatry. 1996;35409- 432
Google ScholarCrossref 5.Swanson
J Effect of stimulant medication on hyperactive children: a review of reviews.
Except Child. 1993;60154- 162
Google Scholar 6.Pelham
WEMurphy
HA Behavioral and pharmacological treatment of hyperactivity and attention-deficit disorders. Herson
MBreuning
SEeds.
Pharmacological and Behavioral Treatment An Integrative Approach New York, NY John Wiley & Sons Inc1986;108- 147
Google Scholar 7.Hinshaw
SPKlein
RGAbikoff
H Childhood attention-deficit hyperactivity disorder: nonpharmacologic and combination treatments. Nathan
PEGorman
JMeds.
A Guide to Treatments That Work New York, NY Oxford University Press1998;26- 41
Google Scholar 8.Pelham
WEWheeler
TChronis
A Empirically supported psychosocial treatments for attention deficit hyperactivity disorder.
J Clin Child Psychol. 1998;27190- 205
Google ScholarCrossref 9.Gillberg
CMelander
Hvon Knorring
ALJanols
LOThernlund
GHagglof
BEidevall-Wallin
LGustafsson
PKopp
S Long-term stimulant treatment of children with attention-deficit hyperactivity disorder symptoms: a randomized, double-blind, placebo-controlled trial.
Arch Gen Psychiatry. 1997;54857- 864
Google ScholarCrossref 10.Hechtman
LAbikoff
H Multimodal treatment plus stimulants vs stimulant treatment in ADHD children: results from a two year comparative treatment study. Proceedings of the Annual Meeting of the American Academy of Child and Adolescent Psychiatry October 17-22, 1995 New Orleans, La.
11.Jensen
PSBhatara
VSVitiello
BHoagwood
KFeil
MBurke
LB Psychoactive medication prescribing practices for US children: gaps between research and clinical practice.
J Am Acad Child Adolesc Psychiatry. 1999;38557- 565
Google ScholarCrossref 12.Safer
DJZito
JMFine
EM Increased methylphenidate usage for attention deficit disorder in the 1990s.
Pediatrics. 1996;981084- 1088
Google Scholar 13.Sloan
MJensen
PKettle
L Assessing services for children with ADHD: gaps and opportunities.
J Attention Disord. 1999;313- 29
Google ScholarCrossref 14.Richters
JArnold
LEAJensen
PSAbikoff
HConners
CKGreenhill
LLHechtman
LHinshaw
SPPelham
WESwanson
JM NIMH Collaborative Multisite, Multimodal Treatment Study of Children With ADHD, I: background and rationale.
J Am Acad Child Adolesc Psychiatry. 1995;34987- 1000
Google ScholarCrossref 15.Greenhill
LLAbikoff
HBArnold
LECantwell
DPConners
CKElliott
GHechtman
LHinshaw
SPHoza
BJensen
PSMarch
JSNewcorn
JHPelham
WESevere
JBSwanson
JMVitiello
BWells
KC Medication treatment strategies in the MTA: relevance to clinicians and researchers.
J Am Acad Child Adolesc Psychiatry. 1996;351304- 1313
Google ScholarCrossref 16.Arnold
LEAbikoff
HBCantwell
DPConners
CKElliott
GGreenhill
LLHechtman
LHinshaw
SPHoza
BJensen
PSKraemer
HCMarch
JSNewcorn
JHPelham
WERichters
JESchiller
ESevere
JBSwanson
JMVereen
DWells
KC National Institute of Mental Health Collaborative Multimodal Treatment Study of Children With ADHD (the MTA): design challenges and choices.
Arch Gen Psychiatry. 1997;54865- 870
Google ScholarCrossref 17.Arnold
LEAbikoff
HCantwell
DPConners
CKElliott
GGreenhill
LLHechtman
LHinshaw
SPHoza
BJensen
PSKraemer
HCMarch
JSNewcorn
JHPelham
WERichters
JESchiller
ESevere
JBSwanson
JMVereen
DWells
KC NIMH Collaborative Multimodal Treatment Study of Children With ADHD (MTA): design, methodology, and protocol evolution.
J Attention Disord. 1997;2141- 158
Google ScholarCrossref 18.Hinshaw
SPMarch
JSAbikoff
HArnold
LECantwell
DPConners
CKElliott
GEHalperin
JGreenhill
LLHechtman
LTHoza
BJensen
PSNewcorn
JHMcBurnett
KPelham
WERichters
JESevere
JBSchiller
ESwanson
JVereen
DWells
KWigal
T Comprehensive assessment of childhood attention-deficit hyperactivity disorder in the context of a multisite, multimodal clinical trial.
J Attention Disord. 1997;1217- 234
Google ScholarCrossref 19.Wells
KPelham
WEKotkin
RAHoza
BAbikoff
HArnold
LEAbramowitz
ACantwell
DPConners
CKDel Carmen
RElliott
GGreenhill
LLHechtman
LTHinshaw
SPJensen
PSMarch
JSSchiller
ESevere
JSwanson
JM Psychosocial treatment strategies in the MTA study: rationale, methods, and critical issues in design and implementation.
J Abnorm Child Psychol. In press.
Google Scholar 20.Hedeker
DGibbons
RD MIXREG: a computer program for mixed-effects regression analysis with autocorrelated errors.
Comput Methods Programs Biomed. 1996;49229- 252
Google ScholarCrossref 21.Hedeker
DGibbons
RD MIXOR: a computer program for mixed-effects ordinal regression analysis.
Comput Methods Programs Biomed. 1996;49157- 176
Google ScholarCrossref 22.Wolraich
MLHannah
JNBaumgaertel
AFeurer
ID Examination of
DSM-IV criteria for attention deficit/hyperactivity disorder in a county-wide sample.
J Dev Behav Pediatr. 1998;19162- 168
Google ScholarCrossref 23.Shaffer
DFisher
PDulcan
MDavies
MPiacentini
JSchwab-Stone
MLahey
BBourdon
KJensen
PBird
HCanino
GRegier
D The second version of the NIMH Diagnostic Interview Schedule for Children (DISC-2).
J Am Acad Child Adolesc Psychiatry. 1996;35865- 877
Google ScholarCrossref 24.Cohen
J Statistical Power Analyses for the Behavioral Sciences. 2nd ed. Hillsdale, NJ Lawrence A Erlbaum Associates1988;
25.Kazdin
AEBass
D Power to test differences between alternative treatments in comparative psychotherapy outcome research.
J Consult Clin Psychol. 1989;57138- 147
Google ScholarCrossref 26.Conners
CK Manual for the Conners' Rating Scales. Toronto, Ontario Multi-Health Systems1990;
27.Lahey
BBApplegate
BMcBurnett
KBiederman
JGreenhill
LHynd
GWBarkley
RANewcorn
JJensen
PRichters
JGarfinkel
BKerdyk
LFrick
POllendick
TPerez
DHart
EWaldman
IShaffer
D
DSM-IV field trials for attention deficit hyperactivity disorder in children and adolescents.
Am J Psychiatry. 1993;1511673- 1685
Google Scholar 28.Biederman
JNewcorn
JSprich
S Comorbidity of attention deficit hyperactivity disorder with conduct, depressive, anxiety, and other disorders.
Am J Psychiatry. 1991;148564- 577
Google Scholar 29.Nottelmann
EJensen
P Comorbidity of disorders in children and adolescents: developmental perspectives. Ollendick
TPrinz
Reds.
Advances in Clinical Child Psychology Vol 17 New York, NY Plenum Publishing Corp1995;109- 155
Google Scholar 30.Szatmari
POfford
DRBoyle
MH Correlates, associated impairments and patterns of services utilization of children with attention deficit disorder: findings from the Ontario Child Health Study.
J Child Psychol Psychiatry. 1989;30205- 217
Google ScholarCrossref 31.Swanson
JM School-based Assessments and Interventions for ADD Students. Irvine, Calif KC Publications1992;
32.Gresham
FMElliott
SN Social Skills Rating System: Automated System for Scoring and Interpreting Standardized Test [computer program]. Version 1 Circle Pines, Minn American Guidance Systems1989;
33.March
JSParker
JDSullivan
KStallings
PConners
CK The Multidimensional Anxiety Scale for Children (MASC): factor structure, reliability, and validity.
J Am Acad Child Adolesc Psychiatry. 1997;36554- 565
Google ScholarCrossref 34.Not Available, Wechsler Individual Achievement Test: Manual. San Antonio, Tex Psychological Corp1992;
35.Abikoff
HGittelman
R Classroom observation code: a modification of the Stony Brook Code.
Psychopharmacol Bull. 1985;21901- 909
Google Scholar 36.Barkley
RA Defiant Children: A Clinician's Manual for Parent Training. New York, NY Guilford Press1987;
37.Forehand
RMacMahon
R Helping the Noncompliant Child: A Clinician's Guide to Parent Training. New York, NY Guilford Press1980;
38.Pelham
WEHoza
B Comprehensive treatment for ADHD: a proposal for intensive summer treatment programs and outpatient follow-up. Hibbs
EJensen
Peds.
Psychosocial Treatment Research of Child and Adolescent Disorders Washington, DC American Psychiatric Press1996;311- 340
Google Scholar 39.Pelham
WWaschbusch
D Behavioral intervention in ADHD. Quay
HPHogan
AEeds.
Handbook of Disruptive Behavior Disorders New York, NY Plenum Publishing Corp1999;255- 278
Google Scholar 40.The MTA Cooperative Group, Mediators and moderators of treatment response for children with attention-deficit/hyperactivity disorder: the Multimodal Treatment Study of Children With Attention Deficit Hyperactivity Disorder Study.
Arch Gen Psychiatry. 1999;561088- 1096
Google ScholarCrossref 41.Pelham
W Pharmacotherapy for children with attention deficit hyperactivity disorder.
School Psychol Rev. 1993;22199- 227
Google Scholar 42.Carlson
CLPelham
WEMilich
RDixon
J Single and combined effects of methylphenidate and behavior therapy on the classroom performance of children with ADHD.
J Abnorm Child Psychol. 1992;20213- 231
Google ScholarCrossref 44.Bryk
ASRaudenbush
SW Toward a more appropriate conceptualization of research on school effects: a three-level heirarchical linear model.
Am J Educ. 1988;9768- 108
Google ScholarCrossref 45.Gibbons
RDHedeker
DElkin
IWaternaux
CKraemer
HCGreenhouse
JShea
MTImber
SDSotsky
SMWatkins
JT Some conceptual and statistical issues in analysis of longitudinal psychiatric data: application to the NIMH Treatment of Depression Collaborative Research Program dataset.
Arch Gen Psychiatry. 1993;50739- 750
Google ScholarCrossref 46.Hedeker
DGibbons
RDFlay
BR Random-effects regression models for clustered data with an example from smoking prevention research.
J Consult Clin Psychol. 1994;62757- 765
Google ScholarCrossref 47.Swanson
JMKraemer
HCHinshaw
SP Conceptual and methodologic issues in the evaluation of treatment effectiveness for attention deficit hyperactivity disorder.
J Child Psychol Psychiatry. In press.
Google Scholar 48.Hinshaw
SPHenker
BWhalen
CKErhardt
DDunnington
RE Aggressive, prosocial, and nonsocial behavior in hyperactive boys: dose effects of methylphenidate in naturalistic settings.
J Consult Clin Psychol. 1989;57636- 643
Google ScholarCrossref 49.Schachar
RJTannock
RCunningham
CCorkum
PV Behavioral, situational, and temporal effects of treatment of ADHD with methylphenidate.
J Am Acad Child Adolesc Psychiatry. 1997;36754- 763
Google ScholarCrossref 50.Long
NRickert
VIAshcraft
EW Bibliotherapy as an adjunct to stimulant medication in the treatment of attention-deficit hyperactivity disorder.
J Pediatr Health Care. 1993;782- 88
Google ScholarCrossref 51.Ialongo
NSHorn
WFPascoe
JMGreenberg
GPackard
TLopez
MWagner
APuttler
L The effects of a multimodal intervention with attention-deficit hyperactivity disorder children: a 9-month follow-up.
J Am Acad Child Adolesc Psychiatry. 1993;32182- 189
Google ScholarCrossref 52.Klein
RGAbikoff
H Behavior therapy and methylphenidate in the treatment of children with ADHD.
J Attention Disord. 1997;289- 114
Google ScholarCrossref 53.Kraemer
HC Reporting the size of effects in research studies to facilitate assessment of practical or clinical significance.
Psychoneuroendocrinology. 1992;17527- 536
Google ScholarCrossref 54.Firestone
PCrowe
DGoodman
JTMcGrath
P Vicissitudes of follow-up studies: differential effects of parent training and stimulant medication with hyperactives.
Am J Orthopsychiatry. 1986;56184- 194
Google ScholarCrossref 55.Biederman
JMilberger
SFaraone
SVKiely
KGuite
JMick
EAblon
SWarburton
RReed
E Family-environment risk factors for attention-deficit hyperactivity disorder: a test of Rutter's indicators of adversity.
Arch Gen Psychiatry. 1995;52464- 470
Google ScholarCrossref