Peikes D, Chen A, Schore J, Brown R. Effects of Care Coordination on Hospitalization, Quality of Care, and Health Care Expenditures Among Medicare Beneficiaries15 Randomized Trials. JAMA. 2009;301(6):603–618. doi:10.1001/jama.2009.126
Author Affiliations: Mathematica Policy Research Inc, Princeton, New Jersey.
Context Medicare expenditures of patients with chronic illnesses might be reduced through improvements in care, patient adherence, and communication.
Objective To determine whether care coordination programs reduced hospitalizations and Medicare expenditures and improved quality of care for chronically ill Medicare beneficiaries.
Design, Setting, and Patients Eligible fee-for-service Medicare patients (primarily with congestive heart failure, coronary artery disease, and diabetes) who volunteered to participate between April 2002 and June 2005 in 15 care coordination programs (each received a negotiated monthly fee per patient from Medicare) were randomly assigned to treatment or control (usual care) status. Hospitalizations, costs, and some quality-of-care outcomes were measured with claims data for 18 309 patients (n = 178 to 2657 per program) from patients' enrollment through June 2006. A patient survey 7 to 12 months after enrollment provided additional quality-of-care measures.
Interventions Nurses provided patient education and monitoring (mostly via telephone) to improve adherence and ability to communicate with physicians. Patients were contacted twice per month on average; frequency varied widely.
Main Outcome Measures Hospitalizations, monthly Medicare expenditures, patient-reported and care process indicators.
Results Thirteen of the 15 programs showed no significant (P<.05) differences in hospitalizations; however, Mercy had 0.168 fewer hospitalizations per person per year (90% confidence interval [CI], −0.283 to −0.054; 17% less than the control group mean, P=.02) and Charlestown had 0.118 more hospitalizations per person per year (90% CI, 0.025-0.210; 19% more than the control group mean, P=.04). None of the 15 programs generated net savings. Treatment group members in 3 programs (Health Quality Partners [HQP], Georgetown, Mercy) had monthly Medicare expenditures less than the control group by 9% to 14% (−$84; 90% CI, −$171 to $4; P=.12; −$358; 90% CI, −$934 to $218; P=.31; and −$112; 90% CI, −$231 to $8; P=.12; respectively). Savings offset fees for HQP and Georgetown but not for Mercy; Georgetown was too small to be sustainable. These programs had favorable effects on none of the adherence measures and only a few of many quality of care indicators examined.
Conclusions Viable care coordination programs without a strong transitional care component are unlikely to yield net Medicare savings. Programs with substantial in-person contact that target moderate to severe patients can be cost-neutral and improve some aspects of care.
Trial Registration clinicaltrials.gov Identifier: NCT00627029
Chronic illnesses pose a significant expense to the Medicare program and a major detriment to beneficiaries' quality of life. The cost and complexity of care are greater for those patients with multiple chronic illnesses. In 2002, for example, half of Medicare beneficiaries had been treated for 5 or more conditions but accounted for a disproportionately large 75% of Medicare spending.1
The high Medicare expenditures generated by these beneficiaries are driven primarily by hospital admissions and readmissions.2 Several factors appear to contribute to the high rate of hospitalizations. Chronically ill patients may have received inadequate counseling on diet, medication, and self-care, or may find it hard to adhere to such regimens,3- 9 leading to acute exacerbations of their conditions.10- 15 Patients may lack the knowledge to recognize early warning signs of decompensation or the skills to respond to such signs, or they may not have ready access to medical help other than the emergency department.13,16 Physicians may be unaware of patients' deficits in knowledge and skills, or of patients' barriers to adherence.17- 19 Chronically ill patients often see multiple physicians (1 study20 found a median of 7 different physicians per year) who may be incompletely aware of each others' care, prescribe incompatible or contraindicated treatments, or provide conflicting advice; often no one physician is responsible for a beneficiary's care.20 Reimbursement under the current Medicare fee-for-service program for education and counseling, care coordination, and ongoing monitoring is limited. Finally, chronically ill beneficiaries often do not receive treatment that has been shown to be effective for their conditions.21- 23
Studies have thus suggested that interventions to address the barriers faced by chronically ill patients could reduce avoidable hospitalizations and thereby decrease Medicare expenditures. Such interventions might include components to deal with each of the problems listed above by (1) promoting patient-centered care and improving physician-patient communication,24,25 (2) increasing patients' adherence to recommended medications and self-care regimens, (3) facilitating greater communication between physicians, and (4) making medical care more evidence-based.26- 34 Many health plans thus either develop their own care coordination programs or outsource these programs to commercial disease management vendors.35,36 A few relatively small, single-center trials of interventions that included some or all of these components have successfully lowered hospitalizations for Medicare patients.28,37- 40 However, there have been few large, rigorously designed studies of such interventions. Published studies show mixed effects on health outcomes and cost, and it is unclear how these interventions should be designed.32,41- 46
To study whether care coordination improves the quality of care and reduces Medicare expenditures, the Balanced Budget Act of 1997 mandated that the Secretary of Health and Human Services conduct and evaluate care coordination programs in the Medicare fee-for-service setting.47 The legislation authorized permanent implementation of those demonstration components, if any, that either (1) reduced total Medicare expenditures, including program fees; or (2) increased the quality of health care services and satisfaction of beneficiaries and health care organizations without increasing expenditures.
In mid-2000, the Centers for Medicare & Medicaid Services (CMS) solicited proposals for programs to be sites in the Medicare Coordinated Care Demonstration (MCCD). An expert panel convened by CMS reviewed proposals; scored each numerically in the areas of intervention design, organizational capabilities, capacity to implement the intervention (including ability to recruit enough participants to achieve adequate statistical power), and evidence for potential cost-savings and cost-neutrality; ranked the applications; and provided written assessments and recommendations.48 The CMS made the final selections and, in early 2002, competitively awarded 15 demonstration programs. In addition, CMS contracted with Mathematica Policy Research Inc (MPR) to conduct the independent evaluation. This analysis summarizes the results from the randomized controlled trials of these 15 programs on how they affected Medicare expenditures and quality of care.
Each program was allowed to define, within broad boundaries, its own target population and exclusion criteria, and designed its intervention accordingly. Medicare beneficiaries who resided in the programs' catchment areas, were covered by fee-for-service (traditional) Medicare, and had 1 or more of the chronic conditions targeted by the local program were eligible. Chronic conditions selected by the programs included coronary artery disease (CAD), congestive heart failure (CHF), diabetes, chronic pulmonary disease, and other conditions to a lesser extent. (See Table 1 for the full list. Because the sample sizes—and hence the power to detect effects—varied, all tables in this manuscript arranged the programs in 3 groups, according to their samples sizes, which were defined post hoc). Ten programs required that enrollees had to be admitted to a hospital (6 programs required that it be for the target condition) within the year before enrollment. Four programs explicitly excluded beneficiaries younger than 65 years and 13 programs excluded those patients with end-stage renal disease or receiving dialysis. Fourteen programs excluded beneficiaries with certain other conditions, including terminal illness, conditions that affected their ability to learn self-management (eg, serious mental illness or dementia), or conditions that were complex to manage but unrelated to target diagnoses (eg, human immunodeficiency virus/AIDS); the specific conditions varied by program. Nine programs also excluded long-term nursing home residents.
Programs began enrolling patients between April and September 2002 and were initially authorized to operate for 4 years. Our study reports on patients enrolled through June 2005. Complete Medicare claims data were available for this study for services rendered through June 2006, the originally planned end of the 4-year demonstration period. The analysis sample was restricted to beneficiaries enrolling through June 2005 to ensure that at least 1 year of follow-up was potentially available for all sample members and that those in the treatment group would have at least 1 year of potential exposure to the intervention.
The Secretary of Health and Human Services, acting through the CMS, determined that the overall demonstration and evaluation met all criteria in both the Common Rule and National Institutes of Health's Exemption Number 5 for exemption from institutional review board review for research and demonstration projects on public benefit and service programs.49- 51 Mathematica Policy Research Inc has a Federal-wide Assurance of Protection for Human Subjects for demonstrations conducted by governmental agencies. Although neither the legislation nor the US Department of Health and Human Services required certification of institutional review board review for this exempt research, each of the programs decided on its own whether to claim the exemption or to instead seek approval of its protocols from its local institutional review board. All study participants provided written informed consent; each program determined whether proxies could provide consent.
Each program's intake staff recruited patients for its program and transmitted patient information on consenting beneficiaries to MPR's study Web site; MPR checked the information for previous enrollment, completeness, and validity, and then randomized eligible applicants within each program to the treatment or control group in a 1:1 ratio, using randomly generated, concealed 4-digit “strings” of treatment-control assignments. Strings with all treatments or all controls were excluded to minimize runs of more than 6 consecutive treatment or control group assignments. The random assignment result was returned within seconds to the program via the Web site. Five programs requested and were allowed to have their patients randomized within program-defined severity of illness strata. Because of the nature of the intervention, no individuals were blinded to which group participants were randomized.
The 15 program hosts included 5 commercial disease management companies, 3 community hospitals, 3 academic medical centers, 1 integrated delivery system, 1 hospice, 1 long-term care facility, and 1 retirement community (Table 1). Programs served patients in Maine (statewide); Baltimore, Maryland (2 programs); Washington, DC; eastern Virginia; southern Florida; east central Illinois; St Louis, Missouri; northwestern Iowa and southeastern South Dakota (2 programs); Phoenix, Arizona; New York City; eastern Pennsylvania; Houston, Texas; and 2 counties in central California.
To better understand the programs' activities, MPR reviewed their protocols and documentation and interviewed staff at each program at the end of the first and third years of operations (2003 and 2005). The care coordination interventions of the 15 programs differed widely (described briefly in Table 1 due to space constraints; full descriptions are provided elsewhere52). All of the programs assigned patients to a care coordinator. Although 1 program used licensed practical nurses, all other programs required care coordinators to be registered nurses and 4 programs required them to have bachelor of science in nursing degrees. In all programs, the care coordinators assessed patients' needs and developed patient care plans.
All but 1 of the programs educated patients to improve adherence to medication, diet, exercise, and self-care regimens, mostly through the nurses conveying factual information. Seven programs also used behavior change models such as the transtheoretical approach53 or techniques such as motivational interviewing.54 (The University of Maryland program was meant only to test the effect of home telemonitoring and thus did not educate patients or coordinate their care.) Almost all of the programs used standardized curricula and evaluated educational effectiveness, through such means as monitoring clinical indicators, assessing patients' knowledge and self-reported behavior, and having patients repeat or explain information back to the care coordinator. All programs sent physicians regular written reports on patients. However, only 4 programs focused on increasing physicians' adherence to evidence-based or guideline-based care (eg, monitoring microalbuminuria among patients with diabetes or prescribing angiotensin-converting enzyme inhibitors or angiotensin receptor blockers for patients with reduced ejection fraction) (Table 1), in part from the program staff 's concerns about burdening and possibly alienating physicians. Most programs referred patients to or arranged for support services (such as home-delivered meals or transportation), but this was not usually their focus, in part because only a minority of patients required such services.
Fourteen of the 15 programs attempted to improve care coordination (eg, by improving communication between patients and physicians, better managing care setting transitions, and addressing problems of polypharmacy). However, they used different approaches to meet this goal. Twelve programs taught patients to communicate with their physicians more effectively through role-playing and by helping patients articulate their concerns more clearly. Ten programs had timely information on the majority of patient hospitalizations or emergency department encounters; thus, only these programs could systematically intervene with patients at the time of transition. However, procedures for addressing such adverse events tended to be relatively unstructured and not consistently applied. Fourteen programs relied on patients to furnish care coordinators with lists of current medications. Only 4 programs also received this information from other sources, such as medical chart review. When medication problems were identified, care coordinators in 8 programs regularly consulted with pharmacists (or the programs' medical directors) before asking physicians to resolve them.
In addition to qualitatively describing each program's interventions, we also used site visit data and a structured instrument with high interrater reliability to quantitatively score each program's efforts in 10 specific intervention areas, such as patient education, improving physician practice, and health information technology (Table 1).52 When interrater and interobserver reliability were assessed, 5 of the 10 domains had intraclass correlation coefficients of excellent (range, 0.82-0.93) and 2 were good (0.69 and 0.80, respectively).52
The caseloads of care coordinators for half of the programs ranged between 40 and 70 patients. Eleven programs contacted patients 1 to 2.5 times per month, but 3 programs contacted patients 4 to 8 times per month (the other program did not record all contacts). All programs contacted patients primarily by telephone; however, 4 programs contacted patients in person nearly once a month as well. In 13 programs, at least 85% of contacts were initiated by care coordinators. Six programs (3 for all patients and 3 only for selected patients) used home telemonitoring devices for daily transmission of physiological readings and symptom reports.
The CMS paid each program a negotiated fixed fee ranging from $80 to $444 per member per month, with an average of $235. The CMS limited the fees of the programs to 20% or less of the projected average monthly Medicare expenditures of the target population, because prior literature suggested larger savings were unlikely.55,56 The actual amounts paid to the programs over the follow-up, which were lower than the negotiated rates because patients who disenrolled remained in the study, ranged from $60 to $270 per member per month, with an average of $164.
Data on hospital admissions and Medicare expenditures were obtained from the Medicare Standard Analytic File. The Medicare National Claims History File provided data on all other services used and the program fees paid. Patient characteristics and eligibility for Medicare were taken from the Medicare Enrollment Database. A computer-assisted telephone survey of patients conducted by MPR in each of the 12 programs with sufficient enrollment for analysis provided data on patient behavior, health outcomes, and satisfaction with health care. Approximately 350 patients in the treatment group and 350 patients in the control group in each of 7 programs were randomly selected for the survey (in the 5 programs with 400 to 700 total study patients, all were included) 12 to 18 months after program start-up and were interviewed approximately 10 months after their randomization date. The survey response rates for all 12 programs were very high (mean, 95%) and were very similar for the treatment and control groups.
The patients who were surveyed are generally comparable with those who were not (eTable 1). The patients selected for the survey were those enrolled in the first 12 months of the program, so the small differences reflect changes in composition of enrollees and the secular increases in Medicare expenditures over time. For all 15 programs, MPR selected and interviewed by telephone a sample of physicians caring for enrolled patients.52 A total of 1018 physicians were surveyed and the overall response rate was approximately 64%. The 25% of surveyed physicians who were not familiar with the program were not interviewed further. The survey instrument examined 8 areas, including physicians' opinions about the program's perceived effects, physician rating of care coordinators' clinical competence, and an overall assessment of the program.
Outcome measures drawn from Medicare claims data included postenrollment Medicare hospital use, Part A and B Medicare expenditures, and program fees covering the period from enrollment through June 2006. The study did not collect information on non–Medicare-covered expenditures such as those covered by Medicaid, private or supplemental insurance, or beneficiary out-of-pocket payments. Prescription drug expenditures were not available (and were not covered by Medicare during the study period). All outcome measures were prespecified in a detailed study design protocol.57
Medicare claims data were also used to construct 6 disease-specific and 2 general preventive clinical service quality-of-care process measures. The beneficiary survey also collected data on several broad categories of quality-of-care process measures: receipt of health education, receipt of service arrangement assistance, and general preventive clinical services; most of these categories included several measures. Quality-of-care outcome measures in the survey included patients' knowledge and adherence, unmet needs, functional status, and health-related quality of life (with each category again consisting of several measures). Finally, quality-of-care outcome measures from Medicare claims data consisted of 8 different types of general and disease-specific hospitalizations that are thought to be preventable with good quality primary care (also known as potentially preventable hospitalizations or ambulatory care-sensitive conditions).58- 60
Effects were calculated using prespecified analyses and an intention-to-treat design that included all sample members randomized to the treatment and control groups regardless of whether they actually participated in the intervention. Results were calculated for each program separately because the 15 interventions, their target populations, and their practice environments differed widely. Two-tailed statistical tests were conducted by using SAS version 9.1 (SAS Institute Inc, Cary, North Carolina). A P<.10 level was used to test outcomes rather than a P<.05 level because we were concerned with type II as well as type I errors and, in the event that no or few programs were effective at the P<.05 level, we wanted to be sure that potentially effective programs at the P<.10 level were identified. Examination of related outcomes was used to help assess whether differences significant at only the P<.10 level were likely due to chance or true effects.
Given the mandated focus on total Medicare expenditures (including program fees), the study assessed whether each program generated savings in regular Medicare expenditures for Part A and B services, and whether those savings were sufficient to cover the program's fees. The analyses of number of hospitalizations and Medicare expenditures included all beneficiaries enrolled in the study through June 2005 for whom the program had provided a correct Medicare Health Insurance Claim number (<0.1% of cases were excluded). Mean annualized hospitalizations and expenditures per month were measured from program enrollment through June 2006 or until the sample member died or became ineligible (ie, was no longer covered under both Medicare Parts A and B, no longer had Medicare as the primary payer, or enrolled in a Medicare managed care plan). Observations were weighted to reflect the number of months for which the sample member met these criteria. The analyses of claims-based quality-of-care measures included all beneficiaries enrolled in the study through June 2004 and outcomes were measured over the first 24 months after program enrollment or until the sample member died or became ineligible. The claims-based quality-of-care measures were all binary; cases in which no event was observed were weighted to reflect the proportion of the 24-month follow-up for which the sample member met these criteria but cases in which an event was observed were given a weight of 1.
Treatment-control comparisons of hospitalizations, expenditures, and claims-based quality-of-care measures were regression-adjusted by using ordinary least squares; logistic regression was used for the quality-of-care binary outcomes (eg, whether or not someone received a service). The regression controlled for age, sex, race, original reason for Medicare entitlement, whether the sample member also enrolled in Medicaid, Medicare expenditures per month over the 2 years before randomization, whether the sample member used specific services over that period (home health care, skilled nursing facility, hospital services), and prior diagnoses on claims for 10 chronic conditions (CAD, CHF, stroke, diabetes, cancer, chronic obstructive pulmonary disease, dementia, peripheral vascular disease, depression, and asthma).
To address the problem of multiple test bias, given the large number of outcome measures examined for quality of care, we grouped outcomes by domain. Statistically significant treatment-control differences (at the 5% level) in any domain were not considered meaningful unless their number exceeded 5% of the outcomes included in that domain (the number expected to occur by chance) and were all in the same direction (either all favoring the treatment group or the control group, in contrast to the roughly equal numbers of differences expected to favor both groups by chance alone).
Twelve of the 15 programs (all except Georgetown, Quality Oncology, and University of Maryland) had sample sizes large enough to yield power of 65% to 99% to detect effects of 20% or more on number of hospitalizations and on Medicare expenditures without program fees. Thus, the evaluation was likely to detect sizable true effects, such as those that were being touted by disease management programs at the time the demonstration began.61,62 Given the need to cover program fees and the desire to find programs that generate net savings, this power was sufficient for addressing the primary questions. However, to reduce the likelihood that smaller effects were overlooked, we also examined the treatment-control differences in hospitalizations and expenditures to identify sites with sizable differences in both P values just above the .10 level used in testing, recognizing that this step could further increase the chance for type I error (finding an association where no true association existed).
Compared with all Medicare beneficiaries, enrollees had higher educational levels and were less likely to be Hispanic, younger than 65 years, or be enrolled in Medicaid (a proxy for poverty) (Table 2). This reflects a combination of where the programs operated, their eligibility criteria, and which beneficiaries chose to enroll. The most common conditions for which the study sample had been treated in the 2 years before enrollment were CAD (61%), CHF (48%), diabetes (39%), and chronic obstructive pulmonary disease (32%); 17% were treated for depression (Table 2). Most of the programs enrolled high-cost patients; preenrollment Medicare expenditures averaged more than $2000 per month during the year before enrollment for the research sample in 6 programs, but less than $600 per month for 3 other programs. The average monthly Medicare expenditure of $1535 for the research sample overall was nearly 3 times that of beneficiaries nationwide ($552 per month in 2003).63 Across all of the 15 programs and the baseline characteristics shown in Table 2, the treatment and control groups differed significantly on only 11 of the 255 comparisons at the P<.05 level, less than the expected number of statistically significant differences that would be observed by chance (Table 3).
The maximum number of follow-up months was 51; on average, members of the research sample had 30 months of eligibility during follow-up. Programs varied widely in their patients' average length of potential exposure to the program, with a range of 18 to 38 months (Table 4).
Two programs had treatment-control differences in the number of annual hospitalizations that were significant at the 5% level. Mercy reduced annual hospitalizations by 0.168 per person per year (17% of the control group mean, P = .02) (Table 4). The Charlestown program had an increase of 0.118 hospitalizations per person per year or 19% more than the control group mean (P = .04). In addition, Georgetown reduced annual hospitalizations by 0.494 per person per year (24% of the control group mean, P = .07).
None of the programs reduced regular Medicare expenditures, even without the fees paid to the care coordination programs (Table 5). Only 2 programs had a significant difference in expenditures and, in both of these programs, the treatment group had higher expenditures (Charlestown by $186 per member per month [19%, P = .03] and Carle by $61 per member per month [9%, P = .08]). For 2 programs, the treatment group had substantially lower expenditures than the control group (monthly treatment group expenditures for Health Quality Partners [HQP] were $84 [12%] lower than those of the control group; monthly treatment group expenditures for Mercy were $112 [9%] lower than those of the control group). However, these differences were not statistically significant (P = .12 for each). As a sensitivity test, we calculated the effects on expenditures after trimming the values of outlier cases to the value for the 98th percentile of that program's sample members (available upon request). Mercy's 9% reduction (P = .12) increased to 10% and became significant using the 10% level (P = .07). Using the logarithm of cost to reduce the variance of estimates inflated by outliers led to increases in precision similar to those obtained from trimming the outliers. Results for HQP did not change substantially.
For total Medicare expenditures including program fees, the treatment groups for 9 programs had 8% to 41% higher total expenditures than the control groups did, all statistically significant (P < .05) (Table 5). However, one of these programs, Mercy, had 17% fewer hospitalizations in the treatment group than in the control group (P = .02), and 9% lower monthly Medicare Part A and B expenditures ($112, P = .12). The Mercy program received a large fee of $236 per member per month, resulting in a net increase in total expenditures of $134 or 11% (P = .07) relative to the control group.
The 90% confidence intervals (CIs) on the treatment-control difference in total expenditures are indistinguishable statistically from zero for the remaining 6 programs (HQP, Medical Care Development, QMed, Georgetown, Quality Oncology, and University of Maryland). We cannot reject the hypothesis that those 6 programs may have been cost-neutral (ie, the savings in regular Medicare expenditures might be enough to offset the program fees). Among these 6 programs, the treatment group at Georgetown had 24% fewer hospitalizations than the control group did and 14% ($358) lower Medicare expenditures, excluding fees (P = .31), more than enough to offset the program fee it received of $240, but the program was small and could not be sustained. Health Quality Partners had lower hospitalization rates and Medicare expenditures without program fees for the treatment group of 11% and 12%, respectively, but the comparisons were not statistically significant (P=.19 and P=.12). In addition, the 90% CI of −$68 to $107 around the difference between treatment and control groups is consistent with either an increase or a decrease in Medicare cost. The pattern of evidence, including lack of effects on hospitalizations, suggests cost neutrality is quite unlikely for the other 4 programs (Medical Care Development, QMed, and the small Quality Oncology program clearly had no effects on hospitalizations and University of Maryland's program had expenditures 35% higher in the treatment group than in the control group).
The patient survey collected information on several process-of-care quality measures. Treatment group members were approximately 1.3 to 2.6 times (6.1 to 40 percentage points) more likely than control group members to recall receiving education during the preceding 12 months on diet, exercise, and warning signs of disease exacerbation, and receiving educational materials (Table 6). Treatment group members were also much more likely to report having received help in arranging care. Results for the other process measures were less uniformly positive, with only a few scattered effects for self-reported influenza and pneumococcal vaccinations, mammography (measured using Medicare claims data), and various routine tests in the care of diabetes and CAD (measured using claims data) (Table 6).
For the survey-based outcomes-of-care measures, despite reporting much higher rates of being taught self-management skills, treatment group members were no more likely than control group members to say they understood proper diet and exercise, or to state that they were adhering to prescribed or recommended diet, exercise, and medications (eTable 2). The treatment group had only a few favorable differences of modest size for functioning, measured using activities of daily living and instrumental activities of daily living, and health-related quality of life, such as freedom from emotional distress or pain, or the 12-Item Short Form Health Survey physical component score. Among the outcomes-of-care measures ascertained from Medicare claims data, no sustained pattern for any programs emerged that suggested that potentially preventable hospitalizations had been reduced.
Because only 2 programs, HQP and Mercy, had sizable enrollments and large treatment-control differences in regular Medicare expenditures, we will focus on these 2 remaining programs. Both programs had significantly more patients in the treatment group than in the control group reporting they received health education on diet, received help in arranging care, and had been taught about exercise, medication use, warning signs, and provision of educational materials (Table 6). Both HQP and Mercy increased treatment group members' satisfaction with specific aspects of their regular health care, such as physicians keeping in touch with each other and explanations of treatment, with P<.10 but P>.05 for the comparisons (eTable 2). Like most of the other programs, no overall improvements in patient behaviors, quality of life, activity of daily living, instrumental activity of daily living, or preventable hospitalizations were observed for HQP. For Mercy, there were improvements in only 7 of the 39 measures, including ability to bathe independently, whether felt calm or peaceful most of the time, and preventable hospitalizations for CHF (all with P<.10 but P>.05).
Physicians generally believed that the programs had favorable effects on their practices, such as by decreasing paperwork and telephone traffic, and increasing quality of care. Physicians valued care coordinators' progress reports on patients and found the programs helpful in arranging transportation, meals, and therapy for patients. Physicians did not think programs helped much with specialist appointments or expensive prescription medications, and they had mixed opinions on programs' abilities to coordinate care with other physicians, foster continuity of care, reduce duplicate testing, or improve communication with family members. Physicians did not believe that programs improved patients' self-management behaviors. Overall, physicians thought the programs produced few negative effects and generally liked them.52
Our results suggest that care coordination, as practiced by the programs participating in the demonstration from 2002 to 2006, holds little promise of reducing total Medicare expenditures for beneficiaries with chronic illnesses. Only 2 programs (Mercy Medical Center and Georgetown) had favorable statistically significant treatment-control differences in hospitalizations and sizable differences (−9.3% and −14.0%, respectively) in Medicare expenditures. However, the Georgetown program was not viable, enrolling only 230 patients over the first 3 years of program operations (approximately one-third of the target for the first year) and dropped out of the demonstration after 3.5 years. One other program, HQP, also showed promise, with both hospitalizations and expenditures approximately 11% lower for the treatment group than the control group. Further exploration of the findings for this program, which risk-stratified its patients at enrollment, showed that these treatment-control differences were concentrated entirely in the program's highest severity cases, comprising approximately 30% of the sample, for which the control group's average expenditures were approximately $900 per member per month (data available from authors on request). For this subgroup, both differences were large (−29% for hospitalizations and −20% for expenditures) and statistically significant (P = .009 and P = .07, respectively).
Despite these underwhelming results for care coordination interventions in general, the favorable findings for Mercy and HQP suggest that the potential exists for care coordination interventions to be cost-neutral and to improve patients' well-being. Health Quality Partners' low fee was nearly fully offset by the treatment group's lower regular Medicare expenditures, and based on the subgroup analyses these savings would have been substantially larger had the program targeted only the highest severity group. Mercy Medical Center had substantially higher fees ($236 per member per month) that were only partially offset by the lower Medicare expenditures for the treatment group ($112 per member per month). However, if the sizeable reduction in hospitalizations could be achieved more efficiently, the program could be cost-effective. Unfortunately, despite the positive effects on patient education, patient behavior, functional ability, and quality of life were not improved.
A driving factor in CMS's initial decision to include many programs in the demonstration and allow them the flexibility to define their own interventions and target populations was the need to define which features of a care coordination program were important to improving outcomes. Comparing the 2 programs with the most positive results, HQP and Mercy, with the 10 unsuccessful programs with enough enrollees to yield credible estimates reveals 5 noteworthy differences. First, both of the successful programs averaged nearly 1 in-person contact per month per patient, far higher than the median of 0.3 for the 10 unsuccessful programs (only 2 of the unsuccessful programs averaged close to 1 in-person contact per month). Relatively frequent in-person contacts may be necessary to develop the level of trust that patients and their families need to consider the care coordinator an integral part of their care network and to confide in the coordinator and rely on his or her judgment. Second, these 2 programs had favorable effects on populations with average monthly Medicare expenditures of approximately $900 and $1200 in follow-up, whereas only 1 of the 10 unsuccessful programs enrolled a mix of beneficiaries with average Medicare expenditures in this range. This pattern suggests that programs may need to target patients who are neither at too low a risk of hospitalizations for the program to have effects in a 2- to 4-year follow-up, nor so seriously ill that it is too late for such interventions to ward off hospitalizations. Third, in both programs, treatment group members were significantly more likely than control group members to report being taught how to take their medications (only 1 other program displayed such a difference). Fourth, care coordinators for both HQP and Mercy worked closely with local hospitals, which provided the programs with timely information on patient hospitalizations and enhanced their potential to manage transitions and reduce short-term readmissions. Finally, care coordinators in both programs had frequent opportunities to interact informally with physicians. Many physician practices from which HQP drew its patients made onsite space available for the care coordinator to meet with the patients privately before or after their visits. Similarly, care coordinators for Mercy, which served a rural area, frequently arranged to meet their patients in local clinics at the time of patients' appointments with their primary care physicians. Many of the care coordinators knew the physicians well, because all were employees of Mercy Medical Center. Both of these programs further strengthened the care coordinator−physician ties by assigning all of a given physician's patients to the same care coordinator whenever practical. Only 2 of the other 10 programs had both features, care coordinators co-located with physicians and a single care coordinator assigned to each physician's patients. The 2 programs also had the 2 highest researcher ratings on patient education, among all 15 programs. Unfortunately, despite these intensive programs, few substantial changes in patient behavior were observed.
Based on the findings from this study, CMS offered these 2 most successful MCCD programs the opportunity to continue their interventions for up to 3 additional years, with continuation depending on whether an interim evaluation shows evidence of persistent favorable effects. Both programs accepted. The fees at Mercy Medical Center were reduced to $113 per member per month to match the savings it generated in Medicare expenditures during the demonstration period, less than half the average fee it received during the demonstration. The interim results will be available in 2010.
The main limitation of this study is that the large variance in Medicare expenditures and (for some programs) low program fees resulted in only 4 sites having adequate power to detect reductions in standard Medicare expenditures large enough to offset the program fees (available upon request). Nevertheless, the sample sizes are larger than most published studies of care coordination and the estimates for hospitalizations clarify the conclusions regarding cost neutrality. In any case, it is clear that even if savings could be achieved they would be modest, even for the most successful programs. The strengths of the study are the use of a randomized design in each program and a considerably longer follow-up than any prior care coordination studies we have identified, as well as evaluation of 15 different interventions in different settings.
The mixed findings from the MCCD are more favorable than the negative results of 2 other efforts by CMS to introduce care coordination or disease management into Medicare fee-for-service—the Medicare Health Support64 pilot program and the Medicare Disease Management Demonstration.65 Although the programs participating in those 2 studies differ from the MCCD programs in that they are much larger and operated exclusively by commercial disease management companies, they had similar goals and target populations. None of those programs showed significant reductions in hospitalizations or Medicare expenditures.
Our results, when coupled with other recent evidence from the literature, suggest that the most effective intervention for care coordination would be a combination of an ongoing model such as that offered by Mercy and HQP, with a proven transitional care model to prevent hospital readmissions. Two proven transitional care models16,37,38 enroll patients while they are in the hospital and both have shown large reductions in readmissions within 30 or 60 days after discharge when the patient is at high risk (18% of hospitalized Medicare beneficiaries are readmitted within 30 days), making this a potentially rewarding area for generating savings.66 Neither the successful MCCD programs nor similar types of effective interventions39,40 have implemented structured interventions such as these when a hospitalization does occur—a time when patients with chronic illnesses are disoriented, receiving fragmented care, and at especially high risk of not understanding what they need to do to avoid a readmission. Thus, a hybrid model would seem to be more effective than either model in isolation and the 2 models would be logical complements.
These findings are relevant to recent policy interest in medical homes as a way to improve care coordination, improve quality, and reduce costs.67 By providing close links between the patient's nurse coordinator and physician, substantial in-person contact between the patient and the care coordinator, and (presumably) timely information on hospital admissions, the medical home model may be able to replicate or exceed the success of the most effective MCCD programs. However, the modest benefits suggest that future research will need to determine how to more effectively improve patient outcomes. The successful interventions also may offer more detailed lessons for medical homes about how best to educate and monitor patients, the types of patients for whom they are likely to be most effective, and how to help patients overcome barriers to better self-care.
Corresponding Author: Randall Brown, PhD, Mathematica Policy Research Inc, 600 Alexander Pk, Princeton, NJ 08550 (email@example.com).
Author Contributions: Dr Brown had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.
Study concept and design: Peikes, Chen, Schore, Brown.
Acquisition of data: Peikes, Chen, Brown.
Analysis and interpretation of data: Peikes, Chen, Schore, Brown.
Drafting of the manuscript: Peikes, Chen, Schore, Brown.
Critical revision of the manuscript for important intellectual content: Peikes, Chen, Schore, Brown.
Statistical analysis: Peikes, Chen, Schore, Brown.
Obtained funding: Peikes, Chen, Schore, Brown.
Administrative, technical, or material support: Peikes, Brown.
Study supervision: Brown.
Financial Disclosures: The authors are all salaried employees of Mathematica Policy Research Inc and receive no compensation from any other source. They do not own stock in any of the programs being evaluated or stand to profit in any way, directly or indirectly, from particular findings in this article.
Funding/Support: Data collection and original analysis were funded by grant 500-95-0047 from the Centers for Medicare & Medicaid Services (CMS). There were no industry sponsors of this study. The writing of the manuscript was funded solely by Mathematica Policy Research Inc.
Role of the Sponsor: The CMS staff were involved in the selection of program sites for the demonstration and the decision to require random assignment. CMS was not otherwise involved in the design and conduct of the study, in the collection, management, analysis, or interpretation of the data, or in the preparation, review, or approval of the manuscript.
Disclaimer: The opinions and interpretations expressed in this article are solely the authors and do not represent the views of CMS staff or the programs that were evaluated.
Previous Presentations: Results from an earlier version of the study, similar to those shown here, are contained in a report to Congress dated February 21, 2008, for which broad public release has been delayed until publication in this journal. A brief overview of some of the results was presented in part at the Academy Health's Annual Research meeting; June 9, 2008; Washington, DC.
Additional Contributions: We thank our Mathematica colleague Carol Razafindrakoto for programming support. Mr Razafindrakoto, a salaried Mathematica employee like the authors, received no special compensation for his work on the manuscript. We also appreciate the care coordination programs and Medicare beneficiaries for participating in the study site visits and surveys. They received no compensation for these activities and had no role in the analysis or preparation of the manuscript.