Colin P. West, Mashele M. Huschka, Paul J. Novotny, Jeff A. Sloan, Joseph C. Kolars, Thomas M. Habermann, Tait D. Shanafelt. Association of Perceived Medical Errors With Resident Distress and EmpathyA Prospective Longitudinal Study. JAMA. 2006;296(9):1071–1078. doi:10.1001/jama.296.9.1071
Author Affiliations: Division of General Internal Medicine, Department of Medicine (Dr West), Division of Biostatistics, Department of Health Sciences Research (Dr Sloan, Ms Huschka, and Mr Novotny), Division of Gastroenterology and Hepatology, Department of Medicine (Dr Kolars) and Division of Hematology, Department of Medicine (Drs Habermann and Shanafelt), Mayo Clinic College of Medicine, Rochester, Minn.
Context Medical errors are associated with feelings of distress in physicians, but little is known about the magnitude and direction of these associations.
Objective To assess the frequency of self-perceived medical errors among resident physicians and to determine the association of self-perceived medical errors with resident quality of life, burnout, depression, and empathy using validated metrics.
Design, Setting, and Participants Prospective longitudinal cohort study of categorical and preliminary internal medicine residents at Mayo Clinic Rochester. Data were provided by 184 (84%) of 219 eligible residents. Participants began training in the 2003-2004, 2004-2005, and 2005-2006 academic years and completed surveys quarterly through May 2006. Surveys included self-assessment of medical errors and linear analog scale assessment of quality of life every 3 months, and the Maslach Burnout Inventory (depersonalization, emotional exhaustion, and personal accomplishment), Interpersonal Reactivity Index, and a validated depression screening tool every 6 months.
Main Outcome Measures Frequency of self-perceived medical errors was recorded. Associations of an error with quality of life, burnout, empathy, and symptoms of depression were determined using generalized estimating equations for repeated measures.
Results Thirty-four percent of participants reported making at least 1 major medical error during the study period. Making a medical error in the previous 3 months was reported by a mean of 14.7% of participants at each quarter. Self-perceived medical errors were associated with a subsequent decrease in quality of life (P = .02) and worsened measures in all domains of burnout (P = .002 for each). Self-perceived errors were associated with an odds ratio of screening positive for depression at the subsequent time point of 3.29 (95% confidence interval, 1.90-5.64). In addition, increased burnout in all domains and reduced empathy were associated with increased odds of self-perceived error in the following 3 months (P=.001, P<.001, and P=.02 for depersonalization, emotional exhaustion, and lower personal accomplishment, respectively; P=.02 and P=.01 for emotive and cognitive empathy, respectively).
Conclusions Self-perceived medical errors are common among internal medicine residents and are associated with substantial subsequent personal distress. Personal distress and decreased empathy are also associated with increased odds of future self-perceived errors, suggesting that perceived errors and distress may be related in a reciprocal cycle.
Medical errors and patient safety are an important concern for patients and physicians. Medical errors have received increased attention since 1999, when the Institute of Medicine reported that up to 100 000 US patients die each year because of preventable adverse events.1 The proportion of hospitalized patients affected by medical errors has been estimated to be 5% to 10%,2- 6 although it has approached 50% in some studies.7,8 The morbidity, mortality, and financial costs of these events may be great.9- 13
Many reports on medical errors have focused on the rate at which errors affect patients. Less commonly addressed is the proportion of physicians who commit errors. Several studies have evaluated this rate among resident physicians.14- 16 In a cross-sectional study examining self-defined errors, Mizrahi14 found that 47% of internal medicine residents reported making serious errors during their training. In another cross-sectional study of internal medicine residents, Wu et al15 found that 45% of surveyed residents at various stages of training reported making at least 1 mistake (defined as “an act of omission by any caregiver which would have been judged wrong by knowledgeable peers at the time it occurred”) during their residency. More recently, Jagsi et al16 surveyed residents across multiple specialties and found that 18% of respondents acknowledged at least 1 adverse event (defined as “a complication, injury, or harm to a patient resulting from medical management [not from the patient's underlying condition or disease]”) in a patient under their care during the previous week. More than one third of these events was classified as a mistake by the resident, using the same definition as Wu et al.15 An additional 23% of respondents reported a near miss (defined as “a mistake that does not reach the patient or if it reaches the patient does not result in injury or harm”) for which they felt at least partially responsible.
These studies suggest that a significant proportion of graduate medical trainees make medical errors. Committing errors can have a significant impact on clinicians. If patients are the first victims of medical errors, physicians have been termed the “second victims”17 and commonly experience feelings of distress, guilt, shame, and depression in response to medical errors.15,18- 23 These effects may be long lasting,14,15,24- 26 with some physicians feeling “permanently wounded” as a result.20
To date, studies of the effect of medical errors on physicians have been descriptive cross-sectional evaluations that preclude quantitative assessment of the temporal relationship or magnitude of the association between errors and physician distress. We used a prospective longitudinal design (the Mayo Internal Medicine Well-being Study) to evaluate the frequency of perceived medical errors among internal medicine residents and to measure the association of these medical errors with resident quality of life (QOL), burnout, symptoms of depression, and empathy using validated metrics.
All entering categorical and preliminary internal medicine trainees in academic years 2003-2004, 2004-2005, and 2005-2006 at the Mayo Clinic Rochester Internal Medicine Residency program were eligible to participate. Residents in these academic classes had attended 108 US and international medical schools. These residents spend approximately half of their rotations on in-hospital services with overnight call responsibilities, with the remaining time spent in outpatient and subspecialty consultation rotations. Residents were invited to participate in this study during their orientation before beginning residency or by telephone if unable to attend orientation. Participation was elective. Individuals who desired to participate signed written informed consent to be surveyed every 3 months. The Mayo Clinic institutional review board approved this study.
Residents were electronically surveyed every 3 months throughout their training. The current study focuses on data collected through May 2006. As of this date, categorical residents beginning training in 2003, 2004, and 2005 had received 12, 8, and 4 surveys, respectively. Preliminary (1-year) residents beginning training in 2003, 2004, and 2005 had received 5, 5, and 4 surveys, respectively. Surveys included questions about demographic characteristics, current rotation characteristics, coping strategies for dealing with stress, and report of self-perceived medical errors. Validated survey tools were used to measure QOL, burnout, symptoms of depression, and empathy, as described below. Self-reported medical errors and QOL were assessed quarterly; to avoid an excess burden on participants, burnout, symptoms of depression, and empathy were evaluated every 6 months. No member of the Mayo Clinic Department of Medicine had access to identifying information on study items for individual participants. Nonidentifying numeric codes were used by statisticians to preserve resident anonymity when longitudinal data were collected and analyzed. Residents who screened positive for depression were identified by a separate study statistician and received a letter by certified mail notifying them of this result and informing them of confidential resources available to those desiring help.
Self-reported Medical Errors. Perceived medical errors were evaluated by self-report every 3 months by asking residents, “Are you concerned you have made any major medical errors in the last 3 months?” The intent of this question was to identify events internalized by residents as major medical errors, rather than to document events associated with patient risk. Accordingly, major medical errors were not specifically defined for the residents. Thus, self-reported errors in this study represent major medical errors as perceived by each resident. Residents reporting errors were also asked to indicate to whom they had spoken about these errors.
QOL, Burnout, and Depression. Quality of life was measured by a single-item linear analog scale assessment (LASA). This instrument measures overall QOL on a 0 to 10 scale, with response anchors ranging from “as bad as it can be” (0) to “as good as it can be” (10). This scale has been validated across a wide range of medical conditions and populations.27- 31
Burnout is a syndrome encompassing 3 domains (depersonalization, emotional exhaustion, and a sense of low personal accomplishment) that is associated with decreased work performance.32 Burnout was measured using the Maslach Burnout Inventory (MBI), a 22-item tool evaluating each of these domains.32 Responders rate the frequency with which they experience various feelings or emotions on a 7-point Likert scale, with response options ranging from “never” to “daily.” Higher values of depersonalization (Maslach Burnout Inventory–Depersonalization) and emotional exhaustion (Maslach Burnout Inventory–Emotional Exhaustion [MBI-EE]) and lower values of personal accomplishment (Maslach Burnout Inventory–Personal Accomplishment) signify burnout. This instrument has been used in numerous previous studies of physicians.33- 36
Depression screening used the 2-question approach described by Spitzer et al37 and validated by Whooley et al.38 This instrument has been used in a variety of patient populations,37,38 including 2 studies of physicians.33,39 This tool includes questions about depressed mood and anhedonia: “During the past month, have you often been bothered by feeling down, depressed, or hopeless?” and “During the past month, have you often been bothered by little interest or pleasure in doing things?” A positive screen for depression is defined as a “yes' response to either question. This screening instrument has reported positive likelihood ratios of up to 3.42 for the diagnosis of current major depression and negative likelihood ratios as low as 0.07.37,38 These likelihood ratios are typical of other depression screening instruments reported in the literature.38,40 Assuming a 25% prevalence of depression similar to that reported in other samples of internal medicine residents,33,36,41 screening positive on this instrument implies a probability of depression of up to 53%, whereas a negative screen result implies a probability of depression as low as 2%.
Empathy. Empathy is a multidimensional construct with cognitive and emotive domains. Cognitive empathy relates to an individual's ability to understand the perspective of another person about his or her circumstances. Emotive empathy refers to an individual's concern for the feelings of others.42- 46 Empathy was measured with the Interpersonal Reactivity Index (IRI), a 28-item instrument with 4 separate 7-item subscales evaluating different dimensions of empathy that are considered independently.42 Each question item is scored on a Likert scale from 0 to 4, with response anchors ranging from “does not describe me well” (0) to “describes me well” (4), so that the maximum score for each subscale is 28. We included the IRI subscales measuring the cognitive and emotive domains of empathy according to previous studies demonstrating the utility of these subscales for evaluating empathy among resident physicians.36,46- 48
Standard univariate statistics were used to characterize the sample. Comparisons between residents reporting errors and residents reporting no errors were initially made using summary statistics, collapsing responses within each individual into a single average outcome.49 These comparisons were analyzed using the Wilcoxon-Mann-Whitney test for continuous variables and the Fisher exact test for proportions.
To incorporate the repeated-measures study design, the association of self-perceived errors with QOL, empathy, burnout, and depression was evaluated using generalized estimating equations (GEE), an extension of generalized linear models that accounts for correlated repeated measurements within individuals.49,50 Analyses were performed examining the association of self-perceived errors with distress at the subsequent time point. Because the surveys asked about self-perceived errors during the previous 3 months, these errors preceded the assessment of all distress variables except depression, for which some overlap occurred because the depression screening tool assesses symptoms of depression during the previous 4 weeks.
Finally, we analyzed the association of distress and empathy with the likelihood of a self-perceived error during the following 3 months. For these analyses, the assessment of all distress variables preceded the self-reported errors.
Because of multicollinearity among distress variables, each model included self-reported errors and 1 distress variable. To properly calculate variance terms for repeated-measures analyses, the GEE method requires that a correlation structure be specified. Selecting the correct correlation structure for GEE analyses does not in general affect parameter estimation but does allow more precise estimates. Where allowed by the data, we specified unstructured correlations. Alternative structures in order of preference were autoregressive and exchangeable.
Statistical analyses were conducted with SAS version 8.2 (SAS Institute Inc, Cary, NC). Statistical significance was set at the .05 level, and all tests were 2-tailed.
Participants were 184 (84%) of 219 eligible residents; there were no statistically significant differences in age, sex, or program type between participants and nonparticipants. The demographic characteristics of study participants are shown in Table 1. Age 30 years was used as a threshold to approximately separate residents with more standard medical education histories from those who may have had other life experiences before beginning their residency. The categories for debt were intended to be comparable with those of a study that investigated the relationship between debt level and stress.51 Of the participants, 100% completed at least 1 survey during the study period, with response rates to individual surveys ranging from 64% to 94% (mean, 72.2%). Baseline participant characteristics for QOL, burnout, depression screening, and empathy are shown in Table 2.
Errors were reported in 130 (14.7%) of 883 resident-quarters. Perceived error rates by quarter of training ranged from 4.3% to 23.1%. Overall, 34% of study participants reported at least 1 major medical error during the study period, and 43% of residents completing at least 1 year of training reported errors. Of the participants, 20% reported 1 error, 6% reported 2 errors, and 8% reported 3 or more errors during the study period. Self-perceived error rates did not vary significantly by age, sex, program type, amount of student loan debt, relationship status, or parental status. Of perceived errors, 97% were discussed with at least 1 individual. The most common group with whom errors were discussed was other residents (83%), although a majority of residents also reported discussing perceived errors with close family and friends (65%) or supervisory faculty (54%).
Summary measures to identify general associations between self-perceived errors and resident QOL, burnout, symptoms of depression, and empathy are shown in Table 2. Residents reporting at least 1 error during the study period had significantly lower overall QOL on the LASA (−0.52; P = .03). Residents reporting errors also had higher levels of burnout, as evidenced by increased depersonalization (+3.23; P<.001), increased emotional exhaustion (+6.85; P<.001), and a lower sense of personal accomplishment (−2.99; P = .001) on the MBI. More than 60% of residents reporting an error screened positive for depression at least once during the study period, nearly twice the rate in residents reporting no errors. Residents reporting errors had nonstatistically significant lower emotive empathy scores (−0.89; P = .15) and cognitive empathy scores (−0.65; P = .31) on the IRI.
The association of a self-perceived error with QOL, burnout, symptoms of depression, and empathy at the subsequent survey time point is shown in Table 3. Self-perceived medical errors had a statistically significant adverse association with overall QOL, all 3 domains of burnout, and the likelihood of screening positive for depression. For example, a self-perceived major medical error was associated with a 4.58-point increase in emotional exhaustion on the MBI-EE scale and with an increased odds of 3.29 (95% confidence interval, 1.90-5.64) for a positive depression screen result at the subsequent survey time point.
The association between distress at each survey point and a self-perceived error in the subsequent 3 months is shown in Table 4. Diminished empathy and higher levels of burnout in all domains were associated with increased odds of a self-perceived error in the subsequent 3 months. Each 1-point increase in depersonalization and emotional exhaustion score was associated with a 10% and 7% increase, respectively, in the odds of reporting an error in the following 3 months. Similarly, each 1-point increase in personal accomplishment, emotive empathy, and cognitive empathy score was associated with a 7%, 9%, and 9% decrease, respectively, in the odds of a self-perceived error in the following 3 months.
The addition of other potential confounding or interacting factors (type of clinical rotation, self-reported satisfaction with work-life balance, occurrence of a major negative life event [eg, divorce, death in the family], occurrence of a major positive life event [eg, marriage, birth in the family], and preferred coping strategies) to these models did not significantly alter the results. Residents in training for a shorter duration necessarily contributed fewer time points to this study. To evaluate this potential source of bias, analyses examining only residents who had completed their first year of training were conducted and yielded similar results.
This prospective longitudinal study shows that self-perceived major medical errors were common among the studied internal medicine residents, with about one third of participants reporting a major error at least once during the study period. In addition to their potential effects on patients, perceived medical errors exhibited a strong association with multiple domains of physicians' personal well-being. In particular, self-perceived errors were associated with decreased QOL and increases in burnout and symptoms of depression. Declines in empathy were also associated with perceived medical errors.
Previous research has demonstrated that changes in LASA scores approaching 0.5 SDs or more, as we observed, are clinically significant.52 Additionally, our reported associations of perceived errors with burnout are large, given that “low” and “high” categories on the MBI scales are separated by only 5 points (depersonalization), 9 points (emotional exhaustion), and 7 points (personal accomplishment).32 Bellini et al47 suggested that 1.0- to 1.5-point changes in resident empathy scores on the IRI are important, although it remains unclear what constitutes a clinically meaningful change in empathy score. Overall, this evidence suggests that our findings are not only statistically significant but also clinically meaningful.
When considered with previous studies demonstrating a link between personal distress and empathy,36,46 these results imply that medical errors represent an important contributor to the personal distress and loss of compassion reported in numerous studies of residents,36,47,48 which is significant not only because the personal effects of making an error can be profound but also because personal distress appears to negatively affect patient care.33,34,53,54 Our results also support the link between physician distress and subsequent self-reported errors. Taken together, these results suggest a vicious cycle whereby medical errors may lead to personal distress, which then contributes to further deficits in patient care.
In response to these findings, one step for residency programs should be to reduce medical errors to the greatest extent possible. System issues are cited as major contributors to medical error,1,55- 58 and system-based solutions have been explored,59,60 including interventions designed to reduce medication errors, control nosocomial infection rates, and prevent falls among hospitalized patients. Within graduate medical education, training environments that result in excessive resident fatigue have been targeted by duty-hour reforms.61- 63 However, all components of the Accreditation Council for Graduate Medical Education resident duty-hour limitations62 were in effect, with documented compliance, at our training program throughout the entire study, suggesting that the self-reported error rates we observed already reflect the benefits of this intervention.
System efforts are unlikely to eliminate errors completely, and when errors do occur, physicians often have limited resources on which to call for support.17,64,65 In our study, the majority of residents discussed their perceived errors with colleagues, supervisory staff, and close family or friends. Additional coping strategies reported in the literature include discussing errors with patients, accepting responsibility, and working to put error prevention methods in place after error analysis.18,23,64,66 Specific curricula on personal awareness and self-care to promote strategies for coping with the emotional impact of errors are needed but have been slow to develop.20,67- 69 Additional research is required to identify effective approaches to assist physicians who have made medical errors.
Our results also suggest that residency programs should ensure that efforts are in place to prevent, identify, and treat burnout and to promote empathy and well-being for the welfare of residents and patients. Further investigations of the impact of personal distress on error occurrence rates will help clarify the roles that issues such as physician QOL, burnout, and depression play in patient safety.
Our study has several limitations. First, the definition and reporting of major medical errors in this study were based on self-perception. The extent to which these self-perceived errors accurately reflect the frequency of medical errors and whether these perceived errors actually affected patient outcomes cannot be determined. Although no single method of measuring errors is ideally suited to all purposes,70 this approach reflects the study's aim of measuring the effects of perceived errors on physician well-being: errors physicians are not aware of would not be expected to have any such effects. Previous work has suggested that physician-identified adverse events differ from events identified by medical record review but that such self-reported adverse events may be more likely to represent preventable medical errors.71 It is also possible that a perceived error that results in adverse patient consequences could affect the level of distress experienced by the resident. Because we did not assess patient consequences, we are unable to address this issue.
Second, the generalizability of these results from a single academic medical center to other training programs is unknown. Our participation and survey response rates were high relative to those of other physician surveys,72,73 suggesting that the results of this study are representative of residents in our training program. Residents in this program attended a wide range of medical schools and work in clinical settings characteristic of academic residency training programs, suggesting they are likely to be representative of other academic training programs. In addition, the error rates,14- 16 burnout scores,33- 36,39 rates of a positive depression screen result,33,39 and empathy scores36,47,48 observed in this study were similar to those found in other samples of medical residents.
Third, our survey structure measured distress at defined survey points, but unmeasured distress could occur between survey time points and therefore could precede a perceived error within each period. Additionally, it is possible that retrospective error reporting is distorted by feelings of distress. Although it is unclear whether such feelings of distress would make reporting of errors more likely or less likely, the potential exists for current distress to influence error reporting and therefore affect our results. Because of these issues, our results are best interpreted as associations rather than as definitive evidence of causation.
Fourth, the depression screening instrument we used cannot diagnose depression by itself. Although the positive likelihood ratio for this instrument is similar to that of other accepted depression screening tools,38,40 the posttest probability of about 50% described previously means that additional evaluation would be necessary to diagnose depression in residents with positive screen results. Our findings suggest that a positive depression screening result is associated with self-perceived errors, but because of these limitations, further study is needed to conclusively link clinical depression with medical errors.
Fifth, some potential confounding variables could not be evaluated. For example, it is possible that personality traits such as being highly self-critical, confident, or reflective affect some aspects of how physicians perceive or respond to errors. Also, sleep deprivation could contribute to medical errors and resident distress and was not assessed in this study beyond documented institutional compliance with resident duty-hour limitations.
Finally, the models reported in this study evaluate the relationship between self-perceived errors and each distress variable individually. Because of multicollinearity between the distress variables, we have limited ability to separate the effects of individual distress variables from one another.
In summary, these results suggest that self-perceived medical errors are common among internal medicine residents. These errors are associated with significant subsequent personal distress. Burnout and loss of empathy are also associated with an increased risk of future self-perceived major medical errors. A majority of residents discuss their errors with colleagues, supervising faculty, or friends and family, but formal programs to provide additional support for physicians who make errors appear warranted. Further investigation to identify the most effective posterror support mechanisms is needed in parallel with ongoing system efforts to reduce error rates and resident distress.
Corresponding Author: Colin P. West, MD, PhD, Division of General Internal Medicine, Department of Medicine, Mayo Clinic College of Medicine, 200 First St SW, Rochester, MN 55905 (firstname.lastname@example.org).
Author Contributions: Dr West had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.
Study concept and design: West, Sloan, Kolars, Habermann, Shanafelt.
Acquisition of data: West, Huschka, Novotny, Sloan, Kolars, Habermann, Shanafelt.
Analysis and interpretation of data: West, Huschka, Novotny, Sloan, Shanafelt.
Drafting of the manuscript: West, Shanafelt.
Critical revision of the manuscript for important intellectual content: West, Huschka, Novotny, Sloan, Kolars, Habermann, Shanafelt.
Statistical analysis: West, Huschka, Novotny, Sloan.
Obtained funding: Sloan, Kolars, Habermann, Shanafelt.
Administrative, technical, or material support: Kolars, Habermann.
Study supervision: Sloan, Kolars, Habermann, Shanafelt.
Financial Disclosures: None reported.
Funding/Support: This work was supported by a Medicine Innovation Development and Advancement System grant from the Mayo Clinic Department of Medicine.
Role of the Sponsor: The funding source for this study played no role in the design and conduct of the study; in the collection, management, analysis, and interpretation of the data; or in the preparation of the manuscript. The funding source did review and approve the manuscript.