Patient-physician racial/ethnic and gender concordance regression models included the following variables: patient age, gender, race/ethnicity, education, income, marital status, insurance, primary language, overall health, and mental and emotional health; physician age, gender, race/ethnicity, academic track (gender concordance only), and academic rank; and clinical encounter type, location, and specialty. Scores refer to scores on the Press Ganey Outpatient Medical Practice survey. Patient-physician concordant pairs serve as the reference.
eAppendix. Patient Survey
eFigure 1. Study Selection Flow Diagram
eFigure 2. Adjusted Odds Ratios for Secondary Patient Experience Outcomes by Patient-Physician Gender Discordant Pairs
eTable 1. Patient Characteristics by Physician Race/Ethnicity
eTable 2. Adjusted Odds Ratios for Secondary Patient Experience Outcomes by Patient-Physician Racial/Ethnic Discordant Pairs
eTable 3. Primary Model for Patient-Physician Racial/Ethnic Binary Concordance
eTable 4. Primary Model for Patient-Physician Gender Binary Concordance
Customize your JAMA Network experience by selecting one or more topics from the list below.
Identify all potential conflicts of interest that might be relevant to your comment.
Conflicts of interest comprise financial interests, activities, and relationships within the past 3 years including but not limited to employment, affiliation, grants or funding, consultancies, honoraria or payment, speaker's bureaus, stock ownership or options, expert testimony, royalties, donation of medical equipment, or patents planned, pending, or issued.
Err on the side of full disclosure.
If you have no conflicts of interest, check "No potential conflicts of interest" in the box below. The information will be posted with your response.
Not all submitted comments are published. Please see our commenting policy for details.
Takeshita J, Wang S, Loren AW, et al. Association of Racial/Ethnic and Gender Concordance Between Patients and Physicians With Patient Experience Ratings. JAMA Netw Open. 2020;3(11):e2024583. doi:10.1001/jamanetworkopen.2020.24583
Is patient-physician racial/ethnic or gender concordance associated with the patient experience as measured by scores on the Press Ganey Outpatient Medical Practice Survey?
In this cross-sectional analysis of 117 589 Press Ganey surveys completed for the adult outpatient practices of an urban, academic health system from 2014 to 2017, physicians among racially/ethnically discordant patient-physician dyads had significantly lower odds of receiving the maximum patient experience score compared with those among concordant dyads.
In this study, higher Press Ganey survey scores were associated with racial/ethnic concordance between patients and their physicians; thus, efforts to improve the patient experience among racially/ethnically discordant patient-physician dyads may be necessary to improve health care delivery.
The Press Ganey Outpatient Medical Practice Survey is used to measure the patient experience. An understanding of the patient- and physician-related determinants of the patient experience may help identify opportunities to improve health care delivery and physician ratings.
To evaluate the associations between the patient experience as measured by scores on the Press Ganey survey and patient-physician racial/ethnic and gender concordance.
Design, Setting, and Participants
A cross-sectional analysis of Press Ganey surveys returned for outpatient visits within the University of Pennsylvania Health System between 2014 and 2017 was performed. Participants included adult patient and physician dyads for whom surveys were returned. Data analysis was performed from January to June 2019.
Patient-physician racial/ethnic and gender concordance.
Main Outcomes and Measures
The primary outcome was receipt of the maximum score for the “likelihood of your recommending this care provider to others” question in the Care Provider domain of the Press Ganey survey. Secondary outcomes included each of the remaining 9 questions in the Care Provider domain. Generalized estimating equations clustering on physicians with exchangeable intracluster correlations and cluster-robust standard errors were used to investigate associations between the outcomes and patient-physician racial/ethnic and gender concordance.
In total, 117 589 surveys were evaluated, corresponding to 92 238 unique patients (mean [SD] age, 57.7 [15.6] years; 37 002 men [40.1%]; 75 307 White patients [81.6%]) and 747 unique physicians (mean [SD] age 45.5 [10.6] years; 472 men [63.2%]; 533 White physicians [71.4%]). Compared with racially/ethnically concordant patient-physician dyads, discordance was associated with a lower likelihood of physicians receiving the maximum score (adjusted odds ratio [OR], 0.88; 95% CI, 0.82-0.94; P < .001). Black (adjusted OR, 0.73; 95% CI, 0.68-0.78; P < .001) and Asian (adjusted OR, 0.55; 95% CI, 0.50-0.60; P < .001) patient race were both associated with lower patient experience ratings. Patient-physician gender concordance was not associated with Press Ganey scores (adjusted OR, 1.00; 95% CI, 0.96-1.04; P = .90).
Conclusions and Relevance
In this study, higher Press Ganey survey scores were associated with racial/ethnic concordance between patients and their physicians. Efforts to improve physician workforce diversity are imperative. Delivery of health care in a culturally mindful manner between racially/ethnically discordant patient-physician dyads is also essential. Furthermore, Press Ganey scores may differ by a physician’s patient demographic mix; thus, care must be taken when publicly reporting or using Press Ganey scores to evaluate physicians on an individual level.
The patient experience is at the heart of patient-centered medical care. There have been increasing efforts not only to measure the patient experience but also to publicly share patient ratings for individual physicians, sometimes linking patient ratings to physician employment and compensation,1 with the ultimate goal of motivating improvements in health care delivery. The Press Ganey (PG) Outpatient Medical Practice Survey2 is used by many practices to evaluate the patient experience. Despite its wide use, the associations between the patient experience and patient, physician, and clinical encounter characteristics remain unclear.
Much work has focused on patient and physician race/ethnicity or gender and their associations with the patient experience and specific aspects of the patient-physician relationship.3-21 The existing studies on this topic are quite heterogenous in terms of populations evaluated, methods used, and outcomes measured, which may, in part, contribute to mixed findings. Although some studies suggest that there are benefits associated with patient-physician racial/ethnic or gender concordance,8,10-12,19 others do not.5,13-19,21 In light of existing racial/ethnic and gender health and health care disparities,22 as well as increasing diversification of the US population,23 a better understanding of the associations between patient and physician demographics and the patient experience of health care is a necessary and important step toward our collective goal of achieving health equity. Furthermore, an evaluation of these associations is critical to quality improvement efforts and appropriate interpretation and use of patient experience ratings. Many of the existing studies of the PG survey have been limited by relatively narrow patient populations, exclusion of patient and physician characteristics that may be important determinants of the patient experience, or focus on specific medical specialties.5,10,20,24 We evaluated the associations between the patient experience as measured by the PG survey and patient-physician racial/ethnic and gender concordance among a diverse, urban, ambulatory patient population across all medical specialties.
We conducted a cross-sectional analysis to evaluate the associations between the patient experience, as measured by PG scores obtained from the PG Outpatient Medical Practice Survey, and patient-physician racial/ethnic and gender concordance. Our primary data source was patient experience surveys, including the complete PG survey and components of the Clinician and Group Consumer Assessment of Healthcare Providers and Systems (CG CAHPS) surveys (eAppendix in the Supplement)25 for outpatient visits within the University of Pennsylvania Health System (UPHS) between July 2, 2014, and May 30, 2017. Patient and clinical encounter characteristics were obtained from the survey and the electronic medical record; physician characteristics were obtained from the University of Pennsylvania faculty database.
This study was approved by the University of Pennsylvania institutional review board. A waiver of informed consent was approved because the data were deidentified. The study is reported according to the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guideline.26
The PG survey evaluates 6 domains (Access, Moving Through Your Visit, Nurse/Assistant, Care Provider, Personal Issues, and Overall Assessment) with 2 to 10 Likert-type item questions per domain and 5 answer options ranging from 1 (very poor) to 5 (very good).2 Paper or electronic surveys were sent to all patients who received outpatient care within UPHS. Patients did not receive surveys if they declined solicitation or did not have a valid mailing or email address on file. The survey response rate during the study period was 19.9%.
The study included returned surveys for all adult (aged ≥18 years old) outpatient visits with nontrainee physicians within UPHS during the study period. Surveys were excluded if any of the following applied: (1) clinician lacked MD or DO credentials; (2) patient or physician age, gender, or race/ethnicity was missing; (3) patient or physician race/ethnicity other than non-Hispanic White (hereafter referred to as White), non-Hispanic Black (hereafter referred to as Black), non-Hispanic Asian (including Southeast Asians and Pacific Islanders; hereafter referred to as Asian), or Hispanic ethnicity of any race (hereafter referred to as Hispanic); or (4) the primary outcome was missing. Only the first survey for any patient-physician pair was included, resulting in 1 survey for each unique patient-physician pair (eFigure 1 in the Supplement).
The primary exposures were patient-physician racial/ethnic and gender concordance. Concordance was evaluated both as a binary measure and by specific patient-physician racial/ethnic and gender pairs. Patient race/ethnicity was obtained from self-report in the CG CAHPS survey or, if unavailable, from the medical record. Patient gender was obtained from the medical record. Self-reported physician race/ethnicity and gender were obtained from the university’s faculty database. Race/ethnicity was categorized as White, Black, Asian, or Hispanic. Gender was categorized as male or female.
Other patient characteristics included age, marital status, insurance, highest education level, median annual household income of the patient’s zip code of residence, primary language, overall health, overall mental or emotional health, and level of assistance with survey completion. Highest education level, overall health, overall mental or emotional health, and level of assistance with survey completion were patient-reported from the survey. All other variables were self-reported and obtained from the medical record.
Other physician characteristics were obtained from the university’s faculty database and included self-reported age, type of medical degree, additional degrees, and faculty track and rank. Clinical encounter characteristics were obtained from the medical record and included medical specialty, visit type, location, and year.
The primary outcome was derived from the score for the item, “Likelihood of your recommending this care provider to others” in the Care Provider domain of the PG survey. Because the responses were highly skewed, with most scores clustered at the high values, we primarily used the top box scoring method, whereby scores were dichotomized as 5 (maximum score) vs less than 5.2 This method is consistent with current practices for patient experience score reporting.10,24,27,28 We also reported mean scores on a 1 to 5 scale.29
Each of the remaining 9 questions under the Care Provider domain of the PG survey served as secondary outcomes. Each secondary outcome score was dichotomized as 5 vs less than 5.
Patient, physician, and clinical encounter characteristics were summarized with descriptive statistics. To investigate associations between each outcome and patient-physician racial/ethnic and gender concordance, we used generalized estimating equations clustering on physicians with exchangeable intracluster correlations and cluster-robust standard errors to obtain population-averaged estimates of statistical associations. We used purposeful selection to build the multivariable regression model for the primary outcome.30 Patient and physician age, gender, and race were included in all models, as well as any patient, physician, or clinical encounter characteristics that were significantly associated with the primary outcome at α level of .05 in univariable analyses. For the models using binary racial/ethnic or gender concordance, physician race/ethnicity or gender, respectively, were excluded to avoid overfitting. For the regression models evaluating the associations between specific patient-physician racial/ethnic or gender pairs and the primary outcome, indicator variables for race/ethnicity or gender of the patient and physician and their interaction term (product), respectively, were included. To compare the odds of the outcome for each specific patient-physician racial/ethnic or gender pairs, we computed the odds ratios (ORs) from the linear combinations of each discordant pair from their respective final regression models. Associations between secondary outcomes and patient-physician racial/ethnic and gender concordance were evaluated using the final multivariable models for the association between the primary outcome and racial/ethnic and gender concordance, respectively. Concordant race/ethnicity or gender served as the reference in all models. P values were not adjusted for multiplicity for the secondary outcomes because they were exploratory in nature. Statistical analyses were performed using Stata statistical software version 15 (StataCorp). Statistical significance was determined by 2-sided P values at P < .05. Data analysis was performed from January to June 2019.
The total number of surveys during the study period was 246 138, representing 124 957 unique patients and 2249 unique physicians. After application of study selection criteria (eFigure 1 in the Supplement), the number of surveys was 117 589, representing 92 238 unique patients and 747 unique physicians.
Patient characteristics are summarized in Table 1. The mean (SD) age was 57.7 (15.6) years, and 37 002 patients (40.1%) were men. The racial/ethnic distribution was majority White (75 307 patients [81.6%]) followed by Black (11 759 patients [12.7%]), Asian (3087 patients [3.3%]), and Hispanic (2085 patients [2.3%]). Most patients had commercial insurance (49 537 patients [53.7%]) and reported being at least a 4-year college graduate (55 255 patients [59.9%]). The mean (SD) median annual household income was $78 834 ($31 470).
Physician characteristics are summarized in Table 2. The mean (SD) age was 45.5 (10.6) years, and 472 (63.2%) were men. Racial/ethnic distribution was majority White (533 physicians [71.4%]) followed by Asian (157 physicians [21.0%]), Black (34 physicians [4.6%]), and Hispanic (23 physicians [3.1%]). Most physicians had an MD degree (737 physicians [98.7%]) and were assistant professors (375 physicians [50.2%]). Patient characteristics stratified by physician race/ethnicity are summarized in eTable 1 in the Supplement.
Clinical encounter characteristics are summarized in Table 3. Most encounters represented return visits (75 844 encounters [64.5%]) and were for medical (40 010 encounters [34.0%]) and surgical (35 116 encounters [29.9%]) specialties.
Overall, physicians in 67 504 of 77 051 (87.6%) racially/ethnically concordant patient-physician encounters received the maximum score, compared with physicians in 33 280 of 40 538 (82.1%) discordant patient-physician encounters (a 5.5-point difference). In adjusted analyses, physicians in racially/ethnically discordant patient-physician pairs were significantly less likely to receive the maximum score compared with those in concordant pairs (adjusted OR, 0.88; 95% CI, 0.82-0.94; P < .001).
The proportions of encounters in which physicians received the maximum score and the mean scores for encounters by specific patient-physician racial/ethnic pairs are summarized in Table 4. The distribution of scores differs by physician race/ethnicity within each patient racial/ethnic group but also differs by patient racial/ethnic group. White patients were generally most likely to provide the maximum score for their physicians across all physician racial/ethnic groups (range, 84.9% [3025 of 3562 encounters] to 87.9% [65 775 of 74 850 encounters]; mean score, 4.80 among all physicians), whereas Asian patients were the least likely to provide the maximum score for their physicians (range 70.3% [116 of 165 encounters] to 76.5% [78 of 102 encounters]; mean score, 4.61 among all physicians).
Adjusted ORs for the primary outcome by specific patient-physician racial/ethnic pairs are summarized in the Figure. Among White patients, Asian physicians had lower odds of receiving the maximum score compared with White physicians (adjusted OR, 0.87; 95% CI, 0.78-0.97; P = .01); Black physicians also had lower odds of receiving the maximum score vs White physicians but the difference was not significant (adjusted OR, 0.79; 95% CI, 0.60-1.04; P = .09). The absolute difference in mean score between White and Asian physicians was 0.03, and that between White and Black physicians was 0.05. The likelihood of receiving the maximum score was similar between Hispanic and White physicians (adjusted OR, 1.02; 95% CI, 0.78-1.35; P = .87). Among Black patients, White (adjusted OR, 0.73; 95% CI, 0.55-0.97; P = .03) and Asian (adjusted OR, 0.67; 95% CI, 0.50-0.90; P = .01) physicians were each less likely to receive the maximum score compared with Black physicians. The OR for Hispanic physicians was not statistically significant (adjusted OR, 0.80; 95% CI, 0.56-1.16; P = .24). The absolute difference in mean score between Black and White physicians was 0.03, and that between Black and Asian physicians was 0.06. The ORs among Asian and Hispanic patients were not significant for any of the patient-physician pairs.
ORs for the secondary outcomes by discordant patient-physician racial/ethnic pairs are summarized in eTable 2 in the Supplement. The ORs are similar to those for the primary outcome.
Overall, physicians in 57 924 of 67 290 (86.1%) gender-concordant patient-physician encounters received the maximum score, compared with physicians in 42 860 of 50 299 (85.2%) gender-discordant patient-physician encounters. In adjusted analyses, physicians in gender discordant patient-physician pairs were equally as likely to receive the maximum score as those in concordant pairs (adjusted OR, 1.00; 95% CI, 0.96-1.04; P = .90). Adjusted ORs for the primary outcome by specific patient-physician gender pairs are summarized in the Figure and reveal no significant differences. Adjusted ORs for the secondary outcomes by patient-physician gender pairs are shown in eFigure 2 in the Supplement.
Associations between other patient, physician, and clinical encounter characteristics and the primary outcome are summarized in eTable 3 and eTable 4 in the Supplement. Patient characteristics consistently associated with higher odds of giving the maximum score in both race/ethnicity and gender concordance models include older age, male gender, married or partnered or divorced or widowed marital status, Medicare or Medicaid insurance, and better self-reported overall and mental or emotional health. Patient characteristics consistently associated with lower odds of giving the maximum score include Black (race/ethnicity concordance model, adjusted OR, 0.73; 95% CI, 0.68-0.78; P < .001) and Asian race (race/ethnicity concordance model, adjusted OR, 0.55; 95% CI, 0.50-0.60; P < .001), higher education levels, and non-English primary language. Higher physician rank (professor) was consistently associated with higher odds of receiving the highest score; older physician age was associated with lower odds of receiving the highest score. Return and procedural visits were each associated with higher odds of receiving the top score. Clinical encounters in specialties outside of medicine were associated with lower odds of receiving the highest score.
In this study of PG scores from patients seen in the ambulatory practices of an urban, academic medical center, we found that patient-physician racial/ethnic concordance was associated with patients’ reported experience with their physicians. Compared with physicians in racially/ethnically concordant patient-physician encounters, physicians in discordant dyads were less likely to receive the maximum patient rating. On an absolute level, there was a 5.5-point difference between the percentages of physicians receiving the highest score in discordant vs concordant dyads. Among specific patient-physician racial/ethnic dyads, relative and absolute measures showed that White patients who saw Asian (0.03 difference in mean score) or Black (0.05 difference in mean score) physicians were less likely to rate their physicians favorably than their counterparts who saw White physicians; the latter difference was underpowered to reach statistical significance in adjusted analyses. Black patients who saw White (0.03 difference in mean score) or Asian (0.06 difference in mean score) physicians were less likely to rate their physicians favorably compared with those who saw Black physicians. To put these differences into perspective, in another study of the PG survey,2 a decrease in mean score of 0.02 on the 1-to-5 scoring scale was associated with a substantial decrease in physician ranking from the 100th to 70th percentile.
Our findings add to the existing literature by presenting new information that supports the positive associations between the patient experience as measured by the PG survey and patient-physician racial concordance. Although 2 prior studies20,24 evaluated PG scores in broad medical settings, the associations between physician ratings and patient and physician characteristics were mixed. These studies were limited by the patient populations included (predominantly White and Asian patients24 and inpatients20) and the exclusion of characteristics that we found to be important determinants of physician ratings. We evaluated patient-physician racial/ethnic dyads beyond those of White and Asian racial groups and accounted for a more comprehensive group of patient, physician, and visit characteristics. As such, our study lends further support to the clinical benefits that others have found to be associated with racially/ethnically concordant patient-physician interactions.8,11,12,19 These benefits include better patient-physician communication,6,31,32 patient care,33 and outcomes34 and have been suggested to be attributable to decreased bias between patients and physicians.6,7 Together, this information should be viewed as a call to action to vigorously support the training of underrepresented minority medical students and residents while also ensuring the promotion and retention of underrepresented minority physicians. However, the provision of health care to minority patients should not fall solely to minority physicians. It is, therefore, imperative that we also improve cultural mindfulness among all physicians so that they are prepared to care for a diverse patient population in an equitable manner.
We also identified other PG score patterns that raise questions about how these scores should be used. PG scores for physicians who saw White patients were numerically higher than for physicians who saw non-White patients. In adjusted analyses, Black and Asian patient race were each associated with lower odds of the physician receiving the maximum score compared with White patient race. These findings are consistent with and expand upon prior studies24,35 that observed that Asian patients rate their physicians lower than do White patients. We also found that, among all patients except for Black patients, the absolute proportion of physicians who received the maximum score was lowest among Black physicians. Furthermore, among White patients, the odds of receiving the maximum score were similar between White and Hispanic physicians, the latter of whom may, in some cases, be less obviously identifiable by White patients as being ethnically discordant. Although these patterns may reflect truly different patient experiences, they also raise the possibility that there are racial/ethnic differences in patient expectations of or biases toward their physicians that may influence PG scores. This is especially relevant in our current environment of #WhatADoctorLooksLike,36 in which examples of the lay public questioning the medical authority of underrepresented minority physicians are too common. Thus, care should be taken in publicly reporting individual PG scores or using them for physician compensation and promotion decisions so as not to disincentivize physicians from caring for a diverse patient population or accelerate job dissatisfaction.1,37
With respect to gender, in contrast to some studies, we did not find an association between patient-physician gender concordance and PG scores. As suggested in prior work,10 it is possible that associations between patient-physician gender concordance and patient experience ratings are particular to specific medical specialties.
Study strengths include a large survey sample; robust patient-level, physician-level, and visit-level data; examination of all Care Provider domain PG survey questions; and inclusion of a racially/ethnically diverse patient population. Limitations include the low survey response rate and the potential for nonresponse bias, small numbers among some racial/ethnic groups (especially Black and Hispanic physicians), and lack of generalizability to patient and physician populations with different sociodemographic compositions. The survey response rate for our study is typical of the PG survey, as has been reported in other literature,2,10,20,24,38 and, although it is beyond the scope of the present study, the presence of nonresponse bias has been suggested in at least one study of the PG survey.38 Nonresponse bias may contribute to skewed responses in both the positive and negative directions. Future studies aimed at quantifying the effects of nonresponse bias are necessary for appropriate interpretation of PG scores.
This cross-sectional analysis provides new insights about the associations between the patient experience as measured by PG scores and patient-physician racial/ethnic concordance as well as patient race/ethnicity. Further investigation to understand the reasons for differential scoring across specific patient and physician characteristics is warranted. In the meantime, although we recognize the value of PG scores as a measure of the patient experience, we also encourage health care leaders, administrators, and insurers to reevaluate the appropriateness of using PG scores as a primary and reportable measure of physician performance on an individual level.
Accepted for Publication: September 9, 2020.
Published: November 9, 2020. doi:10.1001/jamanetworkopen.2020.24583
Open Access: This is an open access article distributed under the terms of the CC-BY License. © 2020 Takeshita J et al. JAMA Network Open.
Corresponding Author: Junko Takeshita, MD, PhD, MSCE, University of Pennsylvania, 3400 Civic Center Blvd, PCAM South Tower, 7th Flr, Office 728, Philadelphia, PA 19104 (email@example.com).
Author Contributions: Drs Takeshita and Shin had full access to all of the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. Drs Shin and Sawinski contributed equally as senior authors.
Concept and design: Takeshita, Loren, Shin, Sawinski.
Acquisition, analysis, or interpretation of data: All authors.
Drafting of the manuscript: Takeshita, Wang, Loren, Shin, Sawinski.
Critical revision of the manuscript for important intellectual content: Takeshita, Loren, Mitra, Shults, Shin, Sawinski.
Statistical analysis: Wang, Mitra, Shults, Shin, Sawinski.
Administrative, technical, or material support: Takeshita.
Supervision: Takeshita, Shin, Sawinski.
Conflict of Interest Disclosures: Dr Takeshita reported receiving a research grant from Pfizer Inc (to the Trustees of the University of Pennsylvania) for work that is unrelated to this manuscript and receiving payment for continuing medical education work related to psoriasis that was supported indirectly by Eli Lilly and Novartis. Dr Mitra reported receiving grants from the National Institutes of Health during the conduct of the study. No other disclosures were reported.
Funding/Support: Dr Takeshita is supported by a National Institute of Arthritis and Musculoskeletal and Skin Diseases K23-AR068433 award to the Trustees of the University of Pennsylvania.
Role of the Funder/Sponsor: The funder had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.
Additional Contributions: Lisa M. Bellini, MD, McWelling H. Todman, and Craig J. Loundas, PhD (Perelman School of Medicine, University of Pennsylvania), granted permission to use and assisted with accessing the data necessary for our study. Ting-Shan Chiu, MS (Perelman School of Medicine, University of Pennsylvania), created an analytic data set. None of these individuals was compensated for these contributions.
Create a personal account or sign in to: