Comparison of Community-Level and Patient-Level Social Risk Data in a Network of Community Health Centers

This cross-sectional study explores the utility of community-level data for accurately identifying patients with social risks by comparing the social deprivation index score for the census tract where a patient lives with patient-level social risk screening data.


Introduction
Responding to the substantial research on the association between social risk factors and health, enthusiasm has grown around social risk screening in health care settings, and numerous health systems in the US are experimenting with social risk screening initiatives. [1][2][3][4] Recent studies have demonstrated the feasibility of implementing clinic-based screening and documenting social risks in electronic health records (EHRs), 5,6 yet no clear standard has emerged on how to implement social risk screening, nor how clinicians can or should use social risk information to adjust patient care or make referrals to community resources. 6,7 Moreover, some have questioned the benefit of integrating social risk screening into primary care, raising concerns about the additional burden of adding more required data collection to already busy primary care practices and the limited resources available to address identified social risk factors. [8][9][10] In the absence of standard social risk screening recommendations, some health systems are exploring obtaining social risk information without screening patients directly. 11,12 Community and neighborhood-level data characterizing the "conditions in which people are born, grow, live, work and age" 13 are readily available from public sources, such as the US Census or American Community Survey, and can be geocoded and linked to patients' addresses. Researchers have demonstrated the association between community-level measures of socioeconomic status and health. 14 Theoretically, such data could provide an alternative way to identify patients with social risks or to target patients for whom self-reported screening should be prioritized. 11 For example, community-level data could be used for "cold-spotting" to identify patients living in geographic areas that lack certain characteristics that support health, such as access to grocery stores and parks, higher rates of education and employment, clean air, and adequate housing. 15 Identifying patients who live in the most vulnerable communities, or "cold spots," could help clinics characterize and understand patients' social and economic contexts and/or target social risk screening efforts.
However, relying solely on community-level data to understand the social context of an individual patient and/or to guide patient-level interventions poses a risk of ecological fallacy, or making erroneous assumptions about individuals based on aggregate information. 16,17 Using patientlevel social risk screening data from a national network of community health centers (CHCs), 18 linked to geocoded data from the American Community Survey, we explored the utility of communitylevel data as a mechanism for identifying patients with social risks. Specifically, we compared the social deprivation index (SDI) score for the census tract where a patient lives with patient-level social risk screening data documented in the EHR to assess whether patients who live in "cold spots" have more patient-level social risks. 11,15

Study Setting
For this cross-sectional study, we analyzed data from the OCHIN network of CHCs. OCHIN Inc is a nonprofit health center-controlled network that hosts a centrally managed instance of the Epic EHR (Epic Systems Corporation) for 645 CHC clinics across 21 US states. OCHIN CHCs provide care for the nation's most vulnerable patients, the majority of whom are publicly insured or uninsured. Like most CHCs in the US, compared with the general population, patients who receive care at OCHIN CHCs are disproportionately poor, members of racial and ethnic minorities, and living with multiple chronic conditions. OCHIN hosts a research data warehouse that includes EHR data on more than 4.9 million patients, making it, to our knowledge, the largest single research-ready data source on US safety net patients. The research data warehouse also includes neighborhood and community-level data-also called community vital signs-from publicly available sources (eg, US Census, American Community Survey) that provide information about each patient's community context. Patient addresses, collected by OCHIN network clinics, are geocoded to identify the census tract of each patient's residence, then linked to the community vital signs data for that tract. The geocoding and linkage process and selection of measures are described in detail elsewhere. 12 In 2016, OCHIN released a suite of EHR tools to help clinics document and review patientreported social risk screening results. 19

Community-Level Social Risk
To quantify community-level social risk, all patients with a valid address in the OCHIN research data warehouse were assigned a census-tract level SDI score using information on their last available address recorded in the EHR. Originally developed by Butler et al 20 and updated in 2015, 21 the SDI is a composite measure of 7 demographic characteristics from the American Community Survey, including percentage living in poverty, percentage with less than 12 years of education, percentage of single-parent households, percentage living in a rented housing unit, percentage living in an overcrowded housing unit, percentage of households without a car, and percentage of nonemployed adults younger than 65 years. Census tracts with higher percentile scores have higher levels of social deprivation relative to other census tracts nationwide. For example, a census tract with an SDI score of 75 is considered "worse off" than 75% of census tracts in the US. On the other hand, a census tract with an SDI score of 5 is considered "better off" than 95% of census tracts in the US. This multidimensional measure has been shown to be more strongly associated with health outcomes than poverty alone. 18 Moreover, prior studies suggest that patients living in cold spots-defined as those census tracts with an SDI score in the highest quartile nationally (Ն75)-have worse health outcomes relative to those in more resource-rich tracts. 11

Patient-Level Social Risk
Patient-level measures of food insecurity, housing insecurity, and financial resource strain were included in our analysis. We classified patients as having social risks if they screened positive for 1 or more of these domains (see eTable 1 in the Supplement for a description of screening questions for each of these domains). These 3 domains were selected for several reasons. First, given the lack of standardized screening recommendations, CHCs are implementing screening in a variety of ways.
Instead of using an established tool (eg, PRAPARE, Accountable Health Communities), many CHCs have opted to focus on screening for specific social risk domains. To date, food insecurity, housing insecurity, and financial resource strain are among the most frequently documented social risk domains in OCHIN CHCs, in part because they are actionable (eg, through referrals to local resources). 18 Second, relative to other patient-level measures that focus on psychosocial domains (eg, relationship safety, social connection/isolation, stress), these domains are most aligned with the socioeconomic factors integrated into the SDI. Third, despite the limited consensus on standardized SDH screening recommendations, there is emerging consensus around standardized questions to assess food insecurity, housing insecurity, and financial resource strain.
OCHIN's EHR-based screening tools include standardized question and response categories for each of these domains. Food insecurity is assessed using a validated 2-item screen called the Hunger Vital Sign. 22 Housing insecurity is assessed differently depending on when the patient received the screening tool. Before May 17, 2018, patients were asked 2 items from the HealthBegins Upstream Risk screening tool about their living situation in the past month. 23 This has since transitioned to a 2-item question from the Accountable Health Communities screening tool about stability and quality of their housing. 2 Financial resource strain was assessed using the National Academy of Medicinerecommended screening question. 5,19,24 Finally, based on user feedback in pilot CHCs, screening questions were added in May 2017 asking whether patients desired CHC assistance in addressing identified risk factors. 19 When available, data on whether patients wanted help addressing their social risks were also extracted (see eTable 1 in the Supplement for a list of measures and response options).

Statistical Analysis
Descriptive statistics (counts and proportions) were generated to characterize the study sample.
First, each patient in the study sample was categorized into national quartiles based on the SDI of their census tract of residence (Q1: 1 to <25; Q2: 25 to <50; Q3: 50 to <75; Q4: 75-100). Second, the numbers and percentages of patients who reported food insecurity, housing insecurity, or financial

Census Tract SDI Quartile and Patient-Reported Social Risks
Overall, approximately 29.7% of sample patients (n = 10 858) screened positive for housing insecurity, food insecurity, and/or financial resource strain (Table 3) (Table 3).
Of the patients who screened positive for 1 or more social risks (n = 10 858), 23.6% (n = 2561) were asked whether they wanted help from clinic staff to address identified risks. Of those, 35.5% (n = 908) said they wanted help. Overall, 63.3% (n = 575) of those who wanted help lived in a cold spot census tract (Q4), with 19.2% (n = 174) in Q3; 12.4% (n = 113) in Q2; and 5.1% (n = 46) in Q1 (Table 3). Interestingly, when looking at the percentage who wanted help within each quartile, Q1 had the highest percentage of patients who wanted help to address identified risks (n = 46; 48.0%).
Conversely, Q4 had the highest percentage of patients who said they did not want help (n = 1046; 64.5%) ( Table 3).
Of those who screened positive for at least 1 social risk factor, 60.0% (n = 6516) resided in a cold spot census tract and would have been correctly identified as having a risk using a cold-spotting approach. However, 40.0% (n = 4342) of patients reporting 1 or more social risks would not be correctly identified ( Table 4). Of the 25 719 patients who did not screen positive for any social risk, 57.1% (n = 14 681) resided in a cold spot and thus would be incorrectly identified as having social risk using a cold-spotting approach (Table 4). Overall, the accuracy of the community-level data for identifying patients with and without social risks was 48.0% (n = 17 544).

JAMA Network Open | Health Informatics
Comparison of Community-Level and Patient-Level Data for Identifying Social Risk

Discussion
There is growing recognition of the integral role of the health care sector in identifying and addressing social factors and the importance of considering social context to improve health equity. 2-4,7,25 Yet critical knowledge gaps remain, and there is a limited evidence base regarding best practices for integrating social and medical care. 7 In the midst of this environment, numerous approaches have emerged, including both patient-level screening and efforts that leverage community-level social risk data, sometimes in combination. 1,5,6,10,18,[26][27][28][29][30][31] Recognizing the challenge, cost, and time involved with implementing patient-level social risk screening initiatives, some health care systems are exploring strategies for using publicly available community-level data as proxies of patient-level social and economic information and/or as a way to target social risk-targeted (directly addressing social risk factors) or social risk-informed care (adapting care to accommodate social risks). [6][7][8][9][10]27,32 In this study, we assessed concordance between patient-reported social risks and community-level social deprivation among patients receiving care in a national network of CHCs.
Building on previous work, 10,11 we defined cold spots as census tracts with SDI scores within the highest quartile nationally. Although there was some overlap between cold spot status and the presence of patient-level social risks, with 60.0% of those who reported at least 1 social need living in a cold spot census tract, 40.0% of patients who screened positive for at least 1 social risk did not live in a cold spot. Overall, the accuracy of the community-level data for identifying patients with and without social risks was 48.0%.
These findings have several implications for the use of patient-level and community-level social risk data in clinical settings. First, using community-level data as a proxy for patient-level social risk screening or to refine targets for patient-level data collection 10,29 may heighten the risk of ecologic fallacy, wherein incorrect assumptions are made about an individual based on aggregate-level information about a group. 6,16 Despite the potential utility of community-level data for a variety of purposes, including in cases when universal screening is not feasible, findings within our study population suggest that ecologic fallacy may in fact be an issue when using community-level data to identify patient-level social needs. Indeed, based on data from the OCHIN network, if a CHC were to

JAMA Network Open | Health Informatics
Comparison of Community-Level and Patient-Level Data for Identifying Social Risk use the cold spot approach to identify patients for screening or to implement a social care intervention, they would risk missing 40.0% (n = 4342) of patients with social risks who live in more affluent areas and/or incorrectly target 57.1% (n = 14 681) of patients without social risks who live in a cold spot.
Second, our results suggest that, when asked, a larger percentage of patients said that they did not want help addressing identified risks (64.5% [n = 1653] said that they did not want help vs 35.5% [n = 908] who did). This finding is supported by a 2019 qualitative study, 33 which found that although patients and caregivers believed social risk screening was important and acceptable, they did not expect their health care teams to address the social challenges they faced. Moreover, despite the low numbers, the counterintuitive finding that a higher percentage of patients within the most affluent quartile (Q1) said they wanted help could be indicative of a greater availability of resources in these areas, but this is unclear. Overall, these findings underscore the need for additional research to explore patient perspectives on social care screening and referrals, including whether and how health care teams should address identified risk factors. They also raise questions about whether and how the availability of resources in a patient's neighborhood or community-either perceived or actual-might influence their desire for help.
Third, our findings underscore the importance of identifying and exploring the potential for other less intensive strategies for understanding patient-level social risks. Medical professionals, especially those in primary care, have articulated the range of challenges involved in identifying and addressing social risks, including the increased burden of integrating these activities into an already burgeoning workoad. [9][10][11][12] One strategy would be to lower the burden of patient-level screening. There are several multiquestion screening instruments, 2-4 variability in measures, and limited understanding of their psychometric properties (ie, reliability and validity). 34 Potential next steps could include identifying a smaller set of risk factors to screen for in clinical settings, or alternatively, developing a valid and reliable single-question screening for social risks.
Finally, although our findings suggest that census tract-level SDI is not always accurate in identifying patient-level social risks, community-level data still have enormous value if targeted appropriately. As others have articulated, community-level social determinants of health or "community vital signs"-such as the availability of green spaces for walking, or environmental factors that might affect health-provide important contextual information about a patient's environment  that could inform care. 12,35 Beyond their utility in contextualizing patient care, community-level data are a vital source of information for community-level interventions (eg, advocacy, alignment) and could be used to inform the development value-based payment structures or approaches to risk adjustment. 6 Indeed, other countries have demonstrated the value of using area-based measures of socioeconomic variation to assess community needs, inform research, adjust clinical funding, allocate community resources, and determine policy impact. 36 More research is needed to understand how patient-level and community-level data can be used in concert to most effectively and efficiently invest limited resources.

Limitations
This study has several limitations. First, a primary limitation of our findings is the generalizability of the study sample. Patients who completed social risk assessments were a nonrandom sample of OCHIN patients and represented less than 1% of OCHIN patients seen during the study period. There was significant variation among OCHIN CHCs in the number of screenings conducted, the types of patients selected for screening, and the social risk questions that were asked. 18 Because of the limitations of these EHR data, we cannot determine if differing response rates across SDH domains can be attributed to patient refusal to answer or clinics' decisions not to ask them. Bias resulting from missing, inaccurate, or inadequate address data are also unknown. Second, this was an exploratory study using data from a relatively limited period and at the very beginning of social risk screening implementation in CHCs. Preliminary estimates from OCHIN CHCs suggest that the number of documented screenings is continuing to increase steadily. Future research should seek to replicate these results using a larger sample of patients and over an extended time frame. Third, we were unable to ascertain whether the SDI performed better in different geographic regions and/or in urban vs rural areas. Future research should explore whether the risk of misclassification varies between these geographic classifications and/or whether the SDI has more utility in some circumstances than in others. Finally, there are numerous community-level measures, beyond the SDI, that are available for public use. Although our findings suggest that relying on SDI in place of patient-level social risk screening may not be sufficient, future research should explore the utility of other measures of community-level social context for identifying high-risk individuals.

Conclusions
Our findings suggest that patient-level and community-level approaches to incorporating social risk data in clinical contexts are not equivalent. Although there is overlap in patients identified using each method, 48.0% of patients are misclassified, and many patients in less disadvantaged neighborhoods desired assistance. Using community-level social risk data to guide patient-level activities may mean that some patients who could benefit from programs targeting social conditions or care adjustments would not be identified.

JAMA Network Open | Health Informatics
Comparison of Community-Level and Patient-Level Data for Identifying Social Risk