Abstract

Data on race and ethnicity are crucial for identifying and addressing health disparities. Incomplete or inaccurate data constrain the ability of researchers, administrators, and policymakers to provide targeted assistance to groups who otherwise could be overlooked. Unfortunately, a large discrepancy exists between the many racial and ethnic categories individuals use to self-identify and available categories in typical databases. In the health care setting, categories for race and ethnicity generally follow the Office of Management and Budget (OMB) minimum standards for reporting federal data, which have not been updated since 1997.1Office of Management and BudgetRevisions to the standards for the classification of federal data on race and ethnicity.https://obamawhitehouse.archives.gov/omb/fedreg_1997standardsDate: 1997Date accessed: October 20, 2022Google Scholar The OMB minimum standards for classification of race and ethnicity (Table 1) use 5 categories for race (American Indian or Alaska Native, Asian, Black or African American, Native Hawaiian or Other Pacific Islander, and White) and 2 categories for ethnicity (Hispanic or Latino and Not Hispanic or Latino). These broad categories combine subgroups of individuals with distinct ancestries and cultures as well as socioeconomic realities. Many individuals—especially those who identify with minoritized subgroups—do not have the option to choose the specific ethnic group they prefer. Similarly, those who are of Middle Eastern or North African descent have not been provided options but historically have chosen White as their race. In this context, individuals become anonymous members of a larger group, and potential health disparities within these broader racial and ethnic categories are concealed in the aggregate data. In this article, our objectives are to show the importance of disaggregating racial and ethnic data in gastroenterology and hepatology, as well as to suggest solutions for addressing this problem at the individual, institutional, and societal levels (Table 2).Table 1OMB Racial and Ethnic Categories and Alternative Disaggregated CategoriesOMBHHSNew York StateaThese categories are required by NY S.6639-A/A.6896-A,17 which specifically addressed the Asian, Native Hawaiian, and Pacific Islander populations.American Indian or Alaska NativeAsian7 subgroups: Asian Indian, Chinese, Filipino, Japanese, Korean, Vietnamese, and Other Asian20 subgroups: 7 HHS subgroups, Laotian, Cambodian, Bangladeshi, Hmong, Indonesian, Malaysian, Pakistani, Sri Lankan, Taiwanese, Nepalese, Burmese, Tibetan, and ThaiBlack or African AmericanHispanic or Latino4 subgroups: Mexican/Mexican American/Chicano/a, Puerto Rican, Cuban, and Another Hispanic/Latino/Spanish originNative Hawaiian or Other Pacific Islander4 subgroups: Native Hawaiian, Guamanian or Chamorro, Samoan, and Other Pacific Islander6 subgroups: 4 HHS subgroups, Fijian, and TonganWhiteHHS, Department of Health and Human Services; OMB, Office of Management and Budget.a These categories are required by NY S.6639-A/A.6896-A,17The New York State SenateSenate Bill S6639A.https://www.nysenate.gov/legislation/bills/2021/S6639Date: 2021Date accessed: November 2, 2022Google Scholar which specifically addressed the Asian, Native Hawaiian, and Pacific Islander populations. Open table in a new tab Table 2Potential Solutions for Disaggregating Racial and Ethnic DataLevel of actionPotential solutionsImpactIndividualClinician: use disaggregated categories for intake forms or in clinical notesResearcher: collect disaggregated primary data, use existing disaggregated data sets, acknowledge limitation of aggregated data, and specify component groups+aThese are qualitative markers of impact, with + representing the least impact and +++ representing the most impact.InstitutionalDisaggregate categories in electronic health record (using a comprehensive list or based on local demographics), use evidence-based algorithms to reclassify existing data++aThese are qualitative markers of impact, with + representing the least impact and +++ representing the most impact.SocietalAdvocate for policy change on a local, state, or federal level by working with community-based or national organizations or by participating in Office of Management and Budget listening session with the public+++aThese are qualitative markers of impact, with + representing the least impact and +++ representing the most impact.a These are qualitative markers of impact, with + representing the least impact and +++ representing the most impact. Open table in a new tab HHS, Department of Health and Human Services; OMB, Office of Management and Budget. In the following section, we present 2 of the many examples in gastroenterology and hepatology that illustrate the importance of disaggregating race and ethnicity. Asian and Hispanic/Latino individuals, on aggregate, have among the lowest colorectal cancer screening uptake in the United States. In the 2015 National Health Interview Survey (NHIS), 62.4% of the overall population were up-to-date with screening, compared with 52.1% of Asian and 47.4% of Hispanic/Latino persons.2White A. Thompson T.D. White M.C. et al.Cancer screening test use—United States, 2015.MMWR Morb Mortal Wkly Rep. 2017; 66: 201-206Crossref PubMed Scopus (328) Google Scholar Because there are 49 Asian nations and 19 nations and territories in the Americas and Caribbean with predominantly Hispanic/Latino populations, it is unsurprising that screening uptake in Asian and Hispanic/Latino subgroups varies substantially. These nations and territories have distinct geopolitical histories and patterns of emigration, which have resulted in vastly different socioeconomic circumstances for immigrants who arrive in the United States. Disaggregated data collected by the NHIS indicate that screening uptake in the Hispanic/Latino population ranged from 63.2% for Puerto Rican to 36.0% for Mexican individuals.2White A. Thompson T.D. White M.C. et al.Cancer screening test use—United States, 2015.MMWR Morb Mortal Wkly Rep. 2017; 66: 201-206Crossref PubMed Scopus (328) Google Scholar Similarly, data from the 2014 New York City Community Health Survey showed that although 61.7% of Asian and Pacific Islander persons were up-to-date with colonoscopy, this ranged from 70.4 % for Chinese to 45.1% for Asian Indian individuals.3Rastogi N. Xia Y. Inadomi J.M. et al.Disparities in colorectal cancer screening in New York City: an analysis of the 2014 NYC Community Health Survey.Cancer Med. 2019; 8: 2572-2579Crossref PubMed Scopus (16) Google Scholar These granular details highlight disparities in communities that would benefit from targeted screening campaigns, and that otherwise would have been hidden in the aggregate data. Hepatitis B virus (HBV) is a major risk factor for cirrhosis and hepatocellular carcinoma, and the risk of HBV is higher for many individuals who are born outside of the United States. For this reason, the US Preventive Services Task Force recommends screening persons for HBV if they are either born in a country where the prevalence of HBV exceeds 2%, or are US-born persons who are unvaccinated and whose parents were born in countries where HBV prevalence exceeds 8%.4Krist A.H. Davidson K.W. et al.US Preventive Services Task ForceScreening for hepatitis B virus infection in adolescents and adults: US Preventive Services Task Force recommendation statement.JAMA. 2020; 324: 2415Crossref PubMed Scopus (25) Google Scholar The prevalence of HBV surpasses 8% in some Asian countries (Kyrgyzstan, Laos, Mongolia, Vietnam, and Yemen), and falls short of 2% in others (eg, India, Malaysia, and Japan). Similarly, the prevalence exceeds 2% in a handful of Latin American countries (Belize, Colombia, Ecuador, Peru, and Suriname). Without detailed disaggregated data on race, ethnicity, and country of origin for recent immigrants, it would be difficult to follow these recommendations in clinical practice and impossible to measure adherence or impact across a health care system. In this case, the likely consequence of relying on broad racial and ethnic categories would be both overscreening low-risk individuals and underscreening high-risk individuals. In addition to identifying hidden health disparities, there are other reasons to collect disaggregated racial and ethnic data. First, nuanced demographic information provides clarity in patient-centered care. Understanding the prevalence of disease in a specific group or community is intrinsically valuable, regardless of how it compares with other groups. Second, giving individuals a chance to select a racial or ethnic category that they self-identify with promotes a culture of inclusivity and counters the sense of invisibility that many communities feel. It may serve as a momentary but meaningful acknowledgment of a small community in the larger fabric of society. Third, disaggregated data help to challenge stereotypes, such as the heterogenous Asian American population being uniformly considered the model minority, having minimal health disparities, or universally benefiting from the healthy immigrant effect.5Yi S.S. Kwon S.C. Suss R. et al.The mutually reinforcing cycle of poor data quality and racialized stereotypes that shapes Asian American Health: study examines poor data quality and racialized stereotypes that shape Asian American health.Health Aff (Millwood). 2022; 41: 296-303Crossref PubMed Scopus (9) Google Scholar These stereotypes perpetuate a false sense of well-being that deprioritizes research and funding for these communities. Clinicians should recognize that screening and management recommendations for certain conditions may differ depending on a patient’s race, ethnicity, and country of origin. More nuanced data also allow clinicians to be aware of potential cultural sensitivities of their patients. Therefore, creating a workflow that incorporates the collection of disaggregated racial and ethnic data is a clinically valuable endeavor. If a provider uses a patient intake form or questionnaire and has control over its content, then adding additional racial and ethnic categories based on local demographics would be an efficient way to capture these data. Alternatively, updating clinical note templates to add a question about a patient’s self-identified race and ethnicity in the Social History section will accomplish a similar goal. Researchers should collect and report disaggregated racial and ethnic primary data whenever possible. When using secondary data, researchers should consider using data sources such as the NHIS or American Community Survey, which provide subcategories of the American Indian or Alaska Native, Asian, Native Hawaiian or Other Pacific Islander, and Hispanic/Latino classifications. In instances in which aggregated data must be used—whether because of the method of collection or because of small sample sizes for reporting—use of aggregated data should be acknowledged as a study limitation. Groups within the aggregated data category also should be specified when possible. We strongly encourage systems-level change in the electronic health record. Collecting disaggregated data is consistent with patient-centered care and upholds the principles of diversity, equity, and inclusion. Change at the level of an institution or health care system requires engagement of and support from administrators, and the impact will far exceed individual efforts. The prospective and retrospective data disaggregation initiatives at NYU Langone Health serve as illustrative examples. A collaboration between health equity researchers and the institutional leadership led to the introduction of multiple Asian and Hispanic/Latino subgroup options in the electronic health record for new patients. Simply providing the new categories, however, did not lead to a rapid shift in how patients were classified. From this experience, we learned that increasing awareness about the update among clinical staff and patients was a crucial step in the implementation process, and training and support are necessary for a successful rollout. Additional engagement with data managers responsible for reporting to federal agencies, clinicians, and community members have been critical toward building the new data capture system. In addition to prospective efforts, we also are exploring several methods to disaggregate existing data in the electronic health record. We have used a name list algorithm successfully to identify a large number of Arab Americans who often previously were misclassified as White, Other, or Unknown. The algorithm also doubled the size of Asian Americans in the data set and permitted specific identification of ethnic groups (eg, Chinese, Asian Indian). We also are using a statistical method called Bayesian Improved Surname Geocoding (BISG) to predict the probability of an individual belonging to broad racial and ethnic groups based on surname and residential address.6Elliott M.N. Morrison P.A. Fremont A. et al.Using the Census Bureau’s surname list to improve estimates of race/ethnicity and associated disparities.Health Serv Outcomes Res Methodol. 2009; 9: 69-83Crossref Scopus (124) Google Scholar BISG is the best algorithm available for classifying individuals with missing race and ethnicity, and it has been shown to have high predictive accuracy for Asian, Black, Hispanic/Latino, and White individuals.7Yee K. Hoopes M. Giebultowicz S. et al.Implications of missingness in self-reported data for estimating racial and ethnic disparities in Medicaid quality measures.Health Serv Res. 2022; 57: 1370-1378Crossref PubMed Scopus (2) Google Scholar,8Sartin E.B. Metzger K.B. Pfeiffer M.R. et al.Facilitating research on racial and ethnic disparities and inequities in transportation: application and evaluation of the Bayesian Improved Surname Geocoding (BISG) algorithm.Traffic Inj Prev. 2021; 22: S32-S37Crossref PubMed Scopus (4) Google Scholar Patient self-reported data remain the gold standard. However, evidence-based methods such as name lists and BISG can help direct quality improvement activities including translation services and educational support toward specific communities and geographic areas. At the institutional level, it is important to recognize that racial and ethnic data categories vary by region. For instance, individuals of Dominican descent account for 23% of the Hispanic/Latino community in the New York metropolitan area but make up less than 1% of the Hispanic/Latino population in the Los Angeles and Houston metropolitan areas.9Noe-Bustamante L. Key facts about U.S. Hispanics and their diverse heritage.https://www.pewresearch.org/fact-tank/2019/09/16/key-facts-about-u-s-hispanicsDate: 2019Date accessed: November 1, 2022Google Scholar Institutions should either adopt a comprehensive list of racial and ethnic subgroups10Agency for Healthcare Research and QualityRace, ethnicity, and language data: standardization for health care quality improvement.https://www.ahrq.gov/research/findings/final-reports/iomracereport/reldataaptabe1.htmlDate: 2018Date accessed: November 2, 2022Google Scholar or build one using local population demographics, preferably with feedback from the community. Regardless of how many categories are available for selection, it also is important that they can rollup or combine into the 5 race and 2 ethnicity groups in the OMB minimum standard to maintain comparability with other data sources. With support from the Robert Wood Johnson Foundation, the research institute PolicyLink produced a comprehensive report on methods for collecting and analyzing data and government policies that enable data disaggregation.11Rubin V. Ngo D. Ross A. et al.Counting a diverse nation: disaggregating data on race and ethnicity to advance a culture of health.https://www.policylink.org/sites/default/files/Counting_a_Diverse_Nation_08_15_18.pdfDate accessed: October 18, 2022Google Scholar At the federal level, modifying the OMB minimum standards remains the shortest path to widespread change. Although the US Census Bureau’s internal research in 2015 concluded that “it is optimal to use a dedicated” Middle Eastern or North African category,12US Census Bureau2015 National content test race and ethnicity analysis report.https://www2.census.gov/programs-surveys/decennial/2020/program-management/final-analysis-reports/2015nct-race-ethnicity-analysis.pdfDate: 2017Date accessed: November 2, 2022Google Scholar and a 2017 proposal from the Federal Interagency Working Group for Research on Race and Ethnicity concurred, the OMB ultimately rejected these recommendations and the 2020 Census did not incorporate the proposed change. However, the Biden administration OMB recently signaled a willingness to reassess the minimum standards by reconvening an Interagency Working Group and beginning a series of listening sessions with the public.13Orvis K. OMB launches new public listening sessions on federal race and ethnicity standards revision.https://www.whitehouse.gov/omb/briefing-room/2022/08/30/omb-launches-new-public-listening-sessions-on-federal-race-and-ethnicity-standards-revisionDate: 2022Date accessed: November 2, 2022Google Scholar It is worth noting that the Department of Health and Human Services, which oversees the NHIS, has adopted an expanded data standard for race and ethnicity (Table 1).14US Department of Health and Human ServicesExplanation of data standards for race, ethnicity, sex, primary language, and disability.https://minorityhealth.hhs.gov/omh/browse.aspx?lvl=3&lvlid=54#:∼:text=The%20OMB%20minimum%20categories%20for,and%20Not%20Hispanic%20or%20LatinoDate: 2021Date accessed: October 20, 2022Google Scholar The Centers for Medicare & Medicaid Services, which operates within the Department of Health and Human Services, also has announced that new enrollment forms for Medicare Advantage and Medicare prescription drug (Part D) plans are required to include disaggregated racial and ethnic categories in January 2023.15Centers for Medicare & Medicaid ServicesModel individual enrollment request form to enroll in a Medicare Advantage Plan (MA) or a Medicare prescription drug plan (Part D), and advance announcement of January 2023 software release - addition of race and ethnicity data fields on enrollment transactions.https://www.cms.gov/files/document/hpms-announcement-memo-race-and-ethnicity.pdfDate: 2022Date accessed: November 29, 2022Google Scholar The majority of federal agencies, however, continue to follow the OMB minimum standard. Although federal reform has been hindered, states have led efforts for data disaggregation. In 2000, the Massachusetts Department of Public Health developed a new data collection form that offered additional racial and ethnic groups choices, including Cape Verdean, Haitian, and Puerto Rican.11Rubin V. Ngo D. Ross A. et al.Counting a diverse nation: disaggregating data on race and ethnicity to advance a culture of health.https://www.policylink.org/sites/default/files/Counting_a_Diverse_Nation_08_15_18.pdfDate accessed: October 18, 2022Google Scholar In 2016, the Accounting for Health and Education in API Demographics Act became law in California and required the State Department of Public Health to collect more detailed data on Asian American, Native Hawaiian, and Pacific Islander groups, including Bangladeshi, Indonesian, Taiwanese, Fijian, and Tongan.16State of CaliforniaAB-1726.http://leginfo.legislature.ca.gov/faces/billNavClient.xhtml?bill_ id=201520160AB1726Date: 2016Date accessed: November 2, 2022Google Scholar In 2021, New York passed a law that required every state agency that collected racial and ethnic data to include options for at least 20 Asian and 6 Native Hawaiian and Pacific Islander groups (Table 1).17The New York State SenateSenate Bill S6639A.https://www.nysenate.gov/legislation/bills/2021/S6639Date: 2021Date accessed: November 2, 2022Google Scholar Critics of disaggregation argue the following: (1) these distinctions are a manifestation of identity politics that fragment US society, (2) disaggregation weakens the political strength of a larger umbrella group, and (3) smaller populations may be more susceptible to privacy concerns.11Rubin V. Ngo D. Ross A. et al.Counting a diverse nation: disaggregating data on race and ethnicity to advance a culture of health.https://www.policylink.org/sites/default/files/Counting_a_Diverse_Nation_08_15_18.pdfDate accessed: October 18, 2022Google Scholar,18Kader F. Doan L.N. Lee M. et al.Disaggregating race/ethnicity data categories: criticisms, dangers, and opposing viewpoints.http://www.healthaffairs.org/do/10.1377/forefront.20220323.555023/fullDate: 2022Date accessed: October 12, 2022Google Scholar In addition, smaller groups may make data analysis and interpretation more challenging. However, each legislative success described earlier required years of persistent advocacy from a coalition of community-based organizations while contending with opposing viewpoints. For groups that historically have been subjected to surveillance and monitoring, working with community organizations to explain the rationale for disaggregation and underscore a commitment to privacy is crucial. A number of strategies for suppressing data that do not meet statistical reliability, data quality, or confidentiality criteria have been developed.19Klein R.J. Proctor S.E. Boudreault M.A. et al.Healthy people 2010 criteria for data suppression.Healthy People 2010 Stat Notes. 2002; 24: 1-12Google Scholar Smaller groups that have an insufficient sample size also can be combined in an aggregate category for analysis. Collecting and reporting disaggregated racial and ethnic data is an important, practical step toward building a more diverse, equitable, and inclusive society and to improve health care quality. Going beyond the broad demographic categories of the OMB minimum standard allows us to unmask and address health disparities in our patients and bring more resources and attention to smaller communities. From modifying how we record patient or participant data to advocating for policy changes at the state and national levels, there are numerous ways to contribute to and advance health equity across our health systems.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call