Published August 13, 2024 | Version v1
Journal article Open

Race adjustments in clinical algorithms can help correct for racial disparities in data quality

  • 1. University of Chicago
  • 2. University of California, Berkeley
  • 3. Cornell University

Description

Despite ethical and historical arguments for removing race from clinical algorithms, the consequences of removal remain unclear. Here, we highlight a largely undiscussed consideration in this debate: varying data quality of input features across race groups. For example, family history of cancer is an essential predictor in cancer risk prediction algorithms but is less reliably documented for Black participants and may therefore be less predictive of cancer outcomes. Using data from the Southern Community Cohort Study, we assessed whether race adjustments could allow risk prediction models to capture varying data quality by race, focusing on colorectal cancer risk prediction. We analyzed 77,836 adults with no history of colorectal cancer at baseline. The predictive value of self-reported family history was greater for White participants than for Black participants. We compared two cancer risk prediction algorithms—a race-blind algorithm which included standard colorectal cancer risk factors but not race, and a race-adjusted algorithm which additionally included race. Relative to the race-blind algorithm, the race-adjusted algorithm improved predictive performance, as measured by goodness of fit in a likelihood ratio test (P-value: <0.001) and area under the receiving operating characteristic curve among Black participants (P-value: 0.006). Because the race-blind algorithm underpredicted risk for Black participants, the race-adjusted algorithm increased the fraction of Black participants among the predicted high-risk group, potentially increasing access to screening. More broadly, this study shows that race adjustments may be beneficial when the data quality of key predictors in clinical algorithms differs by race group.

Data availability

The data cannot be shared as per the Data Use Agreement, though access to the dataset we used may be obtained through request to the SCCS. Please refer to the following web address for more information on accessing the SCCS data: https://www.southerncommunitystudy.org/research-opportunities.html (32).

Files

zink-et-al-2024-race-adjustments-in-clinical-algorithms-can-help-correct-for-racial-disparities-in-data-quality.pdf

Files (349.3 kB)

Name Size Download all
Supporting information
md5:cd7ab1b55d1d8478c86a6afdd927135b
150.4 kB Preview Download
Article
md5:909f9afebc3088c5a2b55ba146c97d31
198.9 kB Preview Download

Additional details

Identifiers

DOI
10.1073/pnas.2402267121
Other
oai:uchicago.tind.io:13159

Funding

Google
Research Scholar award
National Science Foundation
Career Award
CIFAR
Azrieli Global scholarship
LinkedIn
Research Award
Unknown funder
Abby Joseph Cohen Faculty Fund
AI2050
Early Career Fellowship
University of Chicago
National Cancer Institute
U01CA202979
Vanderbilt-Ingram Cancer Center
P30 CA68485

UChicago Information

Division(s)
Booth School of Business
Center(s) or Institute(s)
Center for Applied Artificial Intelligence