AI generates covertly racist decisions about people based on their dialect
- 1. Allen Institute for AI
- 2. Stanford University
- 3. University of Chicago
Description
Data availability
All the datasets used in this study are publicly available. The dataset released as ref. 87 can be found at https://aclanthology.org/2020.emnlp-main.473/. The dataset released as ref. 83 can be found at http://slanglab.cs.umass.edu/TwitterAAE/. The human stereotype scores used for evaluation can be found in the published articles of the Princeton Trilogy studies. The most recent of these articles also contains the human favourability scores for the trait adjectives. The dataset of occupational prestige that we used for the employability analysis can be found in the corresponding paper. The Brown Corpus, which we used for the Supplementary Information ('Feature analysis'), can be found at http://www.nltk.org/nltk_data/. The dataset containing the parallel AAE, Appalachian English and Indian English texts, which we used in the Supplementary Information ('Alternative explanations'), can be found at https://huggingface.co/collections/SALT-NLP/value-nlp-666b60a7f76c14551bda4f52.
Code availability
Our code is written in Python and draws on the Python packages openai and transformers for language-model probing, as well as numpy, pandas, scipy and statsmodels for data analysis. The feature analysis described in the Supplementary Information also uses the VALUE Python library. Our code is publicly available on GitHub at https://github.com/valentinhofmann/dialect-prejudice.
Files
AI-generates-covertly-racist-decisions-about-people-based-on-their-dialect.pdf
Total size: 6.8 MB
| Name | Size |
|---|---|
| Article (md5:ee2b1c5822e29a37b25da1a822a934d0) | 1.8 MB |
| Supplementary information files (md5:f99ebf93fe1a972629104710dd22e71f) | 5.0 MB |
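The md5 checksums listed for the two files can be used to confirm that a download is intact. The following is a minimal sketch using only the Python standard library. The dictionary keys, the function names `md5_of_file` and `matches_record`, and any local file path are illustrative assumptions and not part of this record. Only the two digests are taken from the listing.

```python
import hashlib
from pathlib import Path

# md5 digests copied from the file listing of this record.
# The keys "article" and "supplementary" are hypothetical labels.
EXPECTED_MD5 = {
    "article": "ee2b1c5822e29a37b25da1a822a934d0",
    "supplementary": "f99ebf93fe1a972629104710dd22e71f",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex md5 digest of the file at `path`, read in chunks."""
    digest = hashlib.md5()
    with Path(path).open("rb") as handle:
        # Read fixed-size chunks so large files are not loaded at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path, key):
    """Return True if the file's md5 equals the digest listed for `key`."""
    return md5_of_file(path) == EXPECTED_MD5[key]
```

For example, `matches_record("article.pdf", "article")` would return `True` only if a local file saved under the hypothetical name `article.pdf` has the digest listed for the article.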
Additional details
Identifiers
- DOI: 10.1038/s41586-024-07856-5
- Other: oai:uchicago.tind.io:13311
Funding
- German Academic Scholarship Foundation
- Open Phil AI Fellowship
- Hoffman-Yee Research Grants programme
- Stanford Institute for Human-Centered Artificial Intelligence