Human-like Biases Research Articles

Language in nonmedical data sets is known to transmit human-like biases when used in natural language processing (NLP) algorithms that can reinforce disparities. It is unclear if NLP algorithms of medical notes could lead to similar transmissions of biases. Can we identify implicit bias in clinical notes, and are biases stable across time and geography? To determine whether different racial and ethnic descriptors are similar contextually to stigmatizing language in ICU notes and whether these relationships are stable across time and geography, we identified notes on critically ill adults admitted to the University of California, San Francisco (UCSF), from 2012 through 2022 and to Beth Israel Deaconess Hospital (BIDMC) from 2001 through 2012. Because word meaning is derived largely from context, we trained unsupervised word-embedding algorithms to measure the similarity (cosine similarity) quantitatively of the context between a racial or ethnic descriptor (eg, African-American) and a stigmatizing target word (eg, nonco-operative) or group of words (violence, passivity, noncompliance, nonadherence). In UCSF notes, Black descriptors were less likely to be similar contextually to violent words compared with White descriptors. Contrastingly, in BIDMC notes, Black descriptors were more likely to be similar contextually to violent words compared with White descriptors. The UCSF data setalso showed that Black descriptors were more similar contextually to passivity and noncompliance words compared with Latinx descriptors. Implicit bias is identifiable in ICU notes. Racial and ethnic group descriptors carry different contextual relationships to stigmatizing words, depending on when and where notes were written. Because NLP models seem able to transmit implicit bias from training data, use of NLP algorithms in clinical prediction could reinforce disparities. Active debiasing strategies may be necessary to achieve algorithmic fairness when using language models in clinical research.

Read full abstract

Artificial writing is permeating our lives due to recent advances in large-scale, transformer-based language models (LMs) such as BERT, GPT-2 and GPT-3. Using them as pre-trained models and fine-tuning them for specific tasks, researchers have extended the state of the art for many natural language processing tasks and shown that they capture not only linguistic knowledge but also retain general knowledge implicitly present in the data. Unfortunately, LMs trained on unfiltered text corpora suffer from degenerated and biased behaviour. While this is well established, we show here that recent LMs also contain human-like biases of what is right and wrong to do, reflecting existing ethical and moral norms of society. We show that these norms can be captured geometrically by a ‘moral direction’ which can be computed, for example, by a PCA, in the embedding space. The computed ‘moral direction’ can rate the normativity (or non-normativity) of arbitrary phrases without explicitly training the LM for this task, reflecting social norms well. We demonstrate that computing the ’moral direction’ can provide a path for attenuating or even preventing toxic degeneration in LMs, showcasing this capability on the RealToxicityPrompts testbed. Large language models identify patterns in the relations between words and capture their relations in an embedding space. Schramowski and colleagues show that a direction in this space can be identified that separates ‘right’ and ‘wrong’ actions as judged by human survey participants.

Read full abstract

Human-like Biases Research Articles

Articles published on Human-like Biases

Wisdom of the silicon crowd: LLM ensemble prediction capabilities rival human crowd accuracy.

(Ir)rationality and cognitive biases in large language models.

Do Large Language Models Show Human-like Biases? Exploring Confidence—Competence Gap in AI

Measuring Implicit Bias in ICU Notes Using Word-Embedding Neural Network Models

On Human-like Biases in Convolutional Neural Networks for the Perception of Slant from Texture

Large pre-trained language models contain human-like biases of what is right and wrong to do

Semantics derived automatically from language corpora contain human-like biases.

ON LOSS AVERSION IN CAPUCHIN MONKEYS

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Human-like Biases Research Articles

Articles published on Human-like Biases

Wisdom of the silicon crowd: LLM ensemble prediction capabilities rival human crowd accuracy.

(Ir)rationality and cognitive biases in large language models.

Do Large Language Models Show Human-like Biases? Exploring Confidence—Competence Gap in AI

Measuring Implicit Bias in ICU Notes Using Word-Embedding Neural Network Models

On Human-like Biases in Convolutional Neural Networks for the Perception of Slant from Texture

Large pre-trained language models contain human-like biases of what is right and wrong to do

Semantics derived automatically from language corpora contain human-like biases.

ON LOSS AVERSION IN CAPUCHIN MONKEYS