Abstract

Humans have been shown to exhibit biases when reading medical images, raising the question of whether human disease gradings are uniform. Artificial intelligence (AI) tools trained on human-labeled data may therefore inherit this non-uniformity. In this study, we used a radiographic knee osteoarthritis external validation dataset of 50 patients and a six-year retrospective consecutive clinical cohort of 8,273 patients. An FDA-approved and CE-marked AI tool was tested for potential non-uniformity in Kellgren-Lawrence grades between the right and left sides of the images. We flipped the images horizontally so that a left knee looked like a right knee and vice versa. On human review, the AI tool showed non-uniformity, with 20–22% of gradings disagreeing between original and flipped images on the external validation dataset and 13.6% on the clinical cohort. However, we found no evidence of a significant difference in accuracy compared to senior radiologists on the external validation dataset, nor of age or sex bias on the cohort. AI non-uniformity can inflate the performance evaluated against humans, so image regions with inferior performance should be investigated.
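The flip-consistency check described above can be sketched in a few lines. The sketch below is illustrative only: `grade_knee` is a hypothetical placeholder for the proprietary AI tool's inference call (returning a Kellgren-Lawrence grade 0–4), and the file paths are assumed, not taken from the study.

```python
# Minimal sketch of a left-right uniformity check via horizontal flipping.
# Assumptions (not from the paper): radiographs are local image files, and
# grade_knee() stands in for the vendor tool's KL-grade inference API.
from pathlib import Path
from PIL import Image, ImageOps


def grade_knee(image: Image.Image) -> int:
    """Hypothetical placeholder for the AI tool's KL-grade (0-4) inference."""
    raise NotImplementedError("Replace with the vendor tool's inference call.")


def flip_disagreement_rate(image_paths: list[Path]) -> float:
    """Fraction of knees whose KL grade changes after a horizontal flip."""
    disagreements = 0
    for path in image_paths:
        original = Image.open(path).convert("L")   # grayscale radiograph
        mirrored = ImageOps.mirror(original)       # left knee now appears as a right knee
        if grade_knee(original) != grade_knee(mirrored):
            disagreements += 1
    return disagreements / len(image_paths)


# Example usage (paths illustrative):
# rate = flip_disagreement_rate(sorted(Path("radiographs").glob("*.png")))
# print(f"Left-right disagreement: {rate:.1%}")
```

A perfectly left-right uniform grader would yield a disagreement rate of zero; the study's 13.6–22% figures correspond to this kind of mismatch between original and mirrored images.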
