Evaluation of Combined Artificial Intelligence and Radiologist Assessment to Interpret Screening Mammograms

Thomas Schaffter,Jae Ho Sohn,Eduardo Castro,Joseph H Rothstein,David D Cox,Can Son Khoo,Jaime S Cardoso,Antonio Jimeno-Yepes ,Hari Trivedi,Christoph M Friedrich,Gerard Cardoso Negrie,Daniel L Rubin,Darvin Yi,J C F Pereira ,Jiashi Feng,Gustavo Stolovitzky,Mehmet Eren Ahsen,Bibo Shi,Imane Nedjar,Diana S M Buist ,Ethan Goan,Lester Mackey,Russell Mcbride ,Alberto Albiol,Joseph Y Lo,Laurie R Margolies ,L Caballero ,Zequn Jie,Andrew D Trister ,Umar Asif,Christoph I Lee,Justin Guinney,Hyoeun Kim ,Stephen Morrell,Joyce Cahoon,William Lotter,A Gregory Sorensen,Rami Ben-Ari,Thea Norman,Karl Trygve Kalleberg,Felix Nensa,Berkman Sahiner,Dezső Ribli ,Sijia Wang,Michael Kawczynski,Zbigniew Wojna,Pavitra Krishnaswamy,F Albiol ,Mengling Feng,Shivanthan A C Yohanandan ,Yiqiu Shen,Ljubomir Buturović ,Li Shen,Obioma Pelka,Sven Koitka,Dimitri Perrin,Simona Rabinovici-Cohen,Thomas Yu,Yuanfang Guan,Fredrik Strand,Weiva Sieh,Bruce Hoff,Elias Chaibub Neto,Krzysztof J Geras,Kyunghyun Cho,Yaroslav Nikulin,Stefan Harrer,Peter Lindholm,Clinton Fookes,Hao Du,Stephen H Friend ,Gaurav Pandey,Hwejin Jung

doi:10.1001/jamanetworkopen.2020.0265

Abstract

Mammography screening currently relies on subjective human interpretation. Artificial intelligence (AI) advances could be used to increase mammography screening accuracy by reducing missed cancers and false positives. To evaluate whether AI can overcome human mammography interpretation limitations with a rigorous, unbiased evaluation of machine learning algorithms. In this diagnostic accuracy study conducted between September 2016 and November 2017, an international, crowdsourced challenge was hosted to foster AI algorithm development focused on interpreting screening mammography. More than 1100 participants comprising 126 teams from 44 countries participated. Analysis began November 18, 2016. Algorithms used images alone (challenge 1) or combined images, previous examinations (if available), and clinical and demographic risk factor data (challenge 2) and output a score that translated to cancer yes/no within 12 months. Algorithm accuracy for breast cancer detection was evaluated using area under the curve and algorithm specificity compared with radiologists' specificity with radiologists' sensitivity set at 85.9% (United States) and 83.9% (Sweden). An ensemble method aggregating top-performing AI algorithms and radiologists' recall assessment was developed and evaluated. Overall, 144 231 screening mammograms from 85 580 US women (952 cancer positive ≤12 months from screening) were used for algorithm training and validation. A second independent validation cohort included 166 578 examinations from 68 008 Swedish women (780 cancer positive). The top-performing algorithm achieved an area under the curve of 0.858 (United States) and 0.903 (Sweden) and 66.2% (United States) and 81.2% (Sweden) specificity at the radiologists' sensitivity, lower than community-practice radiologists' specificity of 90.5% (United States) and 98.5% (Sweden). Combining top-performing algorithms and US radiologist assessments resulted in a higher area under the curve of 0.942 and achieved a significantly improved specificity (92.0%) at the same sensitivity. While no single AI algorithm outperformed radiologists, an ensemble of AI algorithms combined with radiologist assessment in a single-reader screening environment improved overall accuracy. This study underscores the potential of using machine learning methods for enhancing mammography screening interpretation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: JAMA Network Open	Publication Date: Mar 2, 2020
Citations: 271	License type: cc-by

R Discovery Prime

R Discovery Prime

Evaluation of Combined Artificial Intelligence and Radiologist Assessment to Interpret Screening Mammograms

Abstract

Talk to us

Similar Papers

More From: JAMA Network Open

Lead the way for us

Similar Papers

Artificial intelligence: Friend or foe?
Anusch Yazdani ... Sam Costa
Australian and New Zealand Journal of Obstetrics and Gynaecology | VOL. 63
Anusch Yazdani, et. al.Anusch Yazdani ... Sam Costa
01 Apr 2023
Australian and New Zealand Journal of Obstetrics and Gynaecology | VOL. 63

Comparative Performance of Artificial Intelligence Algorithms for Screening Mammography.
Michio Taya
Radiology. Imaging cancer | VOL. 2
Michio TayaMichio Taya
01 Nov 2020
Radiology. Imaging cancer | VOL. 2

The brave new world of artificial intelligence: dawn of a new era
Giovanni Di Napoli ... Linda S Lee
iGIE | VOL. 2
Giovanni Di Napoli, et. al.Giovanni Di Napoli ... Linda S Lee
28 Feb 2023
iGIE | VOL. 2

Artificial Intelligence for Computer Vision in Surgery: A Call for Developing Reporting Guidelines.
Daichi Kitaguchi ... Nobuyoshi Takeshita
Annals of Surgery | VOL. 275
Daichi Kitaguchi, et. al.Daichi Kitaguchi ... Nobuyoshi Takeshita
23 Nov 2021
Annals of Surgery | VOL. 275

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evaluation of Combined Artificial Intelligence and Radiologist Assessment to Interpret Screening Mammograms

Abstract

Talk to us

Similar Papers

More From: JAMA Network Open