Benchmarking saliency methods for chest X-ray interpretation

Adriel Saporta, Matthew P Lungren, Anuj Pareek, Van-Doan Ngo, Andrew Y Ng, Pranav Rajpurkar, Jayne Seekins, Xiaotong Gui, Chanh D T Nguyen, Steven Q H Truong, Francis G Blankenberg, Ashwin Agrawal

doi:10.1038/s42256-022-00536-x

Abstract

Saliency methods, which produce heat maps that highlight the areas of the medical image that influence model prediction, are often presented to clinicians as an aid in diagnostic decision-making. However, rigorous investigation of the accuracy and reliability of these strategies is necessary before they are integrated into the clinical setting. In this work, we quantitatively evaluate seven saliency methods, including Grad-CAM, across multiple neural network architectures using two evaluation metrics. We establish the first human benchmark for chest X-ray segmentation in a multilabel classification set-up, and examine under what clinical conditions saliency maps might be more prone to failure in localizing important pathologies compared with a human expert benchmark. We find that (1) while Grad-CAM generally localized pathologies better than the other evaluated saliency methods, all seven performed significantly worse compared with the human benchmark, (2) the gap in localization performance between Grad-CAM and the human benchmark was largest for pathologies that were smaller in size and had shapes that were more complex, and (3) model confidence was positively correlated with Grad-CAM localization performance. Our work demonstrates that several important limitations of saliency methods must be addressed before we can rely on them for deep learning explainability in medical imaging.
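
The abstract does not name its two evaluation metrics. As a minimal sketch of how a saliency heat map might be scored against a human-drawn segmentation, the Python below assumes two commonly used localization measures: intersection over union (IoU) on a thresholded heat map, and a pointing-game style hit test on the heat map's peak. The function names, the min-max normalization, and the 0.5 threshold are illustrative assumptions, not details taken from the paper.

```python
import numpy as np


def saliency_iou(saliency, mask, threshold=0.5):
    """IoU between a thresholded saliency map and a binary ground-truth mask.

    The 0.5 threshold is an illustrative assumption, not a value from the paper.
    """
    s = np.asarray(saliency, dtype=float)
    span = s.max() - s.min()
    # Min-max normalize to [0, 1]; a constant map carries no localization signal.
    s = (s - s.min()) / span if span > 0 else np.zeros_like(s)
    pred = s >= threshold
    gt = np.asarray(mask).astype(bool)
    union = np.logical_or(pred, gt).sum()
    if union == 0:
        # Neither the map nor the annotation marks any pixel.
        return 0.0
    return float(np.logical_and(pred, gt).sum() / union)


def pointing_hit(saliency, mask):
    """True if the most salient pixel falls inside the ground-truth mask."""
    s = np.asarray(saliency)
    row, col = np.unravel_index(np.argmax(s), s.shape)
    return bool(np.asarray(mask).astype(bool)[row, col])
```

Averaging `saliency_iou` over images, and taking the fraction of images where `pointing_hit` is true, would give per-pathology scores that could be compared between a saliency method and a human annotator under these assumptions.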

Journal: Nature Machine Intelligence	Publication Date: Oct 1, 2022
Citations: 74	License type: open-access
