Chest X-ray Dataset Research Articles

Deep learning shows great promise for medical image analysis but often lacks explainability, which hinders its adoption in healthcare. Attribution techniques that explain model reasoning can potentially increase trust in deep learning among clinical stakeholders. Much of the research on attribution in medical imaging relies on visual inspection rather than statistical quantitative analysis. In this paper, we propose an image-based saliency framework to enhance the explainability of deep learning models in medical image analysis. We use adaptive path-based gradient integration, gradient-free techniques, and class activation mapping along with its derivatives to attribute predictions made by recent deep convolutional neural network models on brain tumor MRI and COVID-19 chest X-ray datasets. The proposed framework integrates qualitative and statistical quantitative assessments, employing Accuracy Information Curves (AICs) and Softmax Information Curves (SICs) to measure how well saliency methods retain critical image information and how that information correlates with model predictions. Visual inspections indicate that methods such as ScoreCAM, XRAI, GradCAM, and GradCAM++ consistently produce focused and clinically interpretable attribution maps. These methods highlighted possible biomarkers, exposed model biases, and offered insights into the links between input features and predictions, demonstrating their ability to elucidate model reasoning on these datasets. Empirical evaluations reveal that ScoreCAM and XRAI are particularly effective in retaining relevant image regions, as reflected in their higher AUC values. However, SICs show variability, with instances of random saliency masks outperforming established methods, which emphasizes the need to combine visual and empirical metrics for a comprehensive evaluation. The results underscore the importance of selecting appropriate saliency methods for specific medical imaging tasks and suggest that combining qualitative and quantitative approaches can enhance the transparency, trustworthiness, and clinical adoption of deep learning models in healthcare. This study advances model explainability to increase trust in deep learning among healthcare stakeholders by revealing the rationale behind predictions. Future research should refine empirical metrics for stability and reliability, include more diverse imaging modalities, and focus on improving model explainability to support clinical decision-making.
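As an illustration of the class activation mapping family named in the abstract above, the following is a minimal sketch of the standard Grad-CAM computation, not the authors' implementation. It assumes the feature maps and the gradients of the class score with respect to them have already been extracted from a network; the function name `grad_cam` and the array shapes are hypothetical choices made for this example.

```python
import numpy as np


def grad_cam(activations, gradients):
    """Grad-CAM heatmap from one convolutional layer.

    activations: array (K, H, W), feature maps A_k from a forward pass.
    gradients:   array (K, H, W), d(class score)/dA_k from a backward pass.
    Both inputs are assumed to be extracted beforehand (hypothetical inputs).
    """
    activations = np.asarray(activations, dtype=float)
    gradients = np.asarray(gradients, dtype=float)
    if activations.ndim != 3 or activations.shape != gradients.shape:
        raise ValueError("expected two arrays of identical shape (K, H, W)")

    # Channel weights: global-average-pool the gradients over the spatial axes.
    weights = gradients.mean(axis=(1, 2))

    # Weighted sum of feature maps, then ReLU to keep positive evidence only.
    cam = np.maximum(np.tensordot(weights, activations, axes=1), 0.0)

    # Scale to [0, 1] for display; an all-zero map is returned unchanged.
    peak = cam.max()
    return cam / peak if peak > 0 else cam


if __name__ == "__main__":
    # Random stand-ins for real network outputs, for demonstration only.
    rng = np.random.default_rng(0)
    acts = rng.random((8, 7, 7))
    grads = rng.normal(size=(8, 7, 7))
    print(grad_cam(acts, grads).shape)  # (7, 7)
```

In practice the resulting low-resolution map would be upsampled to the input image size before being overlaid on the X-ray or MRI slice; that step is omitted here.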

This work aims to assess standard evaluation practices used by the research community for evaluating medical imaging classifiers, with a specific focus on the implications of class imbalance. The analysis is performed on chest X-rays as a case study and encompasses a comprehensive definition of model performance, considering both discriminative capabilities and model calibration. We conduct a concise literature review to examine prevailing scientific practices used when evaluating X-ray classifiers. Then, we perform a systematic experiment on two major chest X-ray datasets to provide a didactic example of the behavior of several performance metrics under different class ratios and to highlight how widely adopted metrics can conceal performance in the minority class. Our literature study confirms that: (1) even when dealing with highly imbalanced datasets, the community tends to use metrics that are dominated by the majority class; and (2) it is still uncommon to include calibration studies for chest X-ray classifiers, despite their importance in the context of healthcare. Moreover, our systematic experiments confirm that current evaluation practices may not reflect model performance in real clinical scenarios and suggest complementary metrics to better reflect the performance of the system in such scenarios. Our analysis underscores the need for enhanced evaluation practices, particularly in the context of class-imbalanced chest X-ray classifiers. We recommend the inclusion of complementary metrics such as the area under the precision-recall curve (AUC-PR), adjusted AUC-PR, and balanced Brier score to offer a more accurate depiction of system performance in real clinical scenarios, considering metrics that reflect both discrimination and calibration performance.
This study underscores the critical need for refined evaluation metrics in medical imaging classifiers, emphasizing that prevalent metrics may mask poor performance in minority classes, potentially impacting clinical diagnoses and healthcare outcomes. Common scientific practices in papers dealing with X-ray computer-assisted diagnosis (CAD) systems may be misleading. We highlight limitations in the reporting of evaluation metrics for X-ray CAD systems in highly imbalanced scenarios. We propose adopting alternative metrics based on experimental evaluation on large-scale datasets.
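To make the point about majority-dominated metrics concrete, here is a minimal NumPy sketch of some of the metrics named above. It is an illustration under stated assumptions, not the paper's code: AUC-PR is approximated by average precision, and the balanced Brier score is assumed to be the mean of the per-class Brier scores, which may differ from the exact definition used in the paper. The adjusted AUC-PR is left out because its definition is not given here. All function names are hypothetical.

```python
import numpy as np


def auroc(y_true, y_score):
    """AUROC as the probability that a positive outranks a negative (ties count 0.5)."""
    y_true = np.asarray(y_true)
    y_score = np.asarray(y_score, dtype=float)
    diff = y_score[y_true == 1][:, None] - y_score[y_true == 0][None, :]
    return float(np.mean(diff > 0) + 0.5 * np.mean(diff == 0))


def average_precision(y_true, y_score):
    """Average precision: mean of precision@k taken at the rank of each positive.

    Assumes no tied scores; used here as a common estimate of AUC-PR.
    """
    y_true = np.asarray(y_true)
    y_score = np.asarray(y_score, dtype=float)
    hits = y_true[np.argsort(-y_score, kind="stable")]
    precision_at_k = np.cumsum(hits) / np.arange(1, len(hits) + 1)
    return float(precision_at_k[hits == 1].mean())


def brier(y_true, y_prob):
    """Standard Brier score: mean squared error of predicted probabilities."""
    y_true = np.asarray(y_true, dtype=float)
    y_prob = np.asarray(y_prob, dtype=float)
    return float(np.mean((y_prob - y_true) ** 2))


def balanced_brier(y_true, y_prob):
    """Assumed definition: average of the Brier scores computed per class."""
    y_true = np.asarray(y_true)
    y_prob = np.asarray(y_prob, dtype=float)
    bs_pos = np.mean((1.0 - y_prob[y_true == 1]) ** 2)
    bs_neg = np.mean(y_prob[y_true == 0] ** 2)
    return float(0.5 * (bs_pos + bs_neg))


if __name__ == "__main__":
    # One positive among ten cases; the model always predicts probability 0.
    y = np.array([1, 0, 0, 0, 0, 0, 0, 0, 0, 0])
    p = np.zeros(10)
    print(brier(y, p))           # 0.1
    print(balanced_brier(y, p))  # 0.5
```

In the toy case at the bottom, a model that never predicts the disease obtains a seemingly good Brier score of 0.1 because nine of ten cases are negative, while the per-class average of 0.5 exposes the complete failure on the minority class.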

Articles published on Chest X-ray Dataset

A Novel Approach for Stratifying Pulmonary Edema Severity on Chest X-ray via Dual-Mechanic Self-Learning and Bidirectional Multi-Modal Cross-Attention Algorithms

Acquisition parameters influence AI recognition of race in chest x-rays and mitigating these factors reduces underdiagnosis bias

Automated quantification of SARS-CoV-2 pneumonia with large vision model knowledge adaptation

An open chest X-ray dataset with benchmarks for automatic radiology report generation in French

UniChest: Conquer-and-Divide Pre-Training for Multi-Source Chest X-Ray Classification.

Three-Stage Framework for Accurate Pediatric Chest X-ray Diagnosis Using Self-Supervision and Transfer Learning on Small Datasets.

AssistDistil for Medical Image Segmentation

Deep Learning for Pneumonia Detection in Chest X-ray Images: A Comprehensive Survey.

ALFREDO: Active Learning with FeatuRe disEntangelement and DOmain adaptation for medical image classification

Advanced Lung Disease Detection and Classification Using Ge-U-Net-ODL with Gabor Filters and Entropy-Based Feature Selection

Evaluating Diagnostic Thresholds for Pneumonia Severity Using HOG Features and Deep Learning Models

Literature Study: Transfer Learning for COVID-19 Disease Analysis Based on Chest X-ray Dataset

The limits of fair medical imaging AI in real-world generalization

Saliency-driven explainable deep learning in medical imaging: bridging visual explainability and statistical quantitative analysis

Leveraging Transfer Learning for Efficient Diagnosis of COPD Using CXR Images and Explainable AI Techniques

Few-shot learning for COVID-19 chest X-ray classification with imbalanced data: an inter vs. intra domain study

Class imbalance on medical image classification: towards better evaluation practices for discrimination and calibration performance.

Robust Stochastic Neural Ensemble Learning With Noisy Labels for Thoracic Disease Classification.

Comparative Analysis of VGG16, RESNET50, AND CNN Models for Lung Disease Prediction: A Deep Learning Approach

Towards COVID-19 detection and classification using optimal efficient Densenet model on chest X-ray images
