Abstract

Thorax disease classification is a challenging task due to complex pathologies and subtle texture changes, etc. It has been extensively studied for years largely because of its wide application in computer-aided diagnosis. Most existing methods directly learn global feature representations from whole Chest X-ray (CXR) images, without considering in depth the richer visual cues lying around informative local regions. Thus, these methods often produce sub-optimal thorax disease classification performance because they ignore the very informative pathological changes around organs. In this paper, we propose a novel Part-Aware Mask-Guided Attention Network (PMGAN) that learns complementary global and local feature representations from all-organ region and multiple single-organ regions simultaneously for thorax disease classification. Specifically, multiple innovative soft attention modules are designed to progressively guide feature learning toward the global informative regions of whole CXR image. A mask-guided attention module is designed to further search for informative regions and visual cues within the all-organ or single-organ images, where attention is elegantly regularized by automatically generated organ masks and without introducing computation during the inference stage. In addition, a multi-task learning strategy is designed, which effectively maximizes the learning of complementary local and global representations. The proposed PMGAN has been evaluated on the ChestX-ray14 dataset and the experimental results demonstrate its superior thorax disease classification performance against the state-of-the-art methods.

Highlights

  • Chest X-rays (CXR) have been one of the most common radiological examinations aiding in thorax disease diagnosis [1,2]

  • Several works adopt prevalent Convolutional Neural Network (CNN) models, i.e., ResNet [10] and DenseNet [11], to classify multiple thoracic pathologies according to information that is captured from global CXR image

  • Since the mask-guided attention network has the same architecture as soft attention network, we introduce independent segmentation constraint into each Mask-Guided Attention (MA) to guide the attention learning toward corresponding organ region

Read more

Summary

Introduction

Chest X-rays (CXR) have been one of the most common radiological examinations aiding in thorax disease diagnosis [1,2]. CXR images contain complex pathologies and subtle texture changes of different thorax diseases, which bring great challenges to disease diagnosis even for professional radiologists, may lead to wrong diagnosis Aiming to address these challenges, it is important to develop the CXR image classification systems to support the daily clinical routines. Several works adopt prevalent Convolutional Neural Network (CNN) models, i.e., ResNet [10] and DenseNet [11], to classify multiple thoracic pathologies according to information that is captured from global CXR image. These methods have achieved some promising results, exploiting informative regions to learn discriminative local features from CXR images remains an open research challenge. The local region generation technique usually suffers from misalignment problem and introduces extra computation

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call