Image Representation Ability Research Articles

Glaucoma is one of the leading causes of irreversible blindness. Segmentation of the optic disc (OD) and optic cup (OC) on fundus images is a crucial step in glaucoma screening. Although many deep learning models have been constructed for this task, it remains challenging to train an OD/OC segmentation model that can be deployed successfully to different healthcare centers. The difficulty mainly comes from domain shift, i.e., the fundus images collected at these centers usually vary greatly in tone, contrast, and brightness. To address this issue, we propose a novel unsupervised domain adaptation (UDA) method called Reconstruction-driven Dynamic Refinement Network (RDR-Net), which employs a dual-path segmentation backbone for simultaneous edge detection and region prediction and uses three modules to alleviate the domain gap. The reconstruction alignment (RA) module uses a variational auto-encoder (VAE) to reconstruct the input image and thus boosts the image representation ability of the network in a self-supervised way. It also uses a style-consistency constraint to force the network to retain more domain-invariant information. The low-level feature refinement (LFR) module employs input-specific dynamic convolutions to suppress the domain-variant information in the obtained low-level features. The prediction-map alignment (PMA) module applies entropy-driven adversarial learning to encourage the network to generate source-like boundaries and regions. We evaluated RDR-Net against state-of-the-art solutions on four public fundus image datasets. Our results indicate that RDR-Net is superior to competing models in both segmentation performance and generalization ability.
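
The abstract does not say how the entropy used by the PMA module is computed. A common choice in entropy-driven adaptation is the per-pixel Shannon entropy of the predicted class probabilities, and the sketch below assumes that choice. The function names normalized_entropy and entropy_map are hypothetical and are not taken from RDR-Net; the code only illustrates how an entropy map could be derived from a prediction map before it is passed to a discriminator.

```python
import math
from typing import List


def normalized_entropy(probs: List[float]) -> float:
    """Shannon entropy of one pixel's class probabilities, scaled to [0, 1].

    Assumption: probs is a softmax output over at least two classes.
    """
    if len(probs) < 2:
        raise ValueError("need at least two classes")
    # Terms with p == 0 contribute nothing (limit of p * log p as p -> 0).
    h = -sum(p * math.log(p) for p in probs if p > 0.0)
    # Divide by the maximum possible entropy, log(number of classes).
    return h / math.log(len(probs))


def entropy_map(prob_map: List[List[List[float]]]) -> List[List[float]]:
    """Apply normalized_entropy to every pixel of an H x W x C probability map."""
    return [[normalized_entropy(pixel) for pixel in row] for row in prob_map]


if __name__ == "__main__":
    # A 1 x 2 toy prediction map: one confident pixel, one maximally uncertain pixel.
    toy = [[[1.0, 0.0], [0.5, 0.5]]]
    print(entropy_map(toy))
```

Under this assumption, confident pixels map to values near 0 and uncertain pixels to values near 1, so a discriminator trained on such maps would see where the segmentation is unsure, which on target-domain images tends to be near boundaries.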


Lung nodule malignancy prediction is an essential step in the early diagnosis of lung cancer. Besides the difficulties commonly discussed, the challenges of this task also come from the ambiguous labels provided by annotators, since deep learning models have in some cases been found to reproduce or amplify human biases. In this paper, we propose a multi-view 'divide-and-rule' (MV-DAR) model to learn from both reliable and ambiguous annotations for lung nodule malignancy prediction on chest CT scans. According to the consistency and reliability of their annotations, we divide nodules into three sets: a consistent and reliable set (CR-Set), an inconsistent set (IC-Set), and a low-reliability set (LR-Set). Each nodule in IC-Set is annotated inconsistently by multiple radiologists, and each nodule in LR-Set is annotated by only one radiologist. Although ambiguous, inconsistent labels indicate which label(s) are excluded by all annotators, and the unreliable labels of a cohort of nodules are largely correct from a statistical point of view. Hence, both IC-Set and LR-Set can be used to facilitate the training of MV-DAR. MV-DAR contains three DAR models that characterize a lung nodule from three orthographic views and is trained following a two-stage procedure. Each DAR consists of three networks with the same architecture: a prediction network (Prd-Net), a counterfactual network (CF-Net), and a low-reliability network (LR-Net), which are trained on CR-Set, IC-Set, and LR-Set, respectively, in the pretraining phase. In the fine-tuning phase, the image representation ability learned by CF-Net and LR-Net is transferred to Prd-Net through a negative-attention module (NA-Module) and a consistent-attention module (CA-Module), aiming to boost the prediction ability of Prd-Net. The MV-DAR model has been evaluated on the LIDC-IDRI and LUNGx datasets. Our results indicate not only the effectiveness of MV-DAR in learning from ambiguous labels but also its superiority over present noisy-label learning models in lung nodule malignancy prediction.
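
The division into CR-Set, IC-Set, and LR-Set can be sketched as a small rule over each nodule's list of radiologist labels. This is only an illustration under stated assumptions: the abstract does not define "consistent", so exact agreement among all annotators is assumed here, and the function names, label strings, and label space are hypothetical.

```python
from typing import Iterable, List, Set


def assign_set(annotations: List[str]) -> str:
    """Assign a nodule to CR-Set, IC-Set, or LR-Set from its radiologist labels.

    Assumption: "consistent" means every annotator gave the same label.
    """
    if not annotations:
        raise ValueError("a nodule needs at least one annotation")
    if len(annotations) == 1:
        return "LR-Set"   # annotated by only one radiologist
    if len(set(annotations)) == 1:
        return "CR-Set"   # several radiologists, all in agreement
    return "IC-Set"       # several radiologists, disagreeing


def excluded_labels(annotations: List[str], label_space: Iterable[str]) -> Set[str]:
    """Labels that no annotator chose, i.e. the ones excluded by all annotators."""
    return set(label_space) - set(annotations)


if __name__ == "__main__":
    # Hypothetical three-way label space, for illustration only.
    space = ["benign", "uncertain", "malignant"]
    votes = ["benign", "uncertain", "benign"]
    print(assign_set(votes))              # IC-Set
    print(excluded_labels(votes, space))  # {'malignant'}
```

In this toy case the annotators disagree between two labels, yet none of them chose "malignant". That excluded label is the kind of signal the abstract says inconsistent annotations still carry.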


Articles published on Image Representation Ability

CADS: A Self-supervised Learner via Cross-modal Alignment and Deep Self-distillation for CT Volume Segmentation.

Microsnoop: A generalist tool for microscopy image representation

Reconstruction-Driven Dynamic Refinement Based Unsupervised Domain Adaptation for Joint Optic Disc and Cup Segmentation.

Joint optimization for attention-based generation and recognition of chinese characters using tree position embedding

MsIFT: Multi-Source Image Fusion Transformer

Learning From Ambiguous Labels for Lung Nodule Malignancy Prediction.

Image Analysis by Fractional-Order Gaussian-Hermite Moments.

The Effects of Art Activities Using Traditional Fairy Tales on Young Children's Emotional Intelligence and Picture Representation Ability

Fusing Multilevel Deep Features for Fabric Defect Detection Based NTV-RPCA

Semi-supervised adversarial model for benign-malignant lung nodule classification on chest CT.

ConvNet and LSH-Based Visual Localization Using Localized Sequence Matching.

A Joint Convolutional Neural Networks and Context Transfer for Street Scenes Labeling

Image analysis by Gaussian–Hermite moments

Transform based spatio-temporal descriptors for human action recognition

Robust and Efficient Fourier–Mellin Transform Approximations for Gray-Level Image Reconstruction and Complete Invariant Description

On image analysis by the methods of moments
