Abstract
Multi-modal learning is typically performed with network architectures containing modality-specific layers and shared layers, utilizing co-registered images of different modalities. We propose a novel learning scheme for unpaired cross-modality image segmentation, with a highly compact architecture that achieves superior segmentation accuracy. In our method, we heavily reuse network parameters by sharing all convolutional kernels across CT and MRI, and employ only modality-specific internal normalization layers, which compute their respective statistics. To effectively train such a highly compact model, we introduce a novel loss term inspired by knowledge distillation, which explicitly constrains the KL divergence between the prediction distributions derived from each modality. We have extensively validated our approach on two multi-class segmentation problems: i) cardiac structure segmentation, and ii) abdominal organ segmentation. Different network settings, i.e., a 2D dilated network and a 3D U-Net, are used to investigate the general efficacy of our method. Experimental results on both tasks demonstrate that our novel multi-modal learning scheme consistently outperforms single-modal training and previous multi-modal approaches.
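To make the parameter-sharing idea concrete, below is a minimal PyTorch-style sketch (not the authors' released implementation) of a layer whose convolutional kernels are shared across CT and MRI while each modality keeps its own internal normalization layer. The module name, the choice of BatchNorm, and the channel sizes are illustrative assumptions.

```python
import torch
import torch.nn as nn

class SharedConvDualNorm(nn.Module):
    """Sketch: convolution kernels shared across CT and MRI, with one
    normalization layer per modality tracking its own statistics."""

    def __init__(self, in_ch: int, out_ch: int):
        super().__init__()
        # A single set of convolutional kernels serves both modalities.
        self.conv = nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1, bias=False)
        # Modality-specific normalization: separate statistics and affine
        # parameters for CT and MRI (BatchNorm is an assumption here).
        self.norms = nn.ModuleDict({
            "ct": nn.BatchNorm2d(out_ch),
            "mri": nn.BatchNorm2d(out_ch),
        })
        self.act = nn.ReLU(inplace=True)

    def forward(self, x: torch.Tensor, modality: str) -> torch.Tensor:
        # Same kernels for both modalities; only normalization differs.
        return self.act(self.norms[modality](self.conv(x)))
```

A network built from such layers would be invoked as, e.g., `layer(ct_batch, "ct")` or `layer(mri_batch, "mri")`, so both modalities traverse identical kernels but are normalized with their own statistics.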
Highlights
Anatomical structures are imaged with a variety of modalities depending on the clinical indication
We extensively evaluate our method on two multi-class Computed Tomography (CT) and Magnetic Resonance Imaging (MRI) segmentation tasks: cardiac segmentation with a 2D dilated convolutional neural network (CNN) and abdominal multi-organ segmentation with a 3D U-Net
The two key aspects are: 1) separating internal feature normalizations for each modality, given the very different statistical distributions of CT and MRI; 2) knowledge distillation from pre-softmax activations, in order to leverage information shared across modalities to guide the multi-modal learning
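As an illustration of point 2), here is a simplified sketch of a KD-style alignment term: it pools pre-softmax activations per class, softens them with a temperature, and penalizes the KL divergence between the distributions derived from the two modalities. The function name, the pooling over label masks, the symmetric form, and the temperature value are all assumptions for illustration; the paper's exact formulation may differ.

```python
import torch
import torch.nn.functional as F

def kd_alignment_loss(logits_ct, logits_mri, mask_ct, mask_mri, temperature=2.0):
    """Sketch of a cross-modality KD term (not the paper's exact loss).

    logits_*: (N, C, H, W) pre-softmax activations for each modality
    mask_*:   (N, H, W) integer label maps used to pool activations per class
    temperature: softening factor (value here is a guess)
    """
    n_classes = logits_ct.shape[1]
    loss = logits_ct.new_zeros(())
    for c in range(n_classes):
        sel_ct = (mask_ct == c)
        sel_mri = (mask_mri == c)
        if sel_ct.any() and sel_mri.any():
            # Average the C-dimensional activation vectors over the pixels
            # belonging to class c, yielding one distribution per modality.
            z_ct = logits_ct.permute(0, 2, 3, 1)[sel_ct].mean(dim=0)    # (C,)
            z_mri = logits_mri.permute(0, 2, 3, 1)[sel_mri].mean(dim=0)  # (C,)
            p_ct = F.softmax(z_ct / temperature, dim=0)
            p_mri = F.softmax(z_mri / temperature, dim=0)
            # Symmetric KL keeps the constraint bidirectional.
            loss = loss + F.kl_div(p_ct.log(), p_mri, reduction="sum")
            loss = loss + F.kl_div(p_mri.log(), p_ct, reduction="sum")
    return loss / n_classes
```

This term would be added to the ordinary per-modality segmentation losses, so the shared kernels are pushed toward predictions whose class-confidence structure agrees across CT and MRI.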
Summary
Anatomical structures are imaged with a variety of modalities depending on the clinical indication. Early fusion means concatenating multi-modal images as different channels at the input layer of a network; this strategy has demonstrated effectiveness for segmenting brain tissue [3]–[5] and brain lesions [6]–[8] in multiple sequences of MRI. More complex multi-modal CNNs have been designed by leveraging dense connections [12], inception modules [13], or multi-scale feature fusion [14], yet these more complicated models still follow the idea of combining modality-specific and shared layers. Our paper proposes a novel compact model for unpaired CT and MRI multi-modal segmentation, explicitly addressing distribution shift and distilling cross-modality knowledge. Code for our proposed approach is publicly available at https://github.com/carrenD/ummkd
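For contrast with the proposed method, the early-fusion baseline mentioned above can be sketched in a few lines: co-registered modalities are simply stacked as input channels of a single network. The class name and layer sizes are hypothetical; note that, unlike the paper's unpaired setting, this baseline requires paired, registered images.

```python
import torch
import torch.nn as nn

class EarlyFusionNet(nn.Module):
    """Early-fusion baseline sketch: modalities are concatenated as
    channels at the input layer (requires co-registered images)."""

    def __init__(self, n_modalities: int, n_classes: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(n_modalities, 32, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(32, n_classes, kernel_size=1),
        )

    def forward(self, *modalities: torch.Tensor) -> torch.Tensor:
        # e.g. forward(t1, t2, flair), each tensor shaped (N, 1, H, W)
        return self.net(torch.cat(modalities, dim=1))
```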