Abstract

In recent years, various encoder-decoder-based U-Net architecture has shown remarkable performance in medical image segmentation. However, these encoder-decoder U-Net has a drawback in learning multi-scale features in complex segmentation tasks and weak ability to generalize to other tasks. This paper proposed a generalize encoder-decoder model called dense dilated inception network (DDI-Net) for medical image segmentation by modifying U-Net architecture. We utilize three steps; firstly, we propose a dense path to replace the skip connection in the middle of the encoder and decoder to make the model deeper. Secondly, we replace the U-Net's basic convolution blocks with a modified inception module called multi-scale dilated inception module (MDI) to make the model wider without gradient vanish and with fewer parameters. Thirdly, data augmentation and normalization are applied to the training data to improve the model generalization. We evaluated the proposed model on three subtasks of the medical segmentation decathlon challenge. The experiment results prove that DDI-Net achieves superior performance than the compared methods with a Dice score of 0.82, 0.68, and 0.79 in brain tumor segmentation for edema, non-enhancing, and enhancing tumor. For the hippocampus segmentation, the result achieves 0.92 and 0.90 for anterior and posterior, respectively. For the heart segmentation, the method achieves 0.95 for the left atrial.

Highlights

  • Accurate and automated segmentation of anatomical structures is the most critical and challenging task in analyzing medical images

  • To overcome the above-aforementioned challenges, we propose a generalized encoder-decoder model called dense dilated inception network (DDI-Net) for medical image segmentation by modifying U-Net architecture

  • We evaluated our DDI-Net on three subtasks of medical segmentation decathlon challenge (MSD) datasets [23]

Read more

Summary

INTRODUCTION

Accurate and automated segmentation of anatomical structures is the most critical and challenging task in analyzing medical images. Many sophisticated CNN models have been proposed such as Alex Net [13], VGG [14], Google Net [15], Dense Net [16], ResNet [17], Deeplab [18], fully convolution network (FCN) [19] and U-Net [20] Among these CNN networks, U-Net, an encoder-decoder based model, makes an outstanding achievement and becomes the most famous model in medical image segmentation tasks and computer vision at large that outperformed the existing approaches [21]. To overcome the above-aforementioned challenges, we propose a generalized encoder-decoder model called dense dilated inception network (DDI-Net) for medical image segmentation by modifying U-Net architecture.

RELATED WORK
PROPOSED METHOD
Dense Path
Multi-Scale Dilated Inception Block
Datasets
Implementation Details
Evaluation Metric
Brain Tumour Segmentation
Hippocampus Segmentation
Heart Segmentation
Ablation Studies
Algorithm Run-Time
Comparison with State-of-the-Art Methods
Findings
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call