A Self-Spatial Adaptive Weighting Based U-Net for Image Segmentation

Choongsang Cho,Jongyoul Park,Young Han Lee,Sangkeun Lee

doi:10.3390/electronics10030348

Choongsang Cho, Jongyoul Park + Show 2 more

Open Access

https://doi.org/10.3390/electronics10030348

Copy DOI

Abstract

Semantic image segmentation has a wide range of applications. When it comes to medical image segmentation, its accuracy is even more important than those of other areas because the performance gives useful information directly applicable to disease diagnosis, surgical planning, and history monitoring. The state-of-the-art models in medical image segmentation are variants of encoder-decoder architecture, which is called U-Net. To effectively reflect the spatial features in feature maps in encoder-decoder architecture, we propose a spatially adaptive weighting scheme for medical image segmentation. Specifically, the spatial feature is estimated from the feature maps, and the learned weighting parameters are obtained from the computed map, since segmentation results are predicted from the feature map through a convolutional layer. Especially in the proposed networks, the convolutional block for extracting the feature map is replaced with the widely used convolutional frameworks: VGG, ResNet, and Bottleneck Resent structures. In addition, a bilinear up-sampling method replaces the up-convolutional layer to increase the resolution of the feature map. For the performance evaluation of the proposed architecture, we used three data sets covering different medical imaging modalities. Experimental results show that the network with the proposed self-spatial adaptive weighting block based on the ResNet framework gave the highest IoU and DICE scores in the three tasks compared to other methods. In particular, the segmentation network combining the proposed self-spatially adaptive block and ResNet framework recorded the highest 3.01% and 2.89% improvements in IoU and DICE scores, respectively, in the Nerve data set. Therefore, we believe that the proposed scheme can be a useful tool for image segmentation tasks based on the encoder-decoder architecture.

Highlights

Over the past few years, deep convolutional neural networks have made a lot of progress in computer vision-based tasks, including image classification [1,2], object detection [3,4], semantic segmentation [5,6], human pose estimation [7,8], image captioning [9,10], and so on
Considering the goal of segmentation, which assigns a category label to each pixel in the image, the segmentation result is obtained from the last feature map via the convolutional layer, so the feature maps in the encoder-decoder architecture should reflect the spatial characteristics of the task
In encoder-decoder architecture, we propose a spatial adaptive weighting method for encoder-decoder architecture to reflect the spatial characteristics of feature maps

Summary

Introduction

Over the past few years, deep convolutional neural networks have made a lot of progress in computer vision-based tasks, including image classification [1,2], object detection [3,4], semantic segmentation [5,6], human pose estimation [7,8], image captioning [9,10], and so on.Semantic image segmentation has a wide range of applications in the fields of computer vision, robotics, medical, and computer graphics. Image segmentation in natural images is used to parse the scene, and its performance has improved so that it can be applicable to automatic driving and robot sensing, to name a few [6,11]. When it comes to medical image segmentation, accuracy is even more important than other areas because the result gives important information for disease diagnosis, surgical planning, and history monitoring [12]. State-of-the-art scene segmentation frameworks for natural images are based on the fully convolutional network (FCN) [13], and the state-of-the-art models for medical image segmentation are variants of the encoder-decoder architecture called U-Net [14,15]. Considering the goal of segmentation, which assigns a category label to each pixel in the image, the segmentation result is obtained from the last feature map via the convolutional layer, so the feature maps in the encoder-decoder architecture should reflect the spatial characteristics of the task

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Electronics	Publication Date: Feb 2, 2021
Citations: 6	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A Self-Spatial Adaptive Weighting Based U-Net for Image Segmentation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Similar Papers

Segment anything model for few-shot medical image segmentation with domain tuning
Weili Shi ... Zhengang Jiang
Complex & Intelligent Systems | VOL. 11
Weili Shi, et. al.Weili Shi ... Zhengang Jiang
14 Nov 2024
Complex & Intelligent Systems | VOL. 11

Multi-scale context UNet-like network with redesigned skip connections for medical image segmentation
Ledan Qian ... Soo-Hyung Kim
Computer Methods and Programs in Biomedicine | VOL. 243
Ledan Qian, et. al.Ledan Qian ... Soo-Hyung Kim
27 Oct 2023
Computer Methods and Programs in Biomedicine | VOL. 243

R2U++: a multiscale recurrent residual U-Net with dense skip connections for medical image segmentation
Mehreen Mubashar ... Shoaib Azmat
Neural Computing and Applications | VOL. 34
Mehreen Mubashar, et. al.Mehreen Mubashar ... Shoaib Azmat
03 Jun 2022
Neural Computing and Applications | VOL. 34

Medical image segmentation with UNet-based multi-scale context fusion
Yongqi Yuan ... Yong Cheng
Scientific Reports | VOL. 14
Yongqi Yuan, et. al.Yongqi Yuan ... Yong Cheng
28 Oct 2024
Scientific Reports | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Self-Spatial Adaptive Weighting Based U-Net for Image Segmentation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Electronics