Improving Surgical Scene Semantic Segmentation through a Deep Learning Architecture with Attention to Class Imbalance.

Claudio Urrea, Yainet Garcia-Garcia, John Kern

doi:10.3390/biomedicines12061309

Abstract

This article addresses the semantic segmentation of laparoscopic surgery images, with special emphasis on structures that have few observations. The study proposes adjustment parameters for deep neural network architectures that enable robust segmentation of all structures in the surgical scene. Three families of architectures are implemented: U-Net with five encoder-decoders (U-Net5ed), SegNet-VGG19, and DeepLabv3+ with different backbones. Three main experiments are conducted, using the Rectified Linear Unit (ReLU), Gaussian Error Linear Unit (GELU), and Swish activation functions. The loss functions applied are Cross Entropy (CE), Focal Loss (FL), Tversky Loss (TL), Dice Loss (DiL), Cross Entropy Dice Loss (CEDL), and Cross Entropy Tversky Loss (CETL). The Stochastic Gradient Descent with momentum (SGDM) and Adaptive Moment Estimation (Adam) optimizers are compared. Qualitative and quantitative results confirm that the DeepLabv3+ and U-Net5ed architectures perform best. DeepLabv3+ with the ResNet-50 backbone, the Swish activation function, and the CETL loss function reports a Mean Accuracy (MAcc) of 0.976 and a Mean Intersection over Union (MIoU) of 0.977. For structures with few observations, such as the hepatic vein, cystic duct, liver ligament, and blood, the segmentation results are very competitive and promising compared with the consulted literature. The selected parameters were also validated on the YOLOv9 architecture, which showed improved semantic segmentation compared with the results obtained with the original architecture.
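To make the combined loss named in the abstract more concrete, the following is a minimal pure-Python sketch of a Cross Entropy Tversky Loss. It is an illustration under stated assumptions, not the authors' implementation: the function names, the `weight` mixing factor, the defaults `alpha = beta = 0.5`, and the per-class averaging are all hypothetical choices, since the abstract does not specify how the two terms are combined.

```python
import math

def cross_entropy(probs, labels, eps=1e-7):
    """Mean pixel-wise cross entropy.

    probs: list of per-pixel class-probability lists.
    labels: list of ground-truth class indices, one per pixel.
    """
    total = sum(math.log(max(p[y], eps)) for p, y in zip(probs, labels))
    return -total / len(labels)

def tversky_loss(probs, labels, num_classes, alpha=0.5, beta=0.5, eps=1e-7):
    """1 minus the mean per-class Tversky index, using soft counts.

    alpha weights false positives and beta weights false negatives;
    alpha = beta = 0.5 reduces the index to the Dice coefficient.
    Averaging over classes gives rare classes the same weight as frequent ones.
    """
    indices = []
    for c in range(num_classes):
        tp = sum(p[c] for p, y in zip(probs, labels) if y == c)
        fp = sum(p[c] for p, y in zip(probs, labels) if y != c)
        fn = sum(1.0 - p[c] for p, y in zip(probs, labels) if y == c)
        indices.append((tp + eps) / (tp + alpha * fp + beta * fn + eps))
    return 1.0 - sum(indices) / num_classes

def ce_tversky_loss(probs, labels, num_classes, weight=0.5, alpha=0.5, beta=0.5):
    """Assumed combination: a convex mix of the two terms (hypothetical)."""
    ce = cross_entropy(probs, labels)
    tl = tversky_loss(probs, labels, num_classes, alpha, beta)
    return weight * ce + (1.0 - weight) * tl

# Two pixels, two classes: perfect predictions versus uninformative ones.
perfect = ce_tversky_loss([[1.0, 0.0], [0.0, 1.0]], [0, 1], num_classes=2)
uncertain = ce_tversky_loss([[0.5, 0.5], [0.5, 0.5]], [0, 1], num_classes=2)
```

In this sketch, raising `beta` above `alpha` penalizes missed pixels more heavily than false alarms, which is one common way to favor recall on under-represented structures.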

Journal: Biomedicines
Publication Date: Jun 13, 2024
Citations: 2
License type: CC BY 4.0

Similar Papers

Multi-Scale Residual Deep Network for Semantic Segmentation of Buildings with Regularizer of Shape Representation
Chengyi Wang ... Lianfa Li
Remote Sensing | VOL. 12
10 Sep 2020

Semantic Segmentation Network Based on Adaptive Attention and Deep Fusion Utilizing a Multi-Scale Dilated Convolutional Pyramid.
Shan Zhao ... Fukai Zhang
Sensors (Basel, Switzerland) | VOL. 24
16 Aug 2024

Sampling-attention deep learning network with transfer learning for large-scale urban point cloud semantic segmentation
Yunxiang Zhou ... Xiaolong Xue
Engineering Applications of Artificial Intelligence | VOL. 117
16 Nov 2022

SHDM-NET: Heat map detail guidance with image matting for industrial weld semantic segmentation network
Qi Wang ... Hegui Zhu
Engineering Applications of Artificial Intelligence | VOL. 126
22 Aug 2023
