Architecture For Segmentation Research Articles

Deep neural networks are commonly used for automated medical image segmentation, but models will frequently struggle to generalize well across different imaging modalities. This issue is particularly problematic due to the limited availability of annotated data, both in the target as well as the source modality, making it difficult to deploy these models on a larger scale. To overcome these challenges, we propose a new semi-supervised training strategy called MoDATTS. Our approach is designed for accurate cross-modality 3D tumor segmentation on unpaired bi-modal datasets. An image-to-image translation strategy between modalities is used to produce synthetic but annotated images and labels in the desired modality and improve generalization to the unannotated target modality. We also use powerful vision transformer architectures for both image translation (TransUNet) and segmentation (Medformer) tasks and introduce an iterative self-training procedure in the later task to further close the domain gap between modalities, thus also training on unlabeled images in the target modality. MoDATTS additionally allows the possibility to exploit image-level labels with a semi-supervised objective that encourages the model to disentangle tumors from the background. This semi-supervised methodology helps in particular to maintain downstream segmentation performance when pixel-level label scarcity is also present in the source modality dataset, or when the source dataset contains healthy controls. The proposed model achieves superior performance compared to other methods from participating teams in the CrossMoDA 2022 vestibular schwannoma (VS) segmentation challenge, as evidenced by its reported top Dice score of 0.87±0.04 for the VS segmentation. MoDATTS also yields consistent improvements in Dice scores over baselines on a cross-modality adult brain gliomas segmentation task composed of four different contrasts from the BraTS 2020 challenge dataset, where 95% of a target supervised model performance is reached when no target modality annotations are available. We report that 99% and 100% of this maximum performance can be attained if 20% and 50% of the target data is additionally annotated, which further demonstrates that MoDATTS can be leveraged to reduce the annotation burden.

Read full abstract

Background The primary surgical approach for removing adrenal masses is minimally invasive adrenalectomy. Recognition of anatomical landmarks during surgery is critical for minimizing complications. Artificial intelligence-based tools can be utilized to create real-time navigation systems during laparoscopic and robotic right adrenalectomy. In this study, we aimed to develop deep learning models that can identify critical anatomical structures during minimally invasive right adrenalectomy. Methods In this experimental feasibility study, intraoperative videos of 20 patients who underwent minimally invasive right adrenalectomy in a tertiary care center between 2011 and 2023 were analyzed and used to develop an artificial intelligence-based anatomical landmark recognition system. Semantic segmentation of the liver, the inferior vena cava (IVC), and the right adrenal gland were performed. Fifty random images per patient during the dissection phase were extracted from videos. The experiments on the annotated images were performed on two state-of-the-art segmentation models named SwinUNETR and MedNeXt, which are transformer and convolutional neural network (CNN)-based segmentation architectures, respectively. Two loss function combinations, Dice-Cross Entropy and Dice-Focal Loss were experimented with for both of the models. The dataset was split into training and validation subsets with an 80:20 distribution on a patient basis in a 5-fold cross-validation approach. To introduce a sample variability to the dataset, strong-augmentation techniques were performed using intensity modifications and perspective transformations to represent different surgery environment scenarios. The models were evaluated by Dice Similarity Coefficient (DSC) and Intersection over Union (IoU) which are widely used segmentation metrics. For pixelwise classification performance, accuracy, sensitivity and specificity metrics were calculated on the validation subset. Results Out of 20 videos, 1000 images were extracted, and the anatomical landmarks (liver, IVC, and right adrenal gland) were annotated. Randomly distributed 800 images and 200 images were selected for the training and validation subsets, respectively. Our benchmark results show that the utilization of Dice-Cross Entropy Loss with the transformer-based SwinUNETR model achieved 78.37%, whereas the CNN-based MedNeXt model reached a 77.09% mDSC score. Conversely, MedNeXt reaches a higher mIoU score of 63.71% than SwinUNETR by 62.10% on a three-region prediction task. Conclusion Artificial intelligence-based systems can predict anatomical landmarks with high performance in minimally invasive right adrenalectomy. Such tools can later be used to create real-time navigation systems during surgery in the near future.

Read full abstract

Architecture For Segmentation Research Articles

Related Topics

Articles published on Architecture For Segmentation

HiSEG: Human assisted instance segmentation

Semantic Segmentation in Large-Size Orthomosaics to Detect the Vegetation Area in Opuntia spp. Crop.

Image-level supervision and self-training for transformer-based cross-modality tumor segmentation

Overcoming Remote Workforce Cyber Threats: A Comprehensive Ransomware and Bot Net Defense Strategy Utilizing VPN Networks

DeMambaNet: Deformable Convolution and Mamba Integration Network for High-Precision Segmentation of Ambiguously Defined Dental Radicular Boundaries.

MGB-Unet: An Improved Multiscale Unet with Bottleneck Transformer for Myositis Segmentation from Ultrasound Images.

Deep Learning-Based Localization and Detection of Malpositioned Nasogastric Tubes on Portable Supine Chest X-Rays in Intensive Care and Emergency Medicine: A Multi-center Retrospective Study.

Enhancing brain tumor segmentation in MRI images using the IC-net algorithm framework

Accurate segmentation of liver tumor from multi-modality non-contrast images using a dual-stream multi-level fusion framework

Deep Learning-Based Real-Time Ureter Identification in Laparoscopic Colorectal Surgery.

Improving Deep Learning-Based Algorithm for Ploidy Status Prediction Through Combined U-NET Blastocyst Segmentation and Sequential Time-Lapse Blastocysts Images.

Image semantic segmentation of indoor scenes: A survey

Towards Metric-Driven Difference Detection between Receptive and Nonreceptive Endometrial Samples Using Automatic Histology Image Analysis

P2AT: Pyramid pooling axial transformer for real-time semantic segmentation

Dominant Color Detection For Online Fashion Retrievals

Multichannel Sandstone Thin Sections Identification Based on Improved DeepLab V3 Plus Neural Network.

Automated end-to-end Architecture for Retinal Layers and Fluids Segmentation on OCT B-scans

Utilization of artificial intelligence in minimally invasive right adrenalectomy: recognition of anatomical landmarks with deep learning

NnSegNeXt: A 3D Convolutional Network for Brain Tissue Segmentation Based on Quality Evaluation.

Construction of Three-Dimensional Semantic Maps of Unstructured Lawn Scenes Based on Deep Learning

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Architecture For Segmentation Research Articles

Related Topics

Articles published on Architecture For Segmentation

HiSEG: Human assisted instance segmentation

Semantic Segmentation in Large-Size Orthomosaics to Detect the Vegetation Area in Opuntia spp. Crop.

Image-level supervision and self-training for transformer-based cross-modality tumor segmentation

Overcoming Remote Workforce Cyber Threats: A Comprehensive Ransomware and Bot Net Defense Strategy Utilizing VPN Networks

DeMambaNet: Deformable Convolution and Mamba Integration Network for High-Precision Segmentation of Ambiguously Defined Dental Radicular Boundaries.

MGB-Unet: An Improved Multiscale Unet with Bottleneck Transformer for Myositis Segmentation from Ultrasound Images.

Deep Learning-Based Localization and Detection of Malpositioned Nasogastric Tubes on Portable Supine Chest X-Rays in Intensive Care and Emergency Medicine: A Multi-center Retrospective Study.

Enhancing brain tumor segmentation in MRI images using the IC-net algorithm framework

Accurate segmentation of liver tumor from multi-modality non-contrast images using a dual-stream multi-level fusion framework

Deep Learning-Based Real-Time Ureter Identification in Laparoscopic Colorectal Surgery.

Improving Deep Learning-Based Algorithm for Ploidy Status Prediction Through Combined U-NET Blastocyst Segmentation and Sequential Time-Lapse Blastocysts Images.

Image semantic segmentation of indoor scenes: A survey

Towards Metric-Driven Difference Detection between Receptive and Nonreceptive Endometrial Samples Using Automatic Histology Image Analysis

P2AT: Pyramid pooling axial transformer for real-time semantic segmentation

Dominant Color Detection For Online Fashion Retrievals

Multichannel Sandstone Thin Sections Identification Based on Improved DeepLab V3 Plus Neural Network.

Automated end-to-end Architecture for Retinal Layers and Fluids Segmentation on OCT B-scans

Utilization of artificial intelligence in minimally invasive right adrenalectomy: recognition of anatomical landmarks with deep learning

NnSegNeXt: A 3D Convolutional Network for Brain Tissue Segmentation Based on Quality Evaluation.

Construction of Three-Dimensional Semantic Maps of Unstructured Lawn Scenes Based on Deep Learning