Two-stage Architecture Research Articles

Auto-segmentation of organs-at-risk (OARs) in the head and neck (HN) on computed tomography (CT) images is a time-consuming component of the radiation therapy pipeline that suffers from inter-observer variability. Deep learning (DL) has shown state-of-the-art results in CT auto-segmentation, with larger and more diverse datasets showing better segmentation performance. Institutional CT auto-segmentation datasets have been small historically (n<50) due to the time required for manual curation of images and anatomical labels. Recently, large public CT auto-segmentation datasets (n>1000 aggregated) have become available through online repositories such as The Cancer Imaging Archive. Transfer learning is a technique applied when training samples are scarce, but a large dataset from a closely related domain is available. The purpose of this study was to investigate whether a large public dataset could be used in place of an institutional dataset (n>500), or to augment performance via transfer learning, when building HN OAR auto-segmentation models for institutional use. Auto-segmentation models were trained on a large public dataset (public models) and a smaller institutional dataset (institutional models). The public models were fine-tuned on the institutional dataset using transfer learning (transfer models). We assessed both public model generalizability and transfer model performance by comparison with institutional models. Additionally, the effect of institutional dataset size on both transfer and institutional models was investigated. All DL models used a high-resolution, two-stage architecture based on the popular 3D U-Net. Model performance was evaluated using five geometric measures: the dice similarity coefficient (DSC), surface DSC, 95th percentile Hausdorff distance, mean surface distance (MSD), and added path length. For a small subset of OARs (left/right optic nerve, spinal cord, left submandibular), the public models performed significantly better (p<0.05) than, or showed no significant difference to, the institutional models under most of the metrics examined. For the remaining OARs, the public models were inferior to the institutional models, although performance differences were small (DSC≤0.03, MSD<0.5mm) for seven OARs (brainstem, left/right lens, left/right parotid, mandible, right submandibular). The transfer models performed significantly better than the institutional models for seven OARs (brainstem, right lens, left/right optic nerve, left/right parotid, spinal cord) with a small margin of improvement (DSC≤0.02, MSD<0.4mm). When numbers of institutional training samples were limited, public and transfer models outperformed the institutional models for most OARs (brainstem, left/right lens, left/right optic nerve, left/right parotid, spinal cord, and left/right submandibular). Training auto-segmentation models with public data alone was suitable for a small number of OARs. Using only public data incurred a small performance deficit for most other OARs, when compared with institutional data alone, but may be preferable over time-consuming curation of a large institutional dataset. When a large institutional dataset was available, transfer learning with models pretrained on a large public dataset provided a modest performance improvement for several OARs. When numbers of institutional samples were limited, using the public dataset alone, or as a pretrained model, was beneficial for most OARs.

Read full abstract

Various advanced reactor designs proposed in recent years envision deployment scenarios which feature reactor operations with significantly reduced staffing or even completely autonomous frameworks to reduce the operations and management costs of the plants. Many SMR and microreactor designs feature extended fuel cycles which limit inspection intervals, reduced access to critical components, and load-following capabilities that expose the reactor unit to different transients. Safe and reliable semi- or fully autonomous operations under these challenging operational regimes must be enabled through an on-line monitoring (OLM) system that effectively detects and diagnoses malfunctions in the reactor plant. This work presents the development of the Fault Diagnosis Module of the previously proposed data-driven OLM system for reactor operations: the Fault Detection and Diagnosis Monitoring System (FDDMS). The Fault Diagnosis Module monitors various sensor signatures from multiple systems and components at a nuclear power plant and accurately diagnoses the type and location of a malfunction. When integrated into the complete FDDMS methodology, the Fault Diagnosis Module can provide power transient dependent fault characterization by using separate convolutional neural network (CNN) models for steady state, ramping up in power, and ramping down in power operations, making the FDDMS especially applicable to load-following operational strategies. Two separate diagnosis approaches were explored in this paper for the Fault Diagnosis Module architecture: Hierarchical and End-to-End. The Hierarchical approach is a two-stage architecture in which the first stage uses a single CNN to classify the plant subsystem where the malfunction initiated. Subsequently in the second stage, a separate CNN is used for each subsystem to describe the specific fault type. The End-to-End approach uses a single CNN to directly classify the fault type from the overall list. To efficiently utilize computational resources, the hyperband intelligent hyperparameter optimization method to generate optimal CNN architectures. The diagnosis approaches were compared in terms of precision rate, recall rate, F1-score, and total accuracy for the possible fault types. Both diagnosis approaches produced good and satisfactory performance by generating total diagnosis accuracies above 99% on 17 different malfunction scenarios for all three power transient datasets. Additionally, robustness against noisy sensor data was tested, with models maintaining 99–100% accuracy at various levels of noise, and an illustrative real-time application of the methodology is provided.

Read full abstract

Two-stage Architecture Research Articles

Related Topics

Articles published on Two-stage Architecture

The two-stage detection-after-segmentation model improves the accuracy of identifying subdiaphragmatic lesions

Hybrid Transformer and Convolution for Image Compressed Sensing

Personality prediction via multi-task transformer architecture combined with image aesthetics

Cyber Intrusion Detection System Using Deep Learning Approach

Choroidal Layer Analysis in OCT images via Ambiguous Boundary-aware Attention

ZOZI-Seg: A transformer and UNet cascade network with Zoom-Out and Zoom-In scheme for aortic dissection segmentation in enhanced CT images

A two-stage transformer based network for motor imagery classification

Transfer learning for auto-segmentation of 17 organs-at-risk in the head and neck: Bridging the gap between institutional and public datasets.

CETR: CenterNet-Vision transformer model for wheat head detection

Weakly supervised large-scale pancreatic cancer detection using multi-instance learning.

Video-Based Multiphysiological Disentanglement and Remote Robust Estimation for Respiration.

Detecting fake reviewers from the social context with a graph neural network method

An intelligent fault detection and diagnosis monitoring system for reactor operational resilience: Fault diagnosis

A novel wavelet-transform-based convolution classification network for cervical lymph node metastasis of papillary thyroid carcinoma in ultrasound images.

Deep neural architecture for natural language image synthesis for Tamil text using BASEGAN and hybrid super resolution GAN (HSRGAN)

Personalized Ambient Pollution Estimation Based on Stationary-Camera-Taken Images Under Cross-Camera Information Sharing in Smart City

A Pareto-based two-stage evolutionary algorithm for flexible job shop scheduling problem with worker cooperation flexibility

Unifying Convolution and Transformer for Efficient Concealed Object Detection in Passive Millimeter-Wave Images

An End-to-End Online Traffic-Risk Incident Prediction in First-Person Dash Camera Videos

Higher-order memory guided temporal random walk for dynamic heterogeneous network embedding

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Two-stage Architecture Research Articles

Related Topics

Articles published on Two-stage Architecture

The two-stage detection-after-segmentation model improves the accuracy of identifying subdiaphragmatic lesions

Hybrid Transformer and Convolution for Image Compressed Sensing

Personality prediction via multi-task transformer architecture combined with image aesthetics

Cyber Intrusion Detection System Using Deep Learning Approach

Choroidal Layer Analysis in OCT images via Ambiguous Boundary-aware Attention

ZOZI-Seg: A transformer and UNet cascade network with Zoom-Out and Zoom-In scheme for aortic dissection segmentation in enhanced CT images

A two-stage transformer based network for motor imagery classification

Transfer learning for auto-segmentation of 17 organs-at-risk in the head and neck: Bridging the gap between institutional and public datasets.

CETR: CenterNet-Vision transformer model for wheat head detection

Weakly supervised large-scale pancreatic cancer detection using multi-instance learning.

Video-Based Multiphysiological Disentanglement and Remote Robust Estimation for Respiration.

Detecting fake reviewers from the social context with a graph neural network method

An intelligent fault detection and diagnosis monitoring system for reactor operational resilience: Fault diagnosis

A novel wavelet-transform-based convolution classification network for cervical lymph node metastasis of papillary thyroid carcinoma in ultrasound images.

Deep neural architecture for natural language image synthesis for Tamil text using BASEGAN and hybrid super resolution GAN (HSRGAN)

Personalized Ambient Pollution Estimation Based on Stationary-Camera-Taken Images Under Cross-Camera Information Sharing in Smart City

A Pareto-based two-stage evolutionary algorithm for flexible job shop scheduling problem with worker cooperation flexibility

Unifying Convolution and Transformer for Efficient Concealed Object Detection in Passive Millimeter-Wave Images

An End-to-End Online Traffic-Risk Incident Prediction in First-Person Dash Camera Videos

Higher-order memory guided temporal random walk for dynamic heterogeneous network embedding