Small Training Dataset Research Articles

Typically, the current dose prediction models are limited to small amounts of data and require retraining for a specific site, often leading to suboptimal performance. We propose a site-agnostic, three-dimensional dose distribution prediction model using deep learning that can leverage data from any treatment site, thus increasing the total data available to train the model. Applying our proposed model to a new target treatment site requires only a brief fine-tuning of the model to the new data and involves no modifications to the model input channels or its parameters. Thus, it can be efficiently adapted to a different treatment site, even with a small training dataset. This study uses two separate datasets/treatment sites: data from patients with prostate cancer treated with intensity-modulated radiation therapy (source data), and data from patients with head-and-neck cancer treated with volumetric-modulated arc therapy (target data). We first developed a source model with 3D UNet architecture, trained from random initial weights on the source data. We evaluated the performance of this model on the source data. We then studied the generalizability of the model to the new target dataset via transfer learning. To do this, we built three more models, all with the same 3D UNet architecture: target model, adapted model, and combined model. The source and target models were trained on the source and target data from random initial weights, respectively. The adapted model fine-tuned the source model to the target domain by using the target data. Finally, the combined model was trained from random initial weights on a combined data pool consisting of both target and source datasets. We tested all four models on the target dataset and evaluated quantitative dose-volume histogram metrics for the planning target volume (PTV) and organs at risk (OARs). When tested on the source treatment site, the source model accurately predicted the dose distributions with average (mean, max) absolute dose errors of (0.32%±0.14, 2.37%±0.93) (PTV) relative to the prescription dose, and highest mean dose error of 1.68%±0.76, and highest max dose error of 5.47%± 3.31 for femoral head right. The error in PTV dose coverage prediction is 3.21%±1.51 for D98 , 3.04%±1.69 for D95 , and 1.83%±1.01 for D02 . Averaging across all OARs, the source model predicted the OAR mean dose within 1.38% and the OAR max dose within 3.64%. For the target treatment site, the target model average (mean, max) absolute dose errors relative to the prescription dose for the PTV were (1.08%±0.95, 2.90%±1.35). Left cochlea had the highest mean and max dose errors of 5.37%±5.82 and 8.33%±8.88, respectively. The errors in PTV dose coverage prediction for D98 and D95 were 2.88%±1.59 and 2.55%±1.28, respectively. The target model can predict the OAR mean dose within 2.43% and the OAR max dose within 4.33% on average across all OARs. We developed a site-agnostic model for three-dimensional dose prediction and tested its adaptability to a new target treatment site via transfer learning. Our proposed model can make accurate predictions with limited training data.

Read full abstract

Due to the strong speckle noise caused by the seabed reverberation which makes it difficult to extract discriminating and noiseless features of a target, recognition and classification of underwater targets using side-scan sonar (SSS) images is a big challenge. Moreover, unlike classification of optical images which can use a large dataset to train the classifier, classification of SSS images usually has to exploit a very small dataset for training, which may cause classifier overfitting. Compared with traditional feature extraction methods using descriptors—such as Haar, SIFT, and LBP—deep learning-based methods are more powerful in capturing discriminating features. After training on a large optical dataset, e.g., ImageNet, direct fine-tuning method brings improvement to the sonar image classification using a small-size SSS image dataset. However, due to the different statistical characteristics between optical images and sonar images, transfer learning methods—e.g., fine-tuning—lack cross-domain adaptability, and therefore cannot achieve very satisfactory results. In this paper, a multi-domain collaborative transfer learning (MDCTL) method with multi-scale repeated attention mechanism (MSRAM) is proposed for improving the accuracy of underwater sonar image classification. In the MDCTL method, low-level characteristic similarity between SSS images and synthetic aperture radar (SAR) images, and high-level representation similarity between SSS images and optical images are used together to enhance the feature extraction ability of the deep learning model. Using different characteristics of multi-domain data to efficiently capture useful features for the sonar image classification, MDCTL offers a new way for transfer learning. MSRAM is used to effectively combine multi-scale features to make the proposed model pay more attention to the shape details of the target excluding the noise. Experimental results of classification show that, in using multi-domain data sets, the proposed method is more stable with an overall accuracy of 99.21%, bringing an improvement of 4.54% compared with the fine-tuned VGG19. Results given by diverse visualization methods also demonstrate that the method is more powerful in feature representation by using the MDCTL and MSRAM.

Read full abstract

Small Training Dataset Research Articles

Related Topics

Articles published on Small Training Dataset

A Study on Prediction Skills and Reading Efficiency in College English Based on Optimized BP Networks

Nonlinear Schrödinger Kernel for Hardware Acceleration of Machine Learning

A Synaptic Pruning-Based Spiking Neural Network for Hand-Written Digits Classification.

Parameter continuity in time-varying Gauss–Markov models for learning from small training data sets

Scattering response modeling scheme based on combined neural network inspired by the equivalent scattering center.

Neural network training using ℓ1-regularization and bi-fidelity data

Computing With Networks of Chemical Oscillators and its Application for Schizophrenia Diagnosis.

AraConv: Developing an Arabic Task-Oriented Dialogue System Using Multi-Lingual Transformer Model mT5

Deep learning image transmission through a multimode fiber based on a small training dataset.

Sheared edge defect segmentation using a convolutional U-Net for quantified quality assessment of fine blanked workpieces

Regularizing deep networks with label geometry for accurate object localization on small training datasets

Tooth Instance Segmentation on Panoramic Dental Radiographs Using U-Nets and Morphological Processing

Site-agnostic 3D dose distribution prediction with deep learning neural networks.

Automated Breast Cancer Detection Models Based on Transfer Learning.

A Multi-Domain Collaborative Transfer Learning Method with Multi-Scale Repeated Attention Mechanism for Underwater Side-Scan Sonar Image Classification

(Retracted) Estimation of human age by features of face and eyes based on multilevel feature convolutional neural network

Meta domain generalization for smart manufacturing: Tool wear prediction with small data

Radar HRRP Target Recognition Model Based on a Stacked CNN–Bi-RNN With Attention Mechanism

Constructing an Surrogate Model for Pedestrian Protection Using Machine Learning

Memory-Modulated Transformer Network for Heterogeneous Face Recognition

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Small Training Dataset Research Articles

Related Topics

Articles published on Small Training Dataset

A Study on Prediction Skills and Reading Efficiency in College English Based on Optimized BP Networks

Nonlinear Schrödinger Kernel for Hardware Acceleration of Machine Learning

A Synaptic Pruning-Based Spiking Neural Network for Hand-Written Digits Classification.

Parameter continuity in time-varying Gauss–Markov models for learning from small training data sets

Scattering response modeling scheme based on combined neural network inspired by the equivalent scattering center.

Neural network training using ℓ1-regularization and bi-fidelity data

Computing With Networks of Chemical Oscillators and its Application for Schizophrenia Diagnosis.

AraConv: Developing an Arabic Task-Oriented Dialogue System Using Multi-Lingual Transformer Model mT5

Deep learning image transmission through a multimode fiber based on a small training dataset.

Sheared edge defect segmentation using a convolutional U-Net for quantified quality assessment of fine blanked workpieces

Regularizing deep networks with label geometry for accurate object localization on small training datasets

Tooth Instance Segmentation on Panoramic Dental Radiographs Using U-Nets and Morphological Processing

Site-agnostic 3D dose distribution prediction with deep learning neural networks.

Automated Breast Cancer Detection Models Based on Transfer Learning.

A Multi-Domain Collaborative Transfer Learning Method with Multi-Scale Repeated Attention Mechanism for Underwater Side-Scan Sonar Image Classification

(Retracted) Estimation of human age by features of face and eyes based on multilevel feature convolutional neural network

Meta domain generalization for smart manufacturing: Tool wear prediction with small data

Radar HRRP Target Recognition Model Based on a Stacked CNN–Bi-RNN With Attention Mechanism

Constructing an Surrogate Model for Pedestrian Protection Using Machine Learning

Memory-Modulated Transformer Network for Heterogeneous Face Recognition