Mini-batch Size Research Articles

In recent years, deep learning-based crack detection methods have been widely explored and applied due to their high versatility and adaptability. In civil engineering applications, recent research on crack detection through deep convolutional neural network (DCNN) includes road pavement crack detection, bridge inspection, defects detection in shield tunnel lining, etc. Despite the increasing popularity of DCNN on crack detection, many challenges have yet to be properly addressed. For crack detection using three-dimensional (3D) range (i.e., elevation) images, disturbances such as surface variation can negatively affect the detection performance. Besides, some typical non-crack patterns such as grooves can be easily misidentified as cracks, i.e., false positives. Another issue lies in the selection of hyperparameters related with the design of a DCNN architecture. For example, the hyperparameters which are related with network structure (e.g., kernel size, network depth and width) and training (e.g., mini-batch size and learning rate) can impact the network performance to a significant extent. Therefore, they need to be properly determined for optimal performance. However, for deep learning-based roadway crack classification using laser-scanned range images, a comprehensive discussion on the hyperparameter selection/tuning has not been thoroughly performed. This study develops a hyperparameter selection process involving a series of experiments on laser-scanned range images with high diversities, investigating the optimal joint hyperparameter configuration on network structure and training for DCNN-based roadway crack classification. In a comparative study, 36 DCNN architectures with varying layouts are developed for crack classification. These architecture candidates differ in kernel sizes (e.g., 3 × 3, 7 × 7, and 11 × 11), network depths (from 5 to 8 weight layers), and widths (from 16 to 96 kernels in each convolutional layer). The 7-layer DCNN with constant 7 × 7 kernels and increasing network widths yields the highest classification performance among the proposed 36 DCNN classifiers, which may be because it can best reflect the complexity of the acquired laser-scanned roadway range images. Once the optimal architecture layout is determined, further discussion on the selection of min-batch sizes, learning rates, dropout factor and leaky rectified linear unit (LReLU) factor is performed. Experimental results show the optimal architecture with associated training configuration can achieve consistent and accurate performance, under the contamination of surface variations and grooved patterns in laser-scanned range images. Discussion on the hyperparameter selection can provide insights for the development of DCNN in similar applications using laser-scanned range images.

Read full abstract

Scatter is a major factor degrading the image quality of cone beam computed tomography (CBCT). Conventional scatter correction strategies require handcrafted analytical models with ad hoc assumptions, which often leads to less accurate scatter removal. This study aims to develop an effective scatter correction method using a residual convolutional neural network (CNN). A U-net based 25-layer CNN was constructed for CBCT scatter correction. The establishment of the model consists of three steps: model training, validation, and testing. For model training, a total of 1800 pairs of x-ray projection and the corresponding scatter-only distribution in nonanthropomorphic phantoms taken in full-fan scan were generated using Monte Carlo simulation of a CBCT scanner installed with a proton therapy system. An end-to-end CNN training was implemented with two major loss functions for 100 epochs with a mini-batch size of 10. Image rotations and flips were randomly applied to augment the training datasets during training. For validation, 200 projections of a digital head phantom were collected. The proposed CNN-based method was compared to a conventional projection-domain scatter correction method named fast adaptive scatter kernel superposition (fASKS) method using 360 projections of an anthropomorphic head phantom. Two different loss functions were applied for the same CNN to evaluate the impact of loss functions on the final results. Furthermore, the CNN model trained with full-fan projections was fine-tuned for scatter correction in half-fan scan by using transfer learning with additional 360 half-fan projection pairs of nonanthropomorphic phantoms. The tuned-CNN model for half-fan scan was compared with the fASKS method as well as the CNN-based method without the fine-tuning using additional lung phantom projections. The CNN-based method provides projections with significantly reduced scatter and CBCT images with more accurate Hounsfield Units (HUs) than that of the fASKS-based method. Root mean squared error of the CNN-corrected projections was improved to 0.0862 compared to 0.278 for uncorrected projections or 0.117 for the fASKS-corrected projections. The CNN-corrected reconstruction provided better HU quantification, especially in regions near the air or bone interfaces. All four image quality measures, which include mean absolute error (MAE), mean squared error (MSE), peak signal-to-noise ratio (PSNR), and structural similarity (SSIM), indicated that the CNN-corrected images were significantly better than that of the fASKS-corrected images. Moreover, the proposed transfer learning technique made it possible for the CNN model trained with full-fan projections to be applicable to remove scatters in half-fan projections after fine-tuning with only a small number of additional half-fan training datasets. SSIM value of the tuned-CNN-corrected images was 0.9993 compared to 0.9984 for the non-tuned-CNN-corrected images or 0.9990 for the fASKS-corrected images. Finally, the CNN-based method is computationally efficient - the correction time for the 360 projections only took less than 5s in the reported experiments on a PC (4.20GHz Intel Core-i7 CPU) with a single NVIDIA GTX 1070 GPU. The proposed deep learning-based method provides an effective tool for CBCT scatter correction and holds significant value for quantitative imaging and image-guided radiation therapy.

Read full abstract

Mini-batch Size Research Articles

Related Topics

Articles published on Mini-batch Size

Genome-Wide Prediction of Complex Traits in Two Outcrossing Plant Species Through Deep Learning and Bayesian Regularized Neural Network.

Detection of Bacterial Wilt on Enset Crop Using Deep Learning Approach

運転データに基づく建築設備のANNモデル構築手法に関する研究 (第1報)ミニバッチサイズが予測精度へ与える影響の評価

Deep Learning for SVD and Hybrid Beamforming

Inexact SARAH algorithm for stochastic optimization

Properties of the stochastic approximation EM algorithm with mini-batch sampling

Optimal Scree-CNN for Detecting NS1 Molecular Fingerprint from Salivary SERS Spectra.

Scalable and Practical Natural Gradient for Large-Scale Deep Learning.

Stochastic quasi-gradient methods: variance reduction via Jacobian sketching

Deep learning-based roadway crack classification using laser-scanned range images: A comparative study on hyperparameter selection

Crowding prediction on mass rapid transit systems using a weighted bidirectional recurrent neural network

FPDeep: Scalable Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters

A 2.9–33.0 TOPS/W Reconfigurable 1-D/2-D Compute-Near-Memory Inference Accelerator in 10-nm FinFET CMOS

An Approach to Hyperparameter Optimization for the Objective Function in Machine Learning

A Neuronal Morphology Classification Approach Based on Locally Cumulative Connected Deep Neural Networks

Random Minibatch Subgradient Algorithms for Convex Problems with Functional Constraints

Comprehensive techniques of multi-GPU memory optimization for deep learning acceleration

Learning with Type-2 Fuzzy activation functions to improve the performance of Deep Neural Networks

Projection-domain scatter correction for cone beam computed tomography using a residual convolutional neural network.

BOA: batch orchestration algorithm for straggler mitigation of distributed DL training in heterogeneous GPU cluster

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Mini-batch Size Research Articles

Related Topics

Articles published on Mini-batch Size

Genome-Wide Prediction of Complex Traits in Two Outcrossing Plant Species Through Deep Learning and Bayesian Regularized Neural Network.

Detection of Bacterial Wilt on Enset Crop Using Deep Learning Approach

運転データに基づく建築設備のANNモデル構築手法に関する研究 (第1報)ミニバッチサイズが予測精度へ与える影響の評価

Deep Learning for SVD and Hybrid Beamforming

Inexact SARAH algorithm for stochastic optimization

Properties of the stochastic approximation EM algorithm with mini-batch sampling

Optimal Scree-CNN for Detecting NS1 Molecular Fingerprint from Salivary SERS Spectra.

Scalable and Practical Natural Gradient for Large-Scale Deep Learning.

Stochastic quasi-gradient methods: variance reduction via Jacobian sketching

Deep learning-based roadway crack classification using laser-scanned range images: A comparative study on hyperparameter selection

Crowding prediction on mass rapid transit systems using a weighted bidirectional recurrent neural network

FPDeep: Scalable Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters

A 2.9–33.0 TOPS/W Reconfigurable 1-D/2-D Compute-Near-Memory Inference Accelerator in 10-nm FinFET CMOS

An Approach to Hyperparameter Optimization for the Objective Function in Machine Learning

A Neuronal Morphology Classification Approach Based on Locally Cumulative Connected Deep Neural Networks

Random Minibatch Subgradient Algorithms for Convex Problems with Functional Constraints

Comprehensive techniques of multi-GPU memory optimization for deep learning acceleration

Learning with Type-2 Fuzzy activation functions to improve the performance of Deep Neural Networks

Projection-domain scatter correction for cone beam computed tomography using a residual convolutional neural network.

BOA: batch orchestration algorithm for straggler mitigation of distributed DL training in heterogeneous GPU cluster