Output Layer Of Network Research Articles

The success of deep learning in many real-world tasks has triggered an intense effort to understand the power and limitations of deep learning in the training and generalization of complex tasks, so far with limited progress. In this work, we study the statistical mechanics of learning in Deep Linear Neural Networks (DLNNs) in which the input-output function of an individual unit is linear. Despite the linearity of the units, learning in DLNNs is nonlinear, hence studying its properties reveals some of the features of nonlinear Deep Neural Networks (DNNs). Importantly, we solve exactly the network properties following supervised learning using an equilibrium Gibbs distribution in the weight space. To do this, we introduce the Back-Propagating Kernel Renormalization (BPKR), which allows for the incremental integration of the network weights starting from the network output layer and progressing backward until the first layer's weights are integrated out. This procedure allows us to evaluate important network properties, such as its generalization error, the role of network width and depth, the impact of the size of the training set, and the effects of weight regularization and learning stochasticity. BPKR does not assume specific statistics of the input or the task's output. Furthermore, by performing partial integration of the layers, the BPKR allows us to compute the properties of the neural representations across the different hidden layers. We have proposed an extension of the BPKR to nonlinear DNNs with ReLU. Surprisingly, our numerical simulations reveal that despite the nonlinearity, the predictions of our theory are largely shared by ReLU networks in a wide regime of parameters. Our work is the first exact statistical mechanical study of learning in a family of DNNs, and the first successful theory of learning through successive integration of DoFs in the learned weight space.

The conventional phase retrieval wavefront sensing approaches mainly refer to a series of iterative algorithms, such as G-S algorithms, Y-G algorithms and error reduction algorithms. These methods use intensity information to calculate the wavefront phase. However, most of the traditional phase retrieval algorithms are difficult to meet the real-time requirements and depend on the iteration initial value used in iterative transformation or iterative optimization to some extent, so their practicalities are limited. To solve these problems, in this paper, a phase-diversity phase retrieval wavefront sensing method based on wavelet transform image fusion and convolutional neural network is proposed. Specifically, the image fusion method based on wavelet transform is used to fuse the point spread functions at the in-focus and defocus image planes, thereby simplifying the network inputs without losing the image information. The convolutional neural network (CNN) can directly extract image features and fit the required nonlinear mapping. In this paper, the CNN is utilized to establish the nonlinear mapping between the fusion images and wavefront distortions (represented by Zernike polynomials), that is, the fusion images are taken as the input data, and the corresponding Zernike coefficients as the output data. The network structure of the training in this paper has 22 layers, they are 1 input layer, 13 convolution layers, 6 pooling layers, 1 flatten layer and 1 full connection layer, that is, the output layer. The size of the convolution kernel is 3 × 3 and the step size is 1. The pooling method selects the maximum pooling and the size of the pooling kernel is 2 × 2. The activation function is ReLU, the optimization function is Adam, the loss function is the MSE, and the learning rate is 0.0001. The number of training data is 10000, which is divided into three parts: training set, validation set, and test set, accounting for 80%, 15% and 5% respectively. Trained CNN can directly output the Zernike coefficients of order 4–9 to a high precision, with these fusion images serving as the input, which is more in line with the real-time requirements. Abundant simulation experiments prove that the wavefront sensing precision is root-mean-square(RMS) 0.015<i>λ</i>, when the dynamic range of the wavefront is the aberration of low spatial frequency within 1.1<i>λ</i> of RMS value (i.e. the dynamic range of Zernike coefficients of order 4–9 is <inline-formula><tex-math id="M600">\begin{document}$[- 0.5\lambda \,, \, 0.5\lambda]$\end{document}</tex-math><alternatives><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="5-20201362_M600.jpg"/><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="5-20201362_M600.png"/></alternatives></inline-formula>). In practical application, according to the system aberration characteristics, the number of network output layer units can be changed and the network structure can be adjusted based on the method presented in this paper, thereby training the new network suitable for higher order aberration to realize high-precision wavefront sensing. It is also proved that the proposed method has certain robustness against noise, and when the relative defocus error is within 7.5%, the wavefront sensor accuracy is acceptable. With the improvement of image resolution, the wavefront sensing accuracy is improved, but the number of input data of the network also increases with the sampling rate increasing, and the time cost of network training increases accordingly.

Output Layer Of Network Research Articles

Related Topics

Articles published on Output Layer Of Network

Activation Functions for Neural Networks: Application and Performance-based Comparison

FSKT‐GE: Feature maps similarity knowledge transfer for low‐resolution gaze estimation

A real-time method for detecting bottom defects of lithium batteries based on an improved YOLOv5 model

DPA‐UNet rectal cancer image segmentation based on visual attention

Improved YOLOv3 detection method for PCB plug-in solder joint defects based on ordered probability density weighting and attention mechanism

A physics‐guided neural network‐based approach to velocity model calibration for microseismic data

Mining of Weak Fault Information Adaptively Based on DNN Inversion Estimation for Fault Diagnosis of Rotating Machinery

Statistical Mechanics of Deep Linear Neural Networks: The Backpropagating Kernel Renormalization

Airborne infrared aircraft target detection algorithm based on YOLOv4-tiny

Crowd Evacuation Guidance Based on Combined Action Reinforcement Learning

Prediction of the Slope Solute Loss Based on BP Neural Network

Diagnosis Support Model of Cardiomegaly Based on CNN Using ResNet and Explainable Feature Map

Phase retrieval wavefront sensing based on image fusion and convolutional neural network

Naturalistic Driver Intention and Path Prediction Using Recurrent Neural Networks

Infrared Target Extraction Based on Immune Extension Neural Network

Neural network–based speed control method and experimental verification for electromagnetic direct drive vehicle robot driver

Prediction on Steel Corrosion Amount After the Concrete Cracking Due to Corrosion Expansion Based on Generalized Neural Network

Learning Fuzzy Network Using Sequence Bound Global Particle Swarm Optimizer

Evolving neural network using a genetic algorithm for predicting the deformation modulus of rock masses

Determining the structure of a radial basis function network for prediction of nonlinear hydrological time series

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Output Layer Of Network Research Articles

Related Topics

Articles published on Output Layer Of Network

Activation Functions for Neural Networks: Application and Performance-based Comparison

FSKT‐GE: Feature maps similarity knowledge transfer for low‐resolution gaze estimation

A real-time method for detecting bottom defects of lithium batteries based on an improved YOLOv5 model

DPA‐UNet rectal cancer image segmentation based on visual attention

Improved YOLOv3 detection method for PCB plug-in solder joint defects based on ordered probability density weighting and attention mechanism

A physics‐guided neural network‐based approach to velocity model calibration for microseismic data

Mining of Weak Fault Information Adaptively Based on DNN Inversion Estimation for Fault Diagnosis of Rotating Machinery

Statistical Mechanics of Deep Linear Neural Networks: The Backpropagating Kernel Renormalization

Airborne infrared aircraft target detection algorithm based on YOLOv4-tiny

Crowd Evacuation Guidance Based on Combined Action Reinforcement Learning

Prediction of the Slope Solute Loss Based on BP Neural Network

Diagnosis Support Model of Cardiomegaly Based on CNN Using ResNet and Explainable Feature Map

Phase retrieval wavefront sensing based on image fusion and convolutional neural network

Naturalistic Driver Intention and Path Prediction Using Recurrent Neural Networks

Infrared Target Extraction Based on Immune Extension Neural Network

Neural network–based speed control method and experimental verification for electromagnetic direct drive vehicle robot driver

Prediction on Steel Corrosion Amount After the Concrete Cracking Due to Corrosion Expansion Based on Generalized Neural Network

Learning Fuzzy Network Using Sequence Bound Global Particle Swarm Optimizer

Evolving neural network using a genetic algorithm for predicting the deformation modulus of rock masses

Determining the structure of a radial basis function network for prediction of nonlinear hydrological time series