The optimization of machine learning models remains an open research question, since an optimal procedure for changing the learning rate throughout training is still unknown. Manually defining a learning rate schedule involves troublesome, time-consuming trial-and-error procedures to determine hyperparameters, such as learning rate decay epochs and decay rates, in order to navigate the intricate landscape of loss functions. Although adaptive learning rate optimizers automate this process, recent studies suggest they may cause overfitting and reduce performance compared to fine-tuned learning rate schedules. Considering that modern machine learning loss landscapes contain far more saddle points than local minima, we propose the Training Aware Sigmoidal Optimizer (TASO), an automated two-phase learning rate adaptation mechanism that significantly reduces the need for manual hyperparameter tuning. The first phase uses a high learning rate to quickly traverse the numerous saddle points in the error surface, while the second phase uses a low learning rate to gradually approach the center of the local minimum found previously. We compared the proposed approach with commonly used adaptive learning rate optimizers such as Adam, RMSProp, and Adagrad. Validation experiments on image and text datasets showed that TASO outperformed all competing methods in both optimal (i.e., with hyperparameter validation) and suboptimal (i.e., using default hyperparameters) scenarios. In our benchmark tests, TASO achieved an average 8.32% increase in accuracy and a 46.62% decrease in training loss across datasets and models, positioning it ahead of well-established adaptive optimizers and suggesting greater effectiveness and consistency in performance.
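To make the two-phase idea concrete, the following is a minimal sketch of a sigmoidal high-to-low learning rate schedule of the kind described above; the function name, parameters (lr_high, lr_low, steepness), and the midpoint placement of the transition are illustrative assumptions, not the exact TASO formulation from the paper.

```python
import math

def sigmoidal_lr(epoch, total_epochs, lr_high=0.1, lr_low=0.001, steepness=10.0):
    """Illustrative two-phase sigmoidal learning rate schedule.

    Phase 1 (early epochs): lr stays close to lr_high to traverse saddle points quickly.
    Phase 2 (late epochs):  lr decays smoothly toward lr_low to settle into the minimum.
    """
    # Training progress in [0, 1]; the sigmoid's inflection point sits at the midpoint.
    progress = epoch / max(total_epochs - 1, 1)
    decay = 1.0 / (1.0 + math.exp(steepness * (progress - 0.5)))
    return lr_low + (lr_high - lr_low) * decay

# Example: print the schedule over 20 epochs.
if __name__ == "__main__":
    for epoch in range(20):
        print(epoch, round(sigmoidal_lr(epoch, 20), 5))
```

In practice, such a schedule could be plugged into a standard SGD training loop by recomputing the learning rate at the start of each epoch; the steepness parameter controls how abruptly training switches from the exploratory high-rate phase to the fine-tuning low-rate phase.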