Deep Residual Network for Image Recognition

  • Abstract
  • Literature Map
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon

Training of a neural network is easier than it goes deeper. Deeper architecture makes neural networks more difficult to train because of vanishing gradient and complexity problems, and via this training, deeper neural networks become much time taking and high utilization of computer resources. Introducing residual blocks in neural networks train specifically deeper architecture networks than those used previously. Residual networks gain this achievement by attaching a trip connection to the layers of artificial neural networks. This paper is about showing residual networks and how they work like formulas, we will see residual networks obtain good accuracy, and as well as the model is easier to optimize because Res Net makes training of large structured neural networks more efficient. We will check residual nets on the Image Net dataset with a depth of 152 layers which is 8x more intense than VGG nets yet very less complex. After building this architecture of residual nets gets error up to 3.57% on the Image Net test dataset. We also compare the Res Net result to its equivalent Convolutional Network without residual connection. Our results show that ResNet provides higher accuracy but apart from that, it is more prone to over fitting. Stochastic augmentation of training datasets and adding dropout layers in networks are some of the over fitting prevention methods.

Similar Papers
  • Book Chapter
  • Cite Count Icon 39
  • 10.1007/978-3-030-80126-7_53
Study of Residual Networks for Image Recognition
  • Jan 1, 2021
  • Mohammad Sadegh Ebrahimi + 1 more

Deep neural networks have demonstrated a high potential on image classification tasks while presenting new computational challenges to the machine learning community. Due to the complexity and vanishing gradient problem, it normally takes longer time and more computational power to train deeper neural networks. To address some of these issues, deep Residual Networks (ResNets) can expedite the training process and attain more accuracy compared to their equivalent neural networks without the residual connections. ResNets often achieve this improvement by adding a simple skip connection parallel to convolutional layers in neural networks. Although over the past few years, ResNets have proven to be effective in advancing the performance of deep learning models, the best practices and trade-offs regarding adding residual connections to deep networks, and the exact improvement and disadvantages of these connections during the learning process are not well understood. In this project, we designed ResNet models that can perform a simple image classification task on the Tiny ImageNet datasets. For control, we then compare the performance of these ResNet models with their equivalent Convolutional Network (ConvNet) by removing the residual connections. Our findings illustrate that despite their higher accuracy, ResNets are more prone to overfitting, and that may depend on the depth of the network. We show that several methods to prevent overfittings, such as adding dropout layers and stochastic augmentation of the training dataset can be effective in attenuating this problem in ResNets.KeywordsDeep learningResidual networksComputer vision

  • Research Article
  • Cite Count Icon 21
  • 10.1016/j.vlsi.2020.05.002
Logarithm-approximate floating-point multiplier is applicable to power-efficient neural network training
  • May 14, 2020
  • Integration
  • Taiyu Cheng + 4 more

Logarithm-approximate floating-point multiplier is applicable to power-efficient neural network training

  • Book Chapter
  • Cite Count Icon 79
  • 10.1007/978-981-19-8825-7_21
ResNet: Solving Vanishing Gradient in Deep Networks
  • Jan 1, 2023
  • Lokesh Borawar + 1 more

Training of a neural network is easier when layers are limited but situation changes rapidly when more layers are added and a deeper architecture network is built. Due to the vanishing gradient and complexity issues, it makes it more challenging to train neural networks, which makes training deeper neural networks more time consuming and resource intensive. When residual blocks are added to neural networks, training becomes more effective even with more complex architecture. Due to skip connections linked to the layers of artificial neural networks, which improves residual network (ResNet) efficiency, otherwise it was a time consuming procedure. The implantation of residual networks, their operation, formulae, and the solution to the vanishing gradient problem are the topics of this study. It is observed that because of ResNet, the model obtains good accuracy on image recognition task, and it is easier to optimize. In this study, ResNet is tested on the CIFAR-10 dataset, which has a depth of 34 layers and is both, more dense than VGG nets and less complicated. ResNet achieves error rates of up to 20% on the CIFAR-10 test dataset after constructing this architecture, which takes 80 epochs. More epochs can decrease the error further. The outcomes of ResNet and its corresponding convolutional network (ConvNet) without skip connection are compared. The findings indicate that ResNet offers more accuracy but is more prone to overfitting. To improve accuracy, overfitting prevention techniques including stochastic augmentation on training datasets and the addition of dropout layers in networks have been used.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 105
  • 10.3390/rs13020294
DR-Net: An Improved Network for Building Extraction from High Resolution Remote Sensing Image
  • Jan 15, 2021
  • Remote Sensing
  • Meng Chen + 7 more

At present, convolutional neural networks (CNN) have been widely used in building extraction from remote sensing imagery (RSI), but there are still some bottlenecks. On the one hand, there are so many parameters in the previous network with complex structure, which will occupy lots of memories and consume much time during training process. On the other hand, low-level features extracted by shallow layers and abstract features extracted by deep layers of artificial neural network cannot be fully fused, which leads to an inaccurate building extraction from RSI. To alleviate these disadvantages, a dense residual neural network (DR-Net) was proposed in this paper. DR-Net uses a deeplabv3+Net encoder/decoder backbone, in combination with densely connected convolution neural network (DCNN) and residual network (ResNet) structure. Compared with deeplabv3+net (containing about 41 million parameters) and BRRNet (containing about 17 million parameters), DR-Net contains about 9 million parameters; So, the number of parameters reduced a lot. The experimental results for both the WHU Building Dataset and Massachusetts Building Dataset, DR-Net show better performance in building extraction than other two state-of-the-art methods. Experiments on WHU building data set showed that Intersection over Union (IoU) increased by 2.4% and F1 score increased by 1.4%; in terms of Massachusetts Building Dataset, IoU increased by 3.8% and F1 score increased by 2.9%.

  • Book Chapter
  • Cite Count Icon 2
  • 10.1007/978-3-030-93247-3_18
Tree-Like Branching Network for Multi-class Classification
  • Jan 1, 2022
  • Mengqi Xue + 3 more

In multi-task learning, network branching, i.e. specializing branches for different tasks on top of a shared truck, has been a golden rule. In multi-class classification task, however, previous work usually arranges all categories at the last layer in deep neural networks, which implies that all the layers are shared by these categories regardless of their varying relationships. In this paper, we study how to convert a trained typical neural network into a branching network where layers are properly shared or specialized for the involved categories. We propose a three-step branching strategy, dubbed as Tree-Like Branching (TLB), to exploit network sharing and branching for multi-class classification. TLB first mines inherent category relationships from a trained neural network in a layer-wise manner. Then it determines the appropriate layer in the network on which specialized branches grow to reconcile the conflicting decision patterns of different categories. Finally TLB adopts knowledge distillation to train the derived branching network. Experiments on widely used benchmarks show that the derived tree-like network from TLB achieves higher accuracy and lower cost compared to prior models, meanwhile exhibiting better interpretability.KeywordsMulti-task learningMulti-class classificationDeep neural networkKnowledge distillation

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 13
  • 10.3390/a14040107
Identification of Intrinsically Disordered Protein Regions Based on Deep Neural Network-VGG16
  • Mar 28, 2021
  • Algorithms
  • Pengchang Xu + 2 more

The accurate of i identificationntrinsically disordered proteins or protein regions is of great importance, as they are involved in critical biological process and related to various human diseases. In this paper, we develop a deep neural network that is based on the well-known VGG16. Our deep neural network is then trained through using 1450 proteins from the dataset DIS1616 and the trained neural network is tested on the remaining 166 proteins. Our trained neural network is also tested on the blind test set R80 and MXD494 to further demonstrate the performance of our model. The MCC value of our trained deep neural network is 0.5132 on the test set DIS166, 0.5270 on the blind test set R80 and 0.4577 on the blind test set MXD494. All of these MCC values of our trained deep neural network exceed the corresponding values of existing prediction methods.

  • Conference Article
  • Cite Count Icon 21
  • 10.23919/eusipco.2017.8081266
Residual neural networks for speech recognition
  • Aug 1, 2017
  • Hari Krishna Vydana + 1 more

Recent developments in deep learning methods have greatly influenced the performances of speech recognition systems. In a Hidden Markov model-Deep neural network (HMM-DNN) based speech recognition system, DNNs have been employed to model senones (context dependent states of HMM), where HMMs capture the temporal relations among senones. Due to the use of more deeper networks significant improvement in the performances has been observed and developing deep learning methods to train more deeper architectures has gained a lot of scientific interest. Optimizing a deeper network is more complex task than to optimize a less deeper network, but recently residual network have exhibited a capability to train a very deep neural network architectures and are not prone to vanishing/exploding gradient problems. In this work, the effectiveness of residual networks have been explored for of speech recognition. Along with the depth of the residual network, the criticality of width of the residual network has also been studied. It has been observed that at higher depth, width of the networks is also a crucial parameter for attaining significant improvements. A 14-hour subset of WSJ corpus is used for training the speech recognition systems, it has been observed that the residual networks have shown much ease in convergence even with a depth much higher than the deep neural network. In this work, using residual networks an absolute reduction of 0.4 in WER error rates (8% reduction in the relative error) is attained compared to the best performing deep neural network.

  • Research Article
  • Cite Count Icon 29
  • 10.1093/gigascience/giy130
Transfer learning improves resting-state functional connectivity pattern analysis using convolutional neural networks
  • Nov 5, 2018
  • GigaScience
  • Pál Vakli + 3 more

BackgroundDeep learning is gaining importance in the prediction of cognitive states and brain pathology based on neuroimaging data. Including multiple hidden layers in artificial neural networks enables unprecedented predictive power; however, the proper training of deep neural networks requires thousands of exemplars. Collecting this amount of data is not feasible in typical neuroimaging experiments. A handy solution to this problem, which has largely fallen outside the scope of deep learning applications in neuroimaging, is to repurpose deep networks that have already been trained on large datasets by fine-tuning them to target datasets/tasks with fewer exemplars. Here, we investigated how this method, called transfer learning, can aid age category classification and regression based on brain functional connectivity patterns derived from resting-state functional magnetic resonance imaging. We trained a connectome-convolutional neural network on a larger public dataset and then examined how the knowledge learned can be used effectively to perform these tasks on smaller target datasets collected with a different type of scanner and/or imaging protocol and pre-processing pipeline.ResultsAge classification on the target datasets benefitted from transfer learning. Significant improvement (∼9%–13% increase in accuracy) was observed when the convolutional layers’ weights were initialized based on the values learned on the public dataset and then fine-tuned to the target datasets. Transfer learning also appeared promising in improving the otherwise poor prediction of chronological age.ConclusionsTransfer learning is a plausible solution to adapt convolutional neural networks to neuroimaging data with few exemplars and different data acquisition and pre-processing protocols.

  • Research Article
  • Cite Count Icon 17
  • 10.1016/j.csite.2024.104208
The significance of radiative heat and mass transfer through a vertical sheet with chemical reaction: Designing by artificial approach Levenberg-Marquardt
  • Mar 7, 2024
  • Case Studies in Thermal Engineering
  • J.G Al-Juaid + 6 more

The significance of radiative heat and mass transfer through a vertical sheet with chemical reaction: Designing by artificial approach Levenberg-Marquardt

  • Conference Article
  • Cite Count Icon 2
  • 10.1145/3613424.3623779
ADA-GP: Accelerating DNN Training By Adaptive Gradient Prediction
  • Oct 28, 2023
  • Vahid Janfaza + 3 more

Neural network training is inherently sequential where the layers finish the forward propagation in succession, followed by the calculation and back-propagation of gradients (based on a loss function) starting from the last layer. The sequential computations significantly slow down neural network training, especially the deeper ones. Prediction has been successfully used in many areas of computer architecture to speed up sequential processing. Therefore, we propose ADA-GP, which uses gradient prediction adaptively to speed up deep neural network (DNN) training while maintaining accuracy. ADA-GP works by incorporating a small neural network to predict gradients for different layers of a DNN model. ADA-GP uses a novel tensor reorganization method to make it feasible to predict a large number of gradients. ADA-GP alternates between DNN training using backpropagated gradients and DNN training using predicted gradients. ADA-GP adaptively adjusts when and for how long gradient prediction is used to strike a balance between accuracy and performance. Last but not least, we provide a detailed hardware extension in a typical DNN accelerator to realize the speed up potential from gradient prediction. Our extensive experiments with fifteen DNN models show that ADA-GP can achieve an average speed up of 1.47 × with similar or even higher accuracy than the baseline models. Moreover, it consumes, on average, 34% less energy due to reduced off-chip memory accesses compared to the baseline accelerator.

  • Research Article
  • 10.30917/att-vk-1814-9588-2023-1-4
Diagnosis of dermatophytosis in cats using artificial neural networks
  • Feb 1, 2023
  • Veterinaria i kormlenie
  • А.А Bushmina + 2 more

The purpose of the research, the results of which are presented in this article, is to determine the possibility and evaluate the effectiveness of using a trained neural network in the diagnosis of ringworm. The article provides an analysis of the methods used for diagnosing dermatomycosis in veterinary practice. One of the actively developing areas at present is the use of artificial neural networks in the diagnosis of animal diseases. The authors have developed a method for diagnosing dermatophytosis using a trained neural network. To identify hair damaged by dermatophyte spores in cats, a trained artificial neural network YOLO v5 was used, based on the YOLO architecture (high-precision artificial neural network), which provides high accuracy and speed of object detection in images. Diagnostics was carried out in three stages. The first stage: the diagnosis of hair in cats damaged by dermatophyte spores was carried out using a trained artificial neural network. The second stage: microscopy by a veterinary specialist of the veterinary center. The third stage: comparison of the received data from the trained artificial neural network and veterinary specialists. Three comparative experiments were carried out on 20 depersonalized samples with different ratios from healthy and sick animals. As a result of testing the trichoscopy method using artificial neural networks for diagnosing spore-damaged hair dermatitis in cats, it was found that a trained artificial neural network of 60 studied samples diagnosed dermatophyte spore damage in 20 samples, a veterinarian - in 17. All positive results were confirmed by a mycological laboratory study. and identification of the pathogen. It has been established that the use of a trained artificial neural network increases the diagnostic efficiency by 15% and reduces the time to perform diagnostic microscopy by 60.3%. The application of the proposed method allows to reduce the time of microscopic examination, improve the accuracy of interpretation of the results, automate methods for identifying causative agents of ringworm in small animals and take timely measures to treat the animal.

  • Research Article
  • Cite Count Icon 1
  • 10.3934/mfc.2020024
Fixed-point algorithms for inverse of residual rectifier neural networks
  • Oct 26, 2020
  • Mathematical Foundations of Computing
  • Ruhua Wang + 3 more

<p style='text-indent:20px;'>A deep neural network with invertible hidden layers has a nice property of preserving all the information in the feature learning stage. In this paper, we analyse the hidden layers of residual rectifier neural networks, and investigate conditions for invertibility under which the hidden layers are invertible. A new fixed-point algorithm is developed to invert the hidden layers of residual networks. The proposed inverse algorithms are capable of inverting some residual networks which cannot be inverted by existing inverting algorithms. Furthermore, a special residual rectifier network is designed and trained on MNIST so that it can achieve comparable performance with the state-of-art performance while its hidden layers are invertible.

  • Research Article
  • Cite Count Icon 180
  • 10.1093/bioinformatics/btz291
ResPRE: high-accuracy protein contact prediction by coupling precision matrix with deep residual neural networks.
  • May 9, 2019
  • Bioinformatics
  • Yang Li + 4 more

Contact-map of a protein sequence dictates the global topology of structural fold. Accurate prediction of the contact-map is thus essential to protein 3D structure prediction, which is particularly useful for the protein sequences that do not have close homology templates in the Protein Data Bank. We developed a new method, ResPRE, to predict residue-level protein contacts using inverse covariance matrix (or precision matrix) of multiple sequence alignments (MSAs) through deep residual convolutional neural network training. The approach was tested on a set of 158 non-homologous proteins collected from the CASP experiments and achieved an average accuracy of 50.6% in the top-L long-range contact prediction with L being the sequence length, which is 11.7% higher than the best of other state-of-the-art approaches ranging from coevolution coupling analysis to deep neural network training. Detailed data analyses show that the major advantage of ResPRE lies at the utilization of precision matrix that helps rule out transitional noises of contact-maps compared with the previously used covariance matrix. Meanwhile, the residual network with parallel shortcut layer connections increases the learning ability of deep neural network training. It was also found that appropriate collection of MSAs can further improve the accuracy of final contact-map predictions. The standalone package and online server of ResPRE are made freely available, which should bring important impact on protein structure and function modeling studies in particular for the distant- and non-homology protein targets. https://zhanglab.ccmb.med.umich.edu/ResPRE and https://github.com/leeyang/ResPRE. Supplementary data are available at Bioinformatics online.

  • Conference Article
  • Cite Count Icon 34
  • 10.1117/12.2216555
Deep learning in the small sample size setting: cascaded feed forward neural networks for medical image segmentation
  • Mar 24, 2016
  • Proceedings of SPIE, the International Society for Optical Engineering/Proceedings of SPIE
  • Bilwaj Gaonkar + 3 more

Deep Learning, refers to large set of neural network based algorithms, have emerged as promising machine- learning tools in the general imaging and computer vision domains. Convolutional neural networks (CNNs), a specific class of deep learning algorithms, have been extremely effective in object recognition and localization in natural images. A characteristic feature of CNNs, is the use of a locally connected multi layer topology that is inspired by the animal visual cortex (the most powerful vision system in existence). While CNNs, perform admirably in object identification and localization tasks, typically require training on extremely large datasets. Unfortunately, in medical image analysis, large datasets are either unavailable or are extremely expensive to obtain. Further, the primary tasks in medical imaging are organ identification and segmentation from 3D scans, which are different from the standard computer vision tasks of object recognition. Thus, in order to translate the advantages of deep learning to medical image analysis, there is a need to develop deep network topologies and training methodologies, that are geared towards medical imaging related tasks and can work in a setting where dataset sizes are relatively small. In this paper, we present a technique for stacked supervised training of deep feed forward neural networks for segmenting organs from medical scans. Each `neural network layer' in the stack is trained to identify a sub region of the original image, that contains the organ of interest. By layering several such stacks together a very deep neural network is constructed. Such a network can be used to identify extremely small regions of interest in extremely large images, inspite of a lack of clear contrast in the signal or easily identifiable shape characteristics. What is even more intriguing is that the network stack achieves accurate segmentation even when it is trained on a single image with manually labelled ground truth. We validate this approach,using a publicly available head and neck CT dataset. We also show that a deep neural network of similar depth, if trained directly using backpropagation, cannot acheive the tasks achieved using our layer wise training paradigm.

  • Research Article
  • Cite Count Icon 24
  • 10.1109/tsmc.2022.3221843
Congestive Heart Failure Detection From ECG Signals Using Deep Residual Neural Network
  • May 1, 2023
  • IEEE Transactions on Systems, Man, and Cybernetics: Systems
  • Eedara Prabhakararao + 1 more

The early and accurate detection of congestive heart failure (CHF) using an electrocardiogram (ECG) is of great significance for improving the survival rate of patients. Existing approaches show limited detection accuracy as they fail to capture the temporal ECG dynamics. Also, these methods lack model transparency and are often difficult to interpret. This article proposes a novel end-to-end diagnostic attention-based deep residual recurrent neural network (DA-DRRNet) that effectively captures the temporal dynamics and extracts high-level attentive representations for accurate CHF detection. Specifically, we first employ a recurrent neural network (RNN) layer to encode the temporal dynamics from the raw ECG beats. Then, multilayered RNNs with residual connections are incorporated to extract high-level feature representations hierarchically. The residual connections allow gradients in deep RNN to propagate effectively, thereby improving the network’s representation ability. Finally, an attention module identifies the hidden vectors corresponding to the diagnostically prominent ECG characteristics to form an attentive representation for improved CHF detection. Using ECG signals from the three publicly available datasets (BIDMC-CHF, PTBDB, and MIT-BIH NSRDB), the proposed method achieves an impressive accuracy of 98.57 <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$\%$</tex-math> </inline-formula> and nearly 100 <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$\%$</tex-math> </inline-formula> for beat-level and 24-h record-level diagnosis, respectively. Notably, the analysis of learned attention weights demonstrates that the proposed model focuses on the clinically relevant ECG features that characterize CHF. This model transparency and improved detection results advance research in this field and provide a reliable and transparent diagnostic system for CHF analysis.

Save Icon
Up Arrow
Open/Close
Notes

Save Important notes in documents

Highlight text to save as a note, or write notes directly

You can also access these Documents in Paperpal, our AI writing tool

Powered by our AI Writing Assistant