Identification of Intrinsically Disordered Protein Regions Based on Deep Neural Network-VGG16

Abstract

The accurate identification of intrinsically disordered proteins or protein regions is of great importance, as they are involved in critical biological processes and are related to various human diseases. In this paper, we develop a deep neural network based on the well-known VGG16 architecture. The network is trained on 1450 proteins from the DIS1616 dataset and tested on the remaining 166 proteins (the test set DIS166). It is also tested on the blind test sets R80 and MXD494 to further demonstrate its performance. The Matthews correlation coefficient (MCC) of the trained network is 0.5132 on DIS166, 0.5270 on R80, and 0.4577 on MXD494. All of these MCC values exceed the corresponding values of existing prediction methods.
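
Since the results above are reported as MCC values, a small sketch of how the Matthews correlation coefficient is computed for per-residue binary labels (1 = disordered, 0 = ordered) may help. This uses the standard MCC formula; the function names and the toy labels are illustrative assumptions and are not taken from the paper.

```python
import math


def confusion_counts(y_true, y_pred):
    """Count TP, TN, FP, FN for binary labels (1 = disordered, 0 = ordered)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient; returns 0.0 when the denominator is zero."""
    numerator = tp * tn - fp * fn
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return numerator / denominator if denominator else 0.0


# Hypothetical per-residue labels, for illustration only.
y_true = [1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 1, 0]
score = mcc(*confusion_counts(y_true, y_pred))
```

MCC ranges from -1 (total disagreement) through 0 (no better than chance) to 1 (perfect prediction), which makes it a common choice when the ordered and disordered classes are imbalanced.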

Similar Papers
  • Conference Article
  • Cite Count Icon 2
  • 10.1145/3613424.3623779
ADA-GP: Accelerating DNN Training By Adaptive Gradient Prediction
  • Oct 28, 2023
  • Vahid Janfaza + 3 more

Neural network training is inherently sequential: the layers finish forward propagation in succession, followed by the calculation and back-propagation of gradients (based on a loss function) starting from the last layer. These sequential computations significantly slow down neural network training, especially for deeper networks. Prediction has been successfully used in many areas of computer architecture to speed up sequential processing. Therefore, we propose ADA-GP, which uses gradient prediction adaptively to speed up deep neural network (DNN) training while maintaining accuracy. ADA-GP works by incorporating a small neural network to predict gradients for different layers of a DNN model. ADA-GP uses a novel tensor reorganization method to make it feasible to predict a large number of gradients. ADA-GP alternates between DNN training using backpropagated gradients and DNN training using predicted gradients. ADA-GP adaptively adjusts when and for how long gradient prediction is used to strike a balance between accuracy and performance. Last but not least, we provide a detailed hardware extension in a typical DNN accelerator to realize the speedup potential of gradient prediction. Our extensive experiments with fifteen DNN models show that ADA-GP can achieve an average speedup of 1.47× with similar or even higher accuracy than the baseline models. Moreover, it consumes, on average, 34% less energy due to reduced off-chip memory accesses compared to the baseline accelerator.

  • Research Article
  • Cite Count Icon 21
  • 10.1016/j.vlsi.2020.05.002
Logarithm-approximate floating-point multiplier is applicable to power-efficient neural network training
  • May 14, 2020
  • Integration
  • Taiyu Cheng + 4 more

  • Research Article
  • Cite Count Icon 172
  • 10.1109/tai.2021.3067574
Neuroevolution in Deep Neural Networks: Current Trends and Future Challenges
  • May 4, 2021
  • IEEE Transactions on Artificial Intelligence
  • Edgar Galván + 1 more

A variety of methods have been applied to the architectural configuration and learning or training of artificial deep neural networks (DNN). These methods play a crucial role in the success or failure of the DNN for most problems and applications. Evolutionary algorithms (EAs) are gaining momentum as a computationally feasible method for the automated optimization of DNNs. Neuroevolution is the term that describes these processes of automated configuration and training of DNNs using EAs. While many works exist in the literature, no comprehensive surveys currently exist focusing exclusively on the strengths and limitations of using neuroevolution approaches in DNNs. The absence of such surveys can lead to a disjointed and fragmented field, preventing DNN researchers from potentially adopting neuroevolutionary methods in their own research and resulting in lost opportunities for wider application within real-world deep learning problems. This article presents a comprehensive survey, discussion, and evaluation of the state-of-the-art in using EAs for architectural configuration and training of DNNs. This article highlights the most pertinent current issues and challenges in neuroevolution and identifies multiple promising future research directions.
Impact Statement: The concept of deep learning originated from the study of artificial neural networks (ANNs). ANNs have achieved extraordinary results in a variety of diverse application areas. Numerous methods have been applied to the architectural configuration and learning or training of artificial DNN and these methods play a crucial role in the success or failure of the DNN for most problems and applications. Recently, EAs have been gaining momentum as a computationally feasible method (called neuroevolution) for the automated configuration and learning or training of DNNs. This article reviews over 170 recent scientific papers describing how major EA paradigms are being applied by researchers to the configuration and optimization of multiple DNNs. By articulating a clear understanding of the context, state-of-the-art, and feasibility of neuroevolution, researchers in AI, EAs, and DNN will benefit from this article. The impact of this article comes from contributing toward enhancing research capacity, knowledge, and skills for researchers currently working in neuroevolution and actively engaging those considering becoming involved in this area.

  • Research Article
  • Cite Count Icon 42
  • 10.1016/j.jco.2020.101540
Non-convergence of stochastic gradient descent in the training of deep neural networks
  • Nov 27, 2020
  • Journal of Complexity
  • Patrick Cheridito + 2 more

  • Conference Article
  • Cite Count Icon 8
  • 10.1109/icpads47876.2019.00068
A Framework for Distributed Deep Neural Network Training with Heterogeneous Computing Platforms
  • Dec 1, 2019
  • Bontak Gu + 3 more

Deep neural network (DNN) training is generally performed by cloud computing platforms. However, cloud-based training has several problems such as network bottleneck, server management cost, and privacy. To overcome these problems, one of the most promising solutions is distributed DNN model training which trains the model with not only high-performance servers but also low-end power-efficient mobile edge or user devices. However, due to the lack of a framework which can provide an optimal cluster configuration (i.e., determining which computing devices participate in DNN training tasks), it is difficult to perform efficient DNN model training considering DNN service providers' preferences such as training time or energy efficiency. In this paper, we introduce a novel framework for distributed DNN training that determines the best training cluster configuration with available heterogeneous computing resources. Our proposed framework utilizes pre-training with a small number of training steps and estimates training time, power, energy, and energy-delay product (EDP) for each possible training cluster configuration. Based on the estimated metrics, our framework performs DNN training for the remaining steps with the chosen best cluster configurations depending on DNN service providers' preferences. Our framework is implemented in TensorFlow and evaluated with three heterogeneous computing platforms and five widely used DNN models. According to our experimental results, in 76.67% of the cases, our framework chooses the best cluster configuration depending on DNN service providers' preferences with only a small training time overhead.
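
The selection step described in this abstract (estimate time, power, energy, and EDP per cluster configuration, then pick the best one for a stated preference) can be sketched as follows. This is only an illustration under the usual definitions energy = power × time and EDP = energy × time; the configuration names and numbers are hypothetical, and the framework's actual estimation procedure is not reproduced here.

```python
def estimate_metrics(time_s, power_w):
    """Derive energy (J) and energy-delay product (J*s) from time and average power."""
    energy_j = power_w * time_s
    return {
        "time": time_s,
        "power": power_w,
        "energy": energy_j,
        "edp": energy_j * time_s,
    }


def choose_configuration(estimates, preference):
    """Return the configuration name that minimizes the preferred metric."""
    return min(estimates, key=lambda name: estimates[name][preference])


# Hypothetical estimates (training time in seconds, average power in watts).
estimates = {
    "server-only": estimate_metrics(time_s=100.0, power_w=300.0),
    "server+edge": estimate_metrics(time_s=80.0, power_w=420.0),
    "edge-only": estimate_metrics(time_s=400.0, power_w=30.0),
}

fastest = choose_configuration(estimates, "time")
most_frugal = choose_configuration(estimates, "energy")
best_edp = choose_configuration(estimates, "edp")
```

With these made-up numbers the fastest configuration and the most energy-efficient one differ, which is the situation where a provider's stated preference actually matters.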

  • Research Article
  • Cite Count Icon 1
  • 10.14429/dsj.74.19475
Accelerating deep network training for radar identification using batch normalization
  • Nov 25, 2024
  • Defence Science Journal
  • Preeti Gupta + 2 more

Deep learning techniques have shown remarkable success in radar identification. However, deep neural network training can be time and resource intensive. Batch normalization is a popular approach for speeding up the training of deep feed-forward neural networks. It accelerates training by reducing internal covariate shift and stabilizes the training process by normalizing the intermediate activations within each mini-batch. In this research, the convergence behavior of networks with and without batch normalization is compared. Batch normalization standardizes the input to a layer for each mini-batch, and can be applied either to the activations of a prior layer or to the inputs directly. Our experiments indicate that batch normalization is effective in improving a variety of neural network properties. The results show that batch-normalized models have higher test and validation accuracies across all datasets, which we attribute to their regularizing impact and steadier gradient propagation. This research also examines the impact of several parameters, such as batch size, momentum, and the beta and gamma parameters, on the effectiveness of DNNs with batch normalization. The radar dataset used for training is the fused emitter set obtained after feature-level fusion of the tracks intercepted by ESM (Electronic Support) and ELINT (Electronic Intelligence) systems.
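
As a rough illustration of the normalization step this abstract describes, the sketch below applies the standard training-mode batch normalization formula to one mini-batch: each feature is standardized with the mini-batch mean and variance, then scaled by gamma and shifted by beta. It is a plain-Python sketch with made-up inputs, not the authors' code, and it omits the running statistics used at inference time.

```python
import math


def batch_norm(batch, gamma, beta, eps=1e-5):
    """Normalize each feature over the mini-batch, then scale and shift.

    batch: list of samples, each a list of feature values.
    gamma, beta: per-feature scale and shift parameters (learned in practice).
    """
    n = len(batch)
    d = len(batch[0])
    means = [sum(row[j] for row in batch) / n for j in range(d)]
    variances = [sum((row[j] - means[j]) ** 2 for row in batch) / n for j in range(d)]
    return [
        [
            gamma[j] * (row[j] - means[j]) / math.sqrt(variances[j] + eps) + beta[j]
            for j in range(d)
        ]
        for row in batch
    ]


# Hypothetical mini-batch of three samples with two features on very different scales.
batch = [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]]
normalized = batch_norm(batch, gamma=[1.0, 1.0], beta=[0.0, 0.0])
```

After normalization both features have roughly zero mean and unit variance within the mini-batch, regardless of their original scale.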

  • Research Article
  • 10.30917/att-vk-1814-9588-2023-1-4
Diagnosis of dermatophytosis in cats using artificial neural networks
  • Feb 1, 2023
  • Veterinaria i kormlenie
  • А.А Bushmina + 2 more

The purpose of the research presented in this article is to determine the feasibility and evaluate the effectiveness of using a trained neural network in the diagnosis of ringworm. The article provides an analysis of the methods used for diagnosing dermatomycosis in veterinary practice. One actively developing area at present is the use of artificial neural networks in the diagnosis of animal diseases. The authors have developed a method for diagnosing dermatophytosis using a trained neural network. To identify hair damaged by dermatophyte spores in cats, a trained artificial neural network, YOLO v5, was used; it is based on the YOLO architecture (a high-precision artificial neural network), which provides high accuracy and speed of object detection in images. Diagnostics were carried out in three stages. The first stage: hair damaged by dermatophyte spores in cats was diagnosed using the trained artificial neural network. The second stage: microscopy by a veterinary specialist of the veterinary center. The third stage: comparison of the data obtained from the trained artificial neural network and from the veterinary specialists. Three comparative experiments were carried out on 20 depersonalized samples with different ratios from healthy and sick animals. As a result of testing the trichoscopy method using artificial neural networks for diagnosing spore-damaged hair in cats, it was found that, of 60 studied samples, the trained artificial neural network diagnosed dermatophyte spore damage in 20 samples and a veterinarian in 17. All positive results were confirmed by a mycological laboratory study and identification of the pathogen. It has been established that the use of a trained artificial neural network increases diagnostic efficiency by 15% and reduces the time to perform diagnostic microscopy by 60.3%. The application of the proposed method makes it possible to reduce the time of microscopic examination, improve the accuracy of interpretation of the results, automate methods for identifying causative agents of ringworm in small animals, and take timely measures to treat the animal.

  • Conference Article
  • Cite Count Icon 30
  • 10.21437/interspeech.2013-734
Investigations on hessian-free optimization for cross-entropy training of deep neural networks
  • Aug 25, 2013
  • Simon Wiesler + 2 more

Context-dependent deep neural network HMMs have been shown to achieve recognition accuracy superior to Gaussian mixture models in a number of recent works. Typically, neural networks are optimized with stochastic gradient descent. On large datasets, stochastic gradient descent improves quickly during the beginning of the optimization. But since it does not make use of second order information, its asymptotic convergence behavior is slow. In regions with pathological curvature, stochastic gradient descent may almost stagnate and thereby falsely indicate convergence. Another drawback of stochastic gradient descent is that it can only be parallelized within minibatches. The Hessian-free algorithm is a second order batch optimization algorithm that does not suffer from these problems. In a recent work, Hessian-free optimization has been applied to a training of deep neural networks according to a sequence criterion. In that work, improvements in accuracy and training time have been reported. In this paper, we analyze the properties of the Hessian-free optimization algorithm and investigate whether it is suited for cross-entropy training of deep neural networks as well.

  • Conference Article
  • Cite Count Icon 4
  • 10.1145/3629526.3645035
MalleTrain: Deep Neural Networks Training on Unfillable Supercomputer Nodes
  • May 7, 2024
  • Xiaolong Ma + 6 more

First-come first-serve scheduling can result in a substantial fraction (up to 10%) of transiently idle nodes on supercomputers. Recognizing that such unfilled nodes are well-suited for deep neural network (DNN) training, due to the flexible nature of DNN training tasks, Liu et al. proposed that the re-scaling of DNN training tasks to fit gaps in schedules be formulated as a mixed-integer linear programming (MILP) problem, and demonstrated via simulation the potential benefits of the approach. Here, we introduce MalleTrain, a system that provides the first practical implementation of this approach and that furthermore generalizes it by allowing it to be used even for DNN training applications for which model information is unknown before runtime. Key to this latter innovation is the use of a lightweight online job profiling advisor (JPA) to collect critical scalability information for DNN jobs, information that it then employs to optimize resource allocations dynamically, in real time. We describe the MalleTrain architecture and present the results of a detailed experimental evaluation on a supercomputer GPU cluster and several representative DNN training workloads, including neural architecture search and hyperparameter optimization. Our results not only confirm the practical feasibility of leveraging idle supercomputer nodes for DNN training but improve significantly on prior results, improving training throughput by up to 22.3% without requiring users to provide job scalability information.

  • Book Chapter
  • Cite Count Icon 1
  • 10.1007/978-3-030-11479-4_13
Why Dose Layer-by-Layer Pre-training Improve Deep Neural Networks Learning?
  • Jan 1, 2019
  • Seyyede Zohreh Seyyedsalehi + 1 more

Deep perceptron neural networks are capable of implementing a hierarchy of successive nonlinear conversions. But training these neural networks by conventional learning methods such as the error back-propagation is faced with serious obstacles owing to local minima. The layer-by-layer pre-training method has been recently proposed for training these neural networks and has shown considerable performance. In the pre-training method, the complex problem of training deep neural networks is broken down into some simple sub-problems in which some corresponding single-hidden-layer neural networks are trained through the error back-propagation algorithm. In this chapter, the theoretical principles regarding how this method effectively improves the training of deep neural networks are discussed, and the maximum discrimination theory is proposed as a proper framework for analysis of training convergence in these neural networks. Subsequently, discriminations of inputs in different layers of two similar deep neural networks, one of which is directly trained through the conventional error back-propagation algorithm and the other through layer-by-layer pre-training method, are compared, and results confirm the validity of the proposed framework.

  • Conference Article
  • Cite Count Icon 91
  • 10.1109/iccv.2017.305
Centered Weight Normalization in Accelerating Training of Deep Neural Networks
  • Oct 1, 2017
  • Lei Huang + 4 more

Training deep neural networks is difficult because of the pathological curvature problem. Re-parameterization is an effective way to relieve the problem by learning the curvature approximately or constraining the solutions of weights with good properties for optimization. This paper proposes to reparameterize the input weight of each neuron in deep neural networks by normalizing it to zero mean and unit norm, followed by a learnable scalar parameter to adjust the norm of the weight. This technique effectively stabilizes the distribution implicitly. Besides, it improves the conditioning of the optimization problem and thus accelerates the training of deep neural networks. It can be wrapped as a linear module in practice and plugged into any architecture to replace the standard linear module. We highlight the benefits of our method on both multi-layer perceptrons and convolutional neural networks, and demonstrate its scalability and efficiency on the SVHN, CIFAR-10, CIFAR-100 and ImageNet datasets.
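
The reparameterization described in this abstract can be sketched for a single neuron as follows: the raw weight vector is centered to zero mean, scaled to unit norm, and then multiplied by a scalar that sets the final norm. The function name, the `eps` guard, and the example values are assumptions for illustration; in the paper the scalar is learned and the operation is wrapped in a linear module.

```python
import math


def centered_weight_norm(v, g=1.0, eps=1e-12):
    """Return g * (v - mean(v)) / ||v - mean(v)|| for one neuron's input weights."""
    mean = sum(v) / len(v)
    centered = [x - mean for x in v]
    norm = math.sqrt(sum(c * c for c in centered))
    # eps guards against division by zero when all entries of v are equal.
    return [g * c / (norm + eps) for c in centered]


# Hypothetical raw weights for one neuron; g plays the role of the learnable scalar.
w = centered_weight_norm([1.0, 2.0, 3.0, 6.0], g=2.0)
```

The resulting weights sum to approximately zero and have norm approximately `g`, whatever the scale or offset of the raw vector.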

  • Conference Article
  • Cite Count Icon 705
  • 10.21437/interspeech.2013-548
Sequence-discriminative training of deep neural networks
  • Aug 25, 2013
  • Karel Veselý + 3 more

Sequence-discriminative training of deep neural networks (DNNs) is investigated on a standard 300 hour American English conversational telephone speech task. Different sequence-discriminative criteria are compared: maximum mutual information (MMI), minimum phone error (MPE), state-level minimum Bayes risk (sMBR), and boosted MMI. Two different heuristics are investigated to improve the performance of the DNNs trained using sequence-based criteria: lattices are regenerated after the first iteration of training; and, for MMI and BMMI, the frames where the numerator and denominator hypotheses are disjoint are removed from the gradient computation. Starting from a competitive DNN baseline trained using cross-entropy, different sequence-discriminative criteria are shown to lower word error rates by 7-9% relative, on average. Little difference is noticed between the different sequence-based criteria that are investigated. The experiments are done using the open-source Kaldi toolkit, which makes it possible for the wider community to reproduce these results. Index Terms: speech recognition, deep learning, sequence-criterion training, neural networks, reproducible research

  • Conference Article
  • Cite Count Icon 11
  • 10.1109/iws52775.2021.9499638
Training of Deep Neural Networks in Electromagnetic Problems: a Case Study of Antenna Array Pattern Synthesis
  • May 23, 2021
  • 2021 IEEE MTT-S International Wireless Symposium (IWS)
  • Zhao Zhou + 7 more

This paper discusses the training of deep neural networks (DNNs) for electromagnetic problems. The main concerns include how to modify EM problems to take advantage of deep learning techniques and how to tailor conventional deep learning concepts with electromagnetic domain knowledge, which has been overlooked by most existing DNN-based EM research. A 1×8 patch antenna array has been adopted as the test vehicle for investigation, with the aim of using deep learning for radiation pattern synthesis. It is first analyzed via electromagnetic simulation to collect sufficient training data sets containing different combinations of excitation signals and corresponding radiation patterns. These data are then pre-processed and passed to DNNs for training to imitate the mapping between excitation signals and radiation patterns. With careful feature selection and DNN architecture optimizations, two DNN models are eventually obtained. One of them aims at forward radiation synthesis for any given excitation condition, and the other seeks the backward excitation signals needed for a given radiation pattern; both achieved an accuracy over 80%. This paper may provide insight and a reference for applying deep learning to electromagnetic problems in terms of feature selection and architecture modification.

  • Research Article
  • Cite Count Icon 6
  • 10.1109/tifs.2023.3273169
A Guessing Entropy-Based Framework for Deep Learning-Assisted Side-Channel Analysis
  • Jan 1, 2023
  • IEEE Transactions on Information Forensics and Security
  • Ziyue Zhang + 2 more

Recently deep-learning (DL) techniques have been widely adopted in side-channel power analysis. A DL-assisted SCA generally consists of two phases: a deep neural network (DNN) training phase and a follow-on attack phase using the trained DNN. However, currently the two phases are not well aligned, as there is no conclusion on what metric used in the training can result in the most effective attack in the second phase. When traditional loss functions such as negative log-likelihood (NLL) are used in training a DNN, the trained model does not yield optimal follow-on attack. Recently some information theoretical SCA leakage metrics are proposed, either as the validation metric to stop the DNN training with traditional loss functions, or as both the validation metric and the training loss function. None of those proposed metrics, however, directly measures the SCA effectiveness. We propose to conduct DNN training directly with a common SCA effectiveness metric, Guessing Entropy (GE). We overcome the prior practical difficulty of using GE in DNN training by utilizing the GEEA estimation algorithm introduced in CHES 2020. We show that using GEEA as either the validation metric or the loss function produces DNN models that lead to much more effective follow-on attacks. Our work consolidates the DL-assisted SCA framework with a consistent metric, which shows great potential to be adopted as the universal SCA-oriented DNN training framework.

  • Conference Article
  • Cite Count Icon 2
  • 10.1109/cogmi52975.2021.00044
On Detection of Out of Distribution Inputs in Deep Neural Networks
  • Dec 1, 2021
  • Susmit Jha + 1 more

Deep neural networks (DNNs) have achieved near-human-level accuracy on many datasets across different domains. But they are known to produce incorrect predictions with high confidence on out-of-distribution (OOD) inputs. This challenge has limited the adoption of deep learning models in high-assurance systems such as autonomous driving, air traffic management, and medical diagnosis. The problem of detecting when an input is outside the training distribution of a machine learning model, and hence, its prediction on this input cannot be trusted, has received significant attention recently. Several techniques based on statistical, geometric, topological, or relational signature have been developed to detect OOD inputs. In this paper, we investigate two major sources of uncertainty in a deep neural network's prediction on a given input. The first uncertainty source is aleatoric due to the ambiguity or noise in the input. The second is due to epistemic uncertainty arising due to insufficiency of training data and corresponds to the OOD inputs. We posit that the training of deep neural networks using usual objectives cannot distinguish between these two sources of uncertainty. We describe lightweight modifications to the training of deep neural networks that enable deep neural networks to learn representations that can be used to detect OODs. For evaluating our approach, we conducted experiments on CIFAR10 and SVHN as in-distribution data and Imagenet, SVHN (for CIFAR10), and CIFAR10 (for SVHN) as OOD data across different DNN architectures such as ResNet34, WideResNet, and DenseNet.
