Loss In Classification Accuracy Research Articles

Deep neural networks (DNNs) have become the driving force behind recent artificial intelligence (AI) research. With the help of a vast amount of training data, neural networks can perform better than traditional machine learning algorithms in many applications. An important problem with implementing a neural network is the design of its architecture. Typically, such an architecture is obtained manually by exploring its hyperparameter space and kept fixed during training. This approach is both time consuming and inefficient. Another issue is that modern neural networks often contain millions of parameters, whereas many applications require small inference models due to imposed resource constraints, such as energy constraints on battery-operated devices. However, efforts to migrate DNNs to such devices typically entail a significant loss of classification accuracy. To address these challenges, we propose a two-step neural network synthesis methodology, called DR+SCANN, that combines two complementary approaches to design compact and accurate DNNs. At the core of our framework is the SCANN methodology that uses three basic architecture-changing operations, namely, connection growth, neuron growth, and connection pruning, to synthesize feedforward architectures with arbitrary structure. These neural networks are not limited to the multilayer perceptron structure. SCANN encapsulates three synthesis methodologies that apply a repeated grow-and-prune paradigm to three architectural starting points. DR+SCANN combines the SCANN methodology with dataset dimensionality reduction to alleviate the curse of dimensionality. We demonstrate the efficacy of SCANN and DR+SCANN on various image and nonimage datasets. We evaluate SCANN on MNIST, CIFAR-10, and ImageNet benchmarks. Without any loss in accuracy, SCANN generates a <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$46.3\times $ </tex-math></inline-formula> smaller network than the LeNet-5 Caffe model. We also compare SCANN-synthesized networks with a state-of-the-art fully connected (FC) feedforward model for MNIST, and show <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$20\times $ </tex-math></inline-formula> ( <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$19.9\times $ </tex-math></inline-formula> ) reduction in the number of parameters (floating-point operations) with little drop in accuracy. For the CIFAR-10 dataset, we target AlexNet and VGG-16 baseline architectures. SCANN reduces the number of parameters in AlexNet by <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$10.1\times $ </tex-math></inline-formula> without any drop in accuracy. It reduces the number of parameters in the FC layers of VGG-16 to only 2.5k while increasing accuracy by 1.05%. On the ImageNet dataset, for the VGG-16 and MobileNetV2 architectures, we reduce network parameters by <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$8.0\times $ </tex-math></inline-formula> and <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$1.3\times $ </tex-math></inline-formula> , respectively, with a similar or improved performance over their respective baselines. We also evaluate the efficacy of using dimensionality reduction alongside SCANN (DR+SCANN) on nine small-to-medium-size datasets. Using this methodology enables us to reduce the number of connections in the network by up to <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$5078.7\times $ </tex-math></inline-formula> (geometric mean: <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$82.1\times $ </tex-math></inline-formula> ), with little to no drop in accuracy. On seven out of nine datasets, we show 0.41%–10.09% accuracy improvements over the FC baseline models. We also show that our synthesis methodology yields neural networks that are much better at navigating the accuracy versus energy efficiency space. This can enable neural network-based inference even on Internet-of-Things sensors.

Read full abstract

Abstract Introduction: Earlier detection is a critical clinical intervention to reduce cancer-related mortality. The DELFI liquid biopsy approach (DNA evaluation of fragments for early interception) utilizing low coverage (1-2x) whole genome sequencing (WGS) to analyze cell-free DNA (cfDNA) fragmentation provides a promising avenue for cancer detection. As sequencing costs remain a barrier to adoption of liquid biopsy approaches for early detection, we evaluated WGS for DELFI using 2-channel Illumina NovaSeq sequencing as a more affordable (~7-fold cost savings) alternative to 4-channel HiSeq instruments. Methods: We performed WGS on the prospectively collected LUCAS cohort of 365 individuals at risk for lung cancer using both HiSeq and NovaSeq platforms (Mathios et. al., Nature Communications 2021). Genome-wide fragmentation was summarized in non-overlapping 5 Mb bins by ratio of short (100-150 bp) to long (151-220 bp) fragments.To measure within-sequencer repeatability, we compared fragmentation profiles of non-cancer individuals to the median non-cancer fragmentation profile by Spearman correlation. Principal component analyses were performed to assess the extent to which the sequencer explains variation of fragmentation profiles across samples.For cancer prediction, we used a penalized logistic regression model with fragmentation profiles and other genome-wide characteristics as features. Machine learning performance was assessed by cross-validation and area under the receiver operator characteristic curve (AUC). To evaluate whether we could have developed the classifier from a combination of NovaSeq with HiSeq sequenced samples, we evaluated performance trained on 90:10%, 75:25%, 50:50%, 25:75%, and 10:90% HiSeq:NovaSeq mixtures, respectively. Results: cfDNA fragmentation profiles were highly concordant among non-cancer individuals for both platforms with median correlations of 0.96 (IQR: 0.95 - 0.97) and 0.95 (IQR: 0.94 - 0.96). Visualization of fragmentation principal components did not reveal separation by sequencing platform. The DELFI approach applied to samples sequenced by NovaSeq recapitulated previously published performance measures based on HiSeq (AUC 0.90, 95% CI 0.86 - 0.94). In simulations of mixed-platform datasets, we found the same qualitative performance (AUC range: 0.893-0.902). Conclusions: cfDNA fragmentation profiles were similar between HiSeq and NovaSeq platforms, and classification accuracies from machine learning models trained on these platforms were equivalent. Our results indicate HiSeq and NovaSeq sequenced samples can be combined in models with no discernible loss in classification accuracy provided a balance of non-cancers and cancers are sequenced on both platforms. The lower cost of NovaSeq sequencing may enable wider adoption of genome-wide fragmentation-based approaches for cancer detection. Citation Format: Akshaya V. Annapragada, Dimitrios Mathios, Stephen Cristiano, Jamie E. Medina, Vilmos Adleff, Noushin Niknafs, Jacob Carey, Nic Dracopoli, Peter Bach, Jillian Phallen, Victor E. Velculescu, Robert B. Scharpf. Towards population-scale screening of human cancer using genome-wide fragmentation profiles of cell-free DNA [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2022; 2022 Apr 8-13. Philadelphia (PA): AACR; Cancer Res 2022;82(12_Suppl):Abstract nr 5159.

Read full abstract

Loss In Classification Accuracy Research Articles

Related Topics

Articles published on Loss In Classification Accuracy

MixTrain: accelerating DNN training via input mixing.

Memory-bound k-mer selection for large and evolutionarily diverse reference libraries.

Memory-bound k -mer selection for large and evolutionary diverse reference libraries.

Automatic Identification and Severity Classification of Retinal Biomarkers in SD-OCT Using Dilated Depthwise Separable Convolution ResNet with SVM Classifier

Testing the accuracy of the SexEst software for sex estimation in a modern Greek sample

Development of Fabrication Techniques for Magneto-Optical Diffractive Deep Neural Networks

Booth Encoding-Based Energy Efficient Multipliers for Deep Learning Systems

An Ensemble Learning Method Based on One-Class and Binary Classification for Credit Scoring

A Technique for Approximate Communication in Network-on-Chips for Image Classification

Bayesian Photonic Accelerators for Energy Efficient and Noise Robust Neural Processing

Acoustic scene analysis using analog spiking neural network

A Hybrid Privacy-Preserving Deep Learning Approach for Object Classification in Very High-Resolution Satellite Images

Accelerating 3D Convolutional Neural Network with Channel Bottleneck Module for EEG-Based Emotion Recognition.

SCANN: Synthesis of Compact and Accurate Neural Networks

A photosensor employing data-driven binning for ultrafast image recognition

A Neuromorphic Processing System With Spike-Driven SNN Processor for Wearable ECG Classification.

Abstract 5159: Towards population-scale screening of human cancer using genome-wide fragmentation profiles of cell-free DNA

Bioinspired random projections for robust, sparse classification

Evaluation and Stratification for Chinese International Education Quality with Deep Learning Model.

One Parameter Defense—Defending Against Data Inference Attacks via Differential Privacy

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Loss In Classification Accuracy Research Articles

Related Topics

Articles published on Loss In Classification Accuracy

MixTrain: accelerating DNN training via input mixing.

Memory-bound k-mer selection for large and evolutionarily diverse reference libraries.

Memory-bound k -mer selection for large and evolutionary diverse reference libraries.

Automatic Identification and Severity Classification of Retinal Biomarkers in SD-OCT Using Dilated Depthwise Separable Convolution ResNet with SVM Classifier

Testing the accuracy of the SexEst software for sex estimation in a modern Greek sample

Development of Fabrication Techniques for Magneto-Optical Diffractive Deep Neural Networks

Booth Encoding-Based Energy Efficient Multipliers for Deep Learning Systems

An Ensemble Learning Method Based on One-Class and Binary Classification for Credit Scoring

A Technique for Approximate Communication in Network-on-Chips for Image Classification

Bayesian Photonic Accelerators for Energy Efficient and Noise Robust Neural Processing

Acoustic scene analysis using analog spiking neural network

A Hybrid Privacy-Preserving Deep Learning Approach for Object Classification in Very High-Resolution Satellite Images

Accelerating 3D Convolutional Neural Network with Channel Bottleneck Module for EEG-Based Emotion Recognition.

SCANN: Synthesis of Compact and Accurate Neural Networks

A photosensor employing data-driven binning for ultrafast image recognition

A Neuromorphic Processing System With Spike-Driven SNN Processor for Wearable ECG Classification.

Abstract 5159: Towards population-scale screening of human cancer using genome-wide fragmentation profiles of cell-free DNA

Bioinspired random projections for robust, sparse classification

Evaluation and Stratification for Chinese International Education Quality with Deep Learning Model.

One Parameter Defense—Defending Against Data Inference Attacks via Differential Privacy