Abstract

Copy number variation (CNV), is defined as repetitions or deletions of genomic segments of 1 Kb to 5 Mb, and is a major trigger for human disease. The high-throughput and low-cost characteristics of next-generation sequencing technology provide the possibility of the detection of CNVs in the whole genome, and also greatly improve the clinical practicability of next-generation sequencing (NGS) testing. However, current methods for the detection of CNVs are easily affected by sequencing and mapping errors, and uneven distribution of reads. In this paper, we propose an improved approach, CNV-MEANN, for the detection of CNVs, involving changing the structure of the neural network used in the MFCNV method. This method has three differences relative to the MFCNV method: (1) it utilizes a new feature, mapping quality, to replace two features in MFCNV, (2) it considers the influence of the loss categories of CNV on disease prediction, and refines the output structure, and (3) it uses a mind evolutionary algorithm to optimize the backpropagation (neural network) neural network model, and calculates individual scores for each genome bin to predict CNVs. Using both simulated and real datasets, we tested the performance of CNV-MEANN and compared its performance with those of seven widely used CNV detection methods. Experimental results demonstrated that the CNV-MEANN approach outperformed other methods with respect to sensitivity, precision, and F1-score. The proposed method was able to detect many CNVs that other approaches could not, and it reduced the boundary bias. CNV-MEANN is expected to be an effective method for the analysis of changes in CNVs in the genome.

Highlights

  • Copy number variations (CNVs) (Freeman et al, 2006; Redon et al, 2006) are important genomic structural variations, which are widespread in the human genome, and cause a variety of complex diseases, such as Crohn’s disease (Fellermann et al, 2006; Aldhous et al, 2010), ankylosing spondylitis (Wang et al, 2013), Alzheimer’s disease (Brouwers et al, 2012), and autism (Sebat et al., 2007)

  • Based on the above research, we propose an improved method called CNV-MEANN (CNV detection of neural network based on mind evolutionary algorithm)

  • There are three main reasons for the superior performance of CNV-MEANN: (1) it selected numerous samples used as the training data from data of different configurations, and the fault tolerance is improved, (2) it extracted a feature called mapping quality that better reflects the states of CNV, and considered the joint actions of multiple features by using the neural network, whereas other methods only consider the marginal effects of each feature, and (3) it used an MEA to optimizing the neural network, enhancing the robustness of the model

Read more

Summary

METHODS

We propose an improved approach, CNV-MEANN, for the detection of CNVs, involving changing the structure of the neural network used in the MFCNV method. This method has three differences relative to the MFCNV method: (1) it utilizes a new feature, mapping quality, to replace two features in MFCNV, (2) it considers the influence of the loss categories of CNV on disease prediction, and refines the output structure, and (3) it uses a mind evolutionary algorithm to optimize the backpropagation (neural network) neural network model, and calculates individual scores for each genome bin to predict CNVs. This method has three differences relative to the MFCNV method: (1) it utilizes a new feature, mapping quality, to replace two features in MFCNV, (2) it considers the influence of the loss categories of CNV on disease prediction, and refines the output structure, and (3) it uses a mind evolutionary algorithm to optimize the backpropagation (neural network) neural network model, and calculates individual scores for each genome bin to predict CNVs Using both simulated and real datasets, we tested the performance of CNV-MEANN and compared its performance with those of seven widely used CNV detection methods.

INTRODUCTION
Methods
Data Preprocessing and Quantification of Eigenvalues
Workflow of CNV-MEANN
Construction of a Neural Network
Calculation of the Individual Scores by the Mind Evolutionary Algorithm
RESULTS
Prediction of CNVs
Real Data Applications
Findings
DISCUSSION AND CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call