COVID-DeepPredictor: Recurrent Neural Network to Predict SARS-CoV-2 and Other Pathogenic Viruses.

Indrajit Saha,Arjit Seal,Nimisha Ghosh,Dariusz Plewczynski,Debasree Maity

doi:10.3389/fgene.2021.569120

Abstract

The COVID-19 disease for Novel coronavirus (SARS-CoV-2) has turned out to be a global pandemic. The high transmission rate of this pathogenic virus demands an early prediction and proper identification for the subsequent treatment. However, polymorphic nature of this virus allows it to adapt and sustain in different kinds of environment which makes it difficult to predict. On the other hand, there are other pathogens like SARS-CoV-1, MERS-CoV, Ebola, Dengue, and Influenza as well, so that a predictor is highly required to distinguish them with the use of their genomic information. To mitigate this problem, in this work COVID-DeepPredictor is proposed on the framework of deep learning to identify an unknown sequence of these pathogens. COVID-DeepPredictor uses Long Short Term Memory as Recurrent Neural Network for the underlying prediction with an alignment-free technique. In this regard, k-mer technique is applied to create Bag-of-Descriptors (BoDs) in order to generate Bag-of-Unique-Descriptors (BoUDs) as vocabulary and subsequently embedded representation is prepared for the given virus sequences. This predictor is not only validated for the dataset using -fold cross-validation but also for unseen test datasets of SARS-CoV-2 sequences and sequences from other viruses as well. To verify the efficacy of COVID-DeepPredictor, it has been compared with other state-of-the-art prediction techniques based on Linear Discriminant Analysis, Random Forests, and Gradient Boosting Method. COVID-DeepPredictor achieves 100% prediction accuracy on validation dataset while on test datasets, the accuracy ranges from 99.51 to 99.94%. It shows superior results over other prediction techniques as well. In addition to this, accuracy and runtime of COVID-DeepPredictor are considered simultaneously to determine the value of k in k-mer, a comparative study among k values in k-mer, Bag-of-Descriptors (BoDs), and Bag-of-Unique-Descriptors (BoUDs) and a comparison between COVID-DeepPredictor and Nucleotide BLAST have also been performed. The code, training, and test datasets used for COVID-DeepPredictor are available at http://www.nitttrkol.ac.in/indrajit/projects/COVID-DeepPredictor/.

Highlights

The first case of COVID-19 surfaced in Wuhan, China in December 2019 (Huang et al, 2020; Meng et al, 2020; Yan L. et al, 2020)
The results show that the method has a prediction accuracy of 98.08% for binary classes (COVID vs. No-Findings) and 87.02% for multiple classes (COVID vs. No-Findings vs. Pneumonia)
To validate COVID-DeepPredictor, experiments are conducted on genomic sequences of different pathogenic viruses

Summary

Introduction

The first case of COVID-19 surfaced in Wuhan, China in December 2019 (Huang et al, 2020; Meng et al, 2020; Yan L. et al, 2020). In no time it spread to 212 countries and territories (Worldometer, 2021) worldwide creating a pandemic in its wake. The resulting mutated viruses may sometimes cause an outbreak of infection in humans e.g., the case of SARS-CoV-2. Next-generation sequencing using metagenomic analysis has recently been used to identify the genetic features of SARS-CoV-2 (Zhou et al, 2020)

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Frontiers in Genetics	Publication Date: Feb 11, 2021
Citations: 13	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

COVID-DeepPredictor: Recurrent Neural Network to Predict SARS-CoV-2 and Other Pathogenic Viruses.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Genetics

Lead the way for us

Similar Papers

Machine Learning Can be Used to Predict Function but Not Pain After Surgery for Thumb Carpometacarpal Osteoarthritis.
Nina L Loos ... J Sebastiaan Souer
Clinical Orthopaedics & Related Research | VOL. 480
Nina L Loos, et. al.Nina L Loos ... J Sebastiaan Souer
18 Jan 2022
Clinical Orthopaedics & Related Research | VOL. 480

ECSYAPPS – A framework for analyzing the effectiveness of classification techniques for early prediction of students academic performance in education sector
Sunita M Dol ... Pradip M Jawandhiya
Engineering Applications of Artificial Intelligence | VOL. 134
Sunita M Dol, et. al.Sunita M Dol ... Pradip M Jawandhiya
04 Jun 2024
Engineering Applications of Artificial Intelligence | VOL. 134

A Primer on Machine Learning.
Audrene S Edwards ... Tun Jie
Transplantation | VOL. 105
Audrene S Edwards, et. al.Audrene S Edwards ... Tun Jie
18 Aug 2020
Transplantation | VOL. 105

Real-time dual prediction of intradialytic hypotension and hypertension using an explainable deep learning model
Donghwan Yun ... Seung Seok Han
Scientific Reports | VOL. 13
Donghwan Yun, et. al.Donghwan Yun ... Seung Seok Han
23 Oct 2023
Scientific Reports | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

COVID-DeepPredictor: Recurrent Neural Network to Predict SARS-CoV-2 and Other Pathogenic Viruses.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Genetics