Word2vec neural model-based technique to generate protein vectors for combating COVID-19: a machine learning approach.

Toby A Adjuik, Daniel Ananey-Obiri

doi:10.1007/s41870-022-00949-2

Abstract

The world was ambushed in 2019 by the COVID-19 virus, which affected the health, economy, and lifestyle of individuals worldwide. One way of combating such a public health concern is to use appropriate, rapid, and unbiased diagnostic tools for quick detection of infected people. However, a current dearth of bioinformatics tools necessitates modeling studies to help diagnose COVID-19 cases. Molecular-based methods for detecting COVID-19, such as the real-time reverse transcription polymerase chain reaction (rRT-PCR), are time consuming and prone to contamination. Modern bioinformatics tools have made it possible to create large databases of protein sequences of various diseases, apply data mining techniques, and accurately diagnose diseases. However, the current sequence alignment tools that use these databases are not able to detect novel COVID-19 viral sequences because of high sequence dissimilarity. The objective of this study, therefore, was to develop models that can rapidly and accurately classify COVID-19 viral sequences using protein vectors generated by a neural word embedding technique. Five machine learning models were developed using datasets from the National Center for Biotechnology Information (NCBI): K-nearest neighbor regression (KNN), support vector machine (SVM), random forest (RF), linear discriminant analysis (LDA), and logistic regression. Our results suggest that the RF model performed better than all other models on the training dataset, with a 99% accuracy score, and achieved 99.5% accuracy on the testing dataset. The implication of this study is that rapid detection of the COVID-19 virus in suspected cases could potentially save lives, as less time will be needed to ascertain the status of a patient.
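
To make the idea of "protein vectors" concrete, the following is a minimal sketch, not the authors' implementation. It assumes, as is common in word-embedding approaches to proteins, that a sequence is split into overlapping k-mers that play the role of words, and that a sequence-level vector is obtained by averaging the vectors of its k-mers. The function names `kmer_tokens` and `mean_vector`, the choice k = 3, and the toy embedding values are all hypothetical; the abstract does not specify these details.

```python
from typing import Dict, List, Optional


def kmer_tokens(sequence: str, k: int = 3) -> List[str]:
    """Split a protein sequence into overlapping k-mers ("words").

    Assumption: overlapping k-mers with k=3; the abstract does not state
    how sequences were tokenized.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    seq = sequence.strip().upper()
    # A sequence shorter than k yields no tokens.
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def mean_vector(tokens: List[str],
                embedding: Dict[str, List[float]]) -> Optional[List[float]]:
    """Average the vectors of the tokens found in `embedding`.

    Returns None when no token has a vector. In practice `embedding` would
    come from a trained word2vec model; here it is a plain dict.
    """
    known = [embedding[t] for t in tokens if t in embedding]
    if not known:
        return None
    dim = len(known[0])
    return [sum(v[d] for v in known) / len(known) for d in range(dim)]


if __name__ == "__main__":
    # Toy, made-up 2-dimensional embedding for illustration only.
    toy_embedding = {"MFV": [1.0, 0.0], "FVF": [0.0, 1.0]}
    tokens = kmer_tokens("MFVFL")
    print(tokens)
    print(mean_vector(tokens, toy_embedding))
```

In a full pipeline, the token lists would be used to train a word2vec model, and the resulting fixed-length sequence vectors would serve as features for classifiers such as the random forest mentioned above.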

Journal: International Journal of Information Technology: An Official Journal of Bharati Vidyapeeth's Institute of Computer Applications and Management
Publication Date: May 19, 2022
Citations: 14
License type: NO-CC CODE
