Profiling DNA Sequence of SARS-Cov-2 Virus Using Machine Learning Algorithm

Lailil Muflikhah,Muh Arif Rahman,Agus Wahyu Widodo

doi:10.11591/eei.v11i2.3487

Lailil Muflikhah, Muh Arif Rahman + Show 1 more

Open Access

https://doi.org/10.11591/eei.v11i2.3487

Copy DOI

Abstract

Corona virus disease-19 (COVID-19) is growing rapidly because it is an infectious disease. This disease is caused by a virus belonging to the type of DNA virus with very diverse genetics. This study proposes a feature extraction method using k-mer to obtain nucleotide frequencies in protein coding. In profiling viral DNA sequences, this study proposes to obtain similarity by country using hierarchical k-means, where the results are averaged by the hierarchical clustering method and then find the initial cluster center. The experimental results show that the silhouette, purity, and entropy are 0.867, 0.208, and 0.892, respectively. Then, we apply the Gini index feature selection to find the important components as characteristics in each country. The selected components are implemented using the ensemble method, Random Forest, to evaluate their performance. The experimental results showed high performance, including sensitivity, accuracy, specificity, and area under the curve (AUC).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Bulletin of Electrical Engineering and Informatics	Publication Date: Apr 1, 2022
Citations: 3	License type: CC BY-SA 4.0

R Discovery Prime

R Discovery Prime

Profiling DNA Sequence of SARS-Cov-2 Virus Using Machine Learning Algorithm

Abstract

Talk to us

Similar Papers

More From: Bulletin of Electrical Engineering and Informatics

Lead the way for us

Similar Papers

Interferon lambda 3 in the early phase of coronavirus disease-19 can predict oxygen requirement.
...
European Journal of Clinical Investigation | VOL. 52
, et. al. ...
12 May 2022
European Journal of Clinical Investigation | VOL. 52

Editor's evaluation: Derivation and external validation of clinical prediction rules identifying children at risk of linear growth faltering
Eduardo Franco
-
Eduardo FrancoEduardo Franco
05 Sep 2022
05 Sep 2022

Decision letter: Derivation and external validation of clinical prediction rules identifying children at risk of linear growth faltering
Andrew N Mertens ... Eduardo Franco
-
Andrew N Mertens, et. al.Andrew N Mertens ... Eduardo Franco
05 Sep 2022
05 Sep 2022

Author response: Derivation and external validation of clinical prediction rules identifying children at risk of linear growth faltering
Sharia M Ahmed ... Ben J Brintz
-
Sharia M Ahmed, et. al.Sharia M Ahmed ... Ben J Brintz
21 Dec 2022
21 Dec 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Profiling DNA Sequence of SARS-Cov-2 Virus Using Machine Learning Algorithm

Abstract

Talk to us

Similar Papers

More From: Bulletin of Electrical Engineering and Informatics