Missing data techniques in classification for cardiovascular dysautonomias diagnosis.

Ali Idri, Ibtissam Abnane, José Luis Fernandez-Aleman, Ilham Kadi

doi:10.1007/s11517-020-02266-x

Abstract

Missing data (MD) is a common and unavoidable problem for data mining (DM)-based decision systems in e-health, since many historical medical datasets contain a large number of missing values. A pre-processing stage is therefore usually required to handle missing values before building any DM-based decision system. This paper evaluates the impact of MD techniques on classification systems for cardiovascular dysautonomias diagnosis. We analyzed and compared the accuracy rates of four classification techniques, random forest (RF), support vector machines (SVM), C4.5 decision tree, and Naive Bayes (NB), under two MD techniques: deletion and imputation with k-nearest neighbors (KNN). A total of 216 experiments were carried out, combining the four classifiers with three missingness mechanisms (MCAR: missing completely at random, MAR: missing at random, and NMAR: not missing at random), two MD techniques (deletion and KNN imputation), and nine MD percentages from 10 to 90%, on a dataset collected from the autonomic nervous system (ANS) unit of the University Hospital Avicenne in Morocco. The results suggest that using KNN imputation rather than deletion improves the accuracy rates of the four classifiers. Moreover, the MD percentage has a negative impact on classification performance regardless of the MD mechanism and MD technique used: the accuracy rates of the four classifiers decrease as the MD percentage increases.
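The experimental design described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how an MD percentage is applied, which value of k or which distance KNN imputation uses, or which tooling was involved. The sketch assumes that the MD percentage is the share of rows that receive one missing value, that neighbors are found by Euclidean distance over the observed features, and that k = 3. All function names and the synthetic data are hypothetical, and only the MCAR mechanism is shown.

```python
import itertools
import math
import random

# Factors named in the abstract; the string labels are illustrative only.
CLASSIFIERS = ["RF", "SVM", "C4.5", "NB"]
MECHANISMS = ["MCAR", "MAR", "NMAR"]
MD_TECHNIQUES = ["deletion", "knn_imputation"]
MD_PERCENTAGES = list(range(10, 100, 10))  # 10, 20, ..., 90


def experiment_grid():
    """Enumerate every (classifier, mechanism, technique, percentage) combination."""
    return list(itertools.product(CLASSIFIERS, MECHANISMS, MD_TECHNIQUES, MD_PERCENTAGES))


def inject_mcar(rows, percentage, rng):
    """Blank one random feature in `percentage`% of the rows.

    Assumption: rows and features are chosen uniformly at random, so
    missingness does not depend on any data value (MCAR).
    """
    n_incomplete = round(len(rows) * percentage / 100)
    chosen = set(rng.sample(range(len(rows)), n_incomplete))
    result = []
    for i, row in enumerate(rows):
        row = list(row)
        if i in chosen:
            row[rng.randrange(len(row))] = None
        result.append(row)
    return result


def listwise_deletion(rows):
    """Deletion: drop every row that contains a missing value."""
    return [row for row in rows if None not in row]


def knn_impute(rows, k=3):
    """Fill each missing value with the feature mean of the k nearest complete rows.

    Distance is Euclidean over the features the incomplete row does have.
    """
    complete = [row for row in rows if None not in row]
    imputed = []
    for row in rows:
        if None not in row:
            imputed.append(list(row))
            continue
        observed = [j for j, v in enumerate(row) if v is not None]
        neighbours = sorted(
            complete,
            key=lambda c: math.sqrt(sum((c[j] - row[j]) ** 2 for j in observed)),
        )[:k]
        imputed.append([
            v if v is not None else sum(n[j] for n in neighbours) / len(neighbours)
            for j, v in enumerate(row)
        ])
    return imputed


rng = random.Random(0)
# Synthetic stand-in data: 20 rows, 4 numeric features (not the hospital dataset).
data = [[rng.gauss(0, 1) for _ in range(4)] for _ in range(20)]
holed = inject_mcar(data, 30, rng)

print(len(experiment_grid()))         # size of the 4 x 3 x 2 x 9 design
print(len(listwise_deletion(holed)))  # rows surviving deletion
print(len(knn_impute(holed)))         # rows kept by imputation
```

The grid size reproduces the 216 experiments reported in the abstract. The sketch also shows one plausible reason deletion might be at a disadvantage as the MD percentage grows: under these assumptions deletion shrinks the training set, while imputation keeps every row.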

Journal: Medical & Biological Engineering & Computing
Publication Date: Sep 24, 2020
Citations: 6

