A new complement naïve Bayesian approach for biomedical data classification

Amare Anagaw, Yang-Lang Chang

doi:10.1007/s12652-018-1160-1

Abstract

Biomedical data classification is challenging because the data are usually large, noisy, and imbalanced. Noise in particular can degrade classification accuracy, lengthen the time needed to build a classifier, and increase the size of the classifier. Most existing learning algorithms therefore integrate various mechanisms to improve learning in noisy environments, but noise can still have a serious negative impact. A more reasonable solution may be to apply a preprocessing mechanism that handles noisy instances before a learner is built. We therefore introduce a method called double learning to improve the classification performance of our model. To the best of the authors' knowledge, most previous work constructs (trains) the model from the normal (noise-free) instances after the noisy instances have been isolated. That approach increases the computational cost of model construction for active learners and the total computational time for passive learners. It also ignores minority-class instances, which leads to misclassification of minority-class instances at test time. The main idea of this paper is to construct the model from the noisy instances instead: only the instances identified as noisy, rather than the normal (noise-free) data, are used for model construction. This shortens model construction by reducing the number of training instances and improves classification performance. Because noisy instances are used for model construction, the entire naive Bayesian working logic is reversed. The resulting method, called complement naive Bayesian (CNB), uses complement-based learning to improve accuracy. Finally, the proposed CNB is compared with naive Bayesian and several other classification algorithms on the single photon emission computed tomography, Indian liver patient, Wilt, and Tic-Tac-Toe endgame datasets. The experimental results show promising computational time and accuracy on both the balanced and imbalanced datasets used.
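The abstract does not spell out the algorithm, so the following is only a minimal sketch of one possible reading of "reversed" naive Bayesian logic, not the authors' CNB. It assumes that the noisy instances have already been identified by some earlier step (not shown), that features are categorical, and that "reversing" means predicting the class with the lowest score under a model trained on the noisy subset. The function names, the `complement` flag, and the toy data are all hypothetical.

```python
import math
from collections import Counter, defaultdict


def train_nb(rows, labels, alpha=1.0):
    """Fit a categorical naive Bayes model with Laplace smoothing."""
    n_features = len(rows[0])
    # Distinct values seen per feature, needed for the smoothing denominator.
    values = [sorted({r[j] for r in rows}) for j in range(n_features)]
    feat_counts = defaultdict(Counter)  # (class, feature index) -> value counts
    for r, y in zip(rows, labels):
        for j, v in enumerate(r):
            feat_counts[(y, j)][v] += 1
    return {
        "classes": sorted(set(labels)),
        "values": values,
        "class_counts": Counter(labels),
        "feat_counts": feat_counts,
        "alpha": alpha,
        "n": len(labels),
    }


def log_scores(model, row):
    """Return the unnormalized log posterior of each class for one row."""
    a = model["alpha"]
    scores = {}
    for c in model["classes"]:
        n_c = model["class_counts"][c]
        s = math.log((n_c + a) / (model["n"] + a * len(model["classes"])))
        for j, v in enumerate(row):
            num = model["feat_counts"][(c, j)][v] + a
            den = n_c + a * len(model["values"][j])
            s += math.log(num / den)
        scores[c] = s
    return scores


def predict(model, row, complement=False):
    """Standard rule picks the highest score; the assumed reversed rule
    (complement=True) picks the lowest, because the model was trained
    on instances whose labels are believed to be wrong."""
    scores = log_scores(model, row)
    pick = min if complement else max
    return pick(scores, key=scores.get)


# Hypothetical noisy subset: the true concept is a -> pos, b -> neg,
# but every label below is flipped.
noisy_rows = [("a", "x"), ("a", "y"), ("b", "x"), ("b", "y")]
noisy_labels = ["neg", "neg", "pos", "pos"]

model = train_nb(noisy_rows, noisy_labels)
print(predict(model, ("a", "x"), complement=True))
print(predict(model, ("b", "y"), complement=True))
```

Under these assumptions the model is fit on far fewer instances than the full training set, which is the source of the shorter construction time the abstract describes. How noisy instances are detected, and how the reversal is actually defined, is left to the full paper.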

Journal: Journal of Ambient Intelligence and Humanized Computing
Publication Date: Dec 8, 2018
Citations: 27

Similar Papers

Class Noise vs. Attribute Noise: A Quantitative Study
Xingquan Zhu ... Xindong Wu
Artificial Intelligence Review | VOL. 22
01 Nov 2004

A novel feature selection approach for biomedical data classification
Yonghong Peng ... Jianmin Jiang
Journal of Biomedical Informatics | VOL. 43
30 Jul 2009

Performance Evaluation of Sentiment Analysis on Balanced and Imbalanced Dataset Using Ensemble Approach
Shini George ... V Srividhya
Indian Journal of Science and Technology | VOL. 15
05 May 2022

RSMOTE: improving classification performance over imbalanced medical datasets.
Mehdi Naseriparsa ... Ahmed Al-Shammari
Health Information Science and Systems | VOL. 8
12 Jun 2020
