On classification of biological data using outlier detection

Yushan Qiu Yushan Qiu,Wenpin Hou Wenpin Hou,Xiaoqing Cheng Xiaoqing Cheng,Wai-Ki Ching Wai-Ki Ching

doi:10.1049/cp.2015.0617

Abstract

With the rapid development of information technology, the number of datasets, as well as their complexity and dimension, have been growing dramatically. This dramatic growth of biology data and non-biological commercial databases becomes a challenging issue in data mining. Classification technique is one of the major tools in the captured research area. However, the performance of classification may be degraded when there exists noise in the captured databases. Therefore, outlier detection becomes an urgent need and the issue of how to integrate outlier detection method and classification techniques is an important and challenging issue. In this paper, we proposed a novel and effective approach based on k-means clustering to identify outliers in the databases. In particular, we employed one of famous classification techniques, Support Vector Machine (SVM), owing to its ability to handle highdimensional data set. We also compare the classification results with the multivariate outlier detection method. Numerical results on two different data sets indicate that the classification results after removing the outliers by our proposed method are much better than the multivariate outlier detection method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

On classification of biological data using outlier detection

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Evaluation of robust outlier detection methods for zero-inflated complex data
M Templ ... P Filzmoser
Journal of Applied Statistics | VOL. 47
M Templ, et. al.M Templ ... P Filzmoser
27 Sep 2019
Journal of Applied Statistics | VOL. 47

Protein-Protein Etkileşim Verileri için Aykırı Değer Tespiti Uygulanmasi Gerekir mi?
Ezgi Ayyildiz ... Vilda Purutçuoğlu
Turkiye Klinikleri Journal of Biostatistics | VOL. 10
Ezgi Ayyildiz, et. al.Ezgi Ayyildiz ... Vilda Purutçuoğlu
01 Jan 2018
Turkiye Klinikleri Journal of Biostatistics | VOL. 10

Partition-based outlier detection for high dimensional data
Jutian Zhang ... Xin Wang
Journal of Physics: Conference Series | VOL. 2898
Jutian Zhang, et. al.Jutian Zhang ... Xin Wang
01 Nov 2024
Journal of Physics: Conference Series | VOL. 2898

Multivariate outlier detection applied to multiply imputed laboratory data
Kay I Penny ... Ian T Jolliffe
Statistics in Medicine | VOL. 18
Kay I Penny, et. al.Kay I Penny ... Ian T Jolliffe
30 Jul 1999
Statistics in Medicine | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On classification of biological data using outlier detection

Abstract

Talk to us

Similar Papers