An ensemble classifier with random projection for predicting multi-label protein subcellular localization

Shibiao Wan,Sun-Yuan Kung,Bai Zhang,Man-Wai Mak,Yue Wang

doi:10.1109/bibm.2013.6732715

Abstract

In protein subcellular localization prediction, a predominant scenario is that the number of available features is much larger than the number of data samples. Among the large number of features, many of them may contain redundant or irrelevant information, causing the prediction systems suffer from overfitting. To address this problem, this paper proposes a dimensionality-reduction method that applies random projection (RP) to construct an ensemble multi-label classifier for predicting protein subcellular localization. Specifically, the frequencies of occurrences of gene-ontology terms are used as feature vectors, which are projected onto lower-dimensional spaces by random projection matrices whose elements conform to a distribution with zero mean and unit variance. The transformed low-dimensional vectors are classified by an ensemble of one-vs-rest multi-label support vector machine (SVM) classifiers, each corresponding to one of the RP matrices. The scores obtained from the ensemble are then fused for making the final decision. Experimental results on two recent datasets suggest that the proposed method can reduce the dimensions by six folds and remarkably improve the classification performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An ensemble classifier with random projection for predicting multi-label protein subcellular localization

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Ensemble random projection for multi-label classification with application to protein subcellular localization
Shibiao Wan ... Sun-Yuan Kung
-
Shibiao Wan, et. al.Shibiao Wan ... Sun-Yuan Kung
01 May 2014
01 May 2014

ProLoc-GO: Utilizing informative Gene Ontology terms for sequence-based prediction of protein subcellular localization
Wen-Lin Huang ... Shinn-Ying Ho
BMC Bioinformatics | VOL. 9
Wen-Lin Huang, et. al.Wen-Lin Huang ... Shinn-Ying Ho
01 Feb 2008
BMC Bioinformatics | VOL. 9

Prediction of Protein Subcellular Localization Based on Fusion of Multi-view Features.
Bo Li ... Lijun Cai
Molecules | VOL. 24
Bo Li, et. al.Bo Li ... Lijun Cai
06 Mar 2019
Molecules | VOL. 24

Abstract 4907: RanBALL: Identifying B-cell acute lymphoblastic leukemia subtypes based on an ensemble random projection model
Lusheng Li ... Jieqiong Wang
Cancer Research | VOL. 84
Lusheng Li, et. al.Lusheng Li ... Jieqiong Wang
22 Mar 2024
Cancer Research | VOL. 84

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An ensemble classifier with random projection for predicting multi-label protein subcellular localization

Abstract

Talk to us

Similar Papers