Robust prediction of protein subcellular localization combining PCA and WSVMs

Jiang Tian,Hong Gu,Wenqi Liu,Chiyang Gao

doi:10.1016/j.compbiomed.2011.05.016

Abstract

Automated prediction of protein subcellular localization is an important tool for genome annotation and drug discovery, and Support Vector Machines (SVMs) can effectively solve this problem in a supervised manner. However, the datasets obtained from real experiments are likely to contain outliers or noises, which can lead to poor generalization ability and classification accuracy. To explore this problem, we adopt strategies to lower the effect of outliers. First we design a method based on Weighted SVMs, different weights are assigned to different data points, so the training algorithm will learn the decision boundary according to the relative importance of the data points. Second we analyse the influence of Principal Component Analysis (PCA) on WSVM classification, propose a hybrid classifier combining merits of both PCA and WSVM. After performing dimension reduction operations on the datasets, kernel-based possibilistic c-means algorithm can generate more suitable weights for the training, as PCA transforms the data into a new coordinate system with largest variances affected greatly by the outliers. Experiments on benchmark datasets show promising results, which confirms the effectiveness of the proposed method in terms of prediction accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Robust prediction of protein subcellular localization combining PCA and WSVMs

Abstract

Talk to us

Similar Papers

More From: Computers in Biology and Medicine

Lead the way for us

Journal: Computers in Biology and Medicine	Publication Date: Jun 30, 2011
Citations: 11

Similar Papers

A method for improving protein localization prediction from datasets with outliers
Jiang Tian ... Wenqi Liu
-
Jiang Tian, et. al. Jiang Tian ... Wenqi Liu
01 Mar 2009
01 Mar 2009

Predict mycobacterial proteins subcellular locations by incorporating pseudo-average chemical shift into the general form of Chou’s pseudo amino acid composition
Guo-Liang Fan ... Qian-Zhong Li
Journal of Theoretical Biology | VOL. 304
Guo-Liang Fan, et. al.Guo-Liang Fan ... Qian-Zhong Li
22 Mar 2012
Journal of Theoretical Biology | VOL. 304

Improved Prediction of Eukaryotic Protein Subcellular Localization Using Particle Swarm Optimization of Multiple Classifiers
Sirapop Nuannimnoi ... Supatcha Lertampaiporn
-
Sirapop Nuannimnoi, et. al.Sirapop Nuannimnoi ... Supatcha Lertampaiporn
01 Nov 2017
01 Nov 2017

Improving prediction of protein subcellular localization using evolutionary information and sequence-order information
Minghui Wang ... Zhewen Fan
-
Minghui Wang, et. al. Minghui Wang ... Zhewen Fan
01 Jan 2004
01 Jan 2004

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Robust prediction of protein subcellular localization combining PCA and WSVMs

Abstract

Talk to us

Similar Papers

More From: Computers in Biology and Medicine