A Novel Sequence-Based Method for Phosphorylation Site Prediction with Feature Selection and Analysis

Zhi-Song He,Xiang-Ying Kong,Kuo-Chen Chou,Xiao-He Shi,Yu-Bei Zhu

doi:10.2174/092986612798472893

Abstract

Phosphorylation is one of the most important post-translational modifications, and the identification of protein phosphorylation sites is particularly important for studying disease diagnosis. However, experimental detection of phosphorylation sites is labor intensive. It would be beneficial if computational methods are available to provide an extra reference for the phosphorylation sites. Here we developed a novel sequence-based method for serine, threonine, and tyrosine phosphorylation site prediction. Nearest Neighbor algorithm was employed as the prediction engine. The peptides around the phosphorylation sites with a fixed length of thirteen amino acid residues were extracted via a sliding window along the protein chains concerned. Each of such peptides was coded into a vector with 6,072 features, derived from Amino Acid Index (AAIndex) database, for the classification/detection. Incremental Feature Selection, a feature selection algorithm based on the Maximum Relevancy Minimum Redundancy (mRMR) method was used to select a compact feature set for a further improvement of the classification performance. Three predictors were established for identifying the three types of phosphorylation sites, achieving the overall accuracies of 66.64%, 66.11%% and 66.69%, respectively. These rates were obtained by rigorous jackknife cross-validation tests.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Novel Sequence-Based Method for Phosphorylation Site Prediction with Feature Selection and Analysis

Abstract

Talk to us

Similar Papers

More From: Protein & Peptide Letters

Lead the way for us

Journal: Protein & Peptide Letters	Publication Date: Jan 1, 2012
Citations: 40

Similar Papers

GalNAc-transferase specificity prediction based on feature selection method
Lin Lu ... Yu-Dong Cai
Peptides | VOL. 30
Lin Lu, et. al.Lin Lu ... Yu-Dong Cai
08 Oct 2008
Peptides | VOL. 30

Prediction of Tyrosine Sulfation with mRMR Feature Selection and Analysis
Shen Niu ... Kaiyan Feng
Journal of Proteome Research | VOL. 9
Shen Niu, et. al.Shen Niu ... Kaiyan Feng
11 Nov 2010
Journal of Proteome Research | VOL. 9

Predict and analyze S-nitrosylation modification sites with the mRMR and IFS approaches
Bi-Qing Li ... Shen Niu
Journal of Proteomics | VOL. 75
Bi-Qing Li, et. al.Bi-Qing Li ... Shen Niu
11 Dec 2011
Journal of Proteomics | VOL. 75

Analysis and Identification of Aptamer-Compound Interactions with a Maximum Relevance Minimum Redundancy and Nearest Neighbor Algorithm.
Shaopeng Wang ... Yu-Dong Cai
BioMed Research International | VOL. 2016
Shaopeng Wang, et. al.Shaopeng Wang ... Yu-Dong Cai
01 Jan 2015
BioMed Research International | VOL. 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Novel Sequence-Based Method for Phosphorylation Site Prediction with Feature Selection and Analysis

Abstract

Talk to us

Similar Papers

More From: Protein &amp; Peptide Letters

More From: Protein & Peptide Letters