SUMOgo: Prediction of sumoylation sites on lysines by motif screening models and the effects of various post-translational modifications

Chi-Chang Chang, Yen-Wei Chu, Chi-Hua Tung, Chi-Wei Chen, Chin-Hau Tu

doi:10.1038/s41598-018-33951-5

Abstract

Most modern tools used to predict sites of small ubiquitin-like modifier (SUMO) binding (referred to as SUMOylation) rely on algorithms, chemical features of the protein, and consensus motifs. However, these tools rarely consider how post-translational modification (PTM) information for other sites within the same protein affects prediction accuracy. This study applied the Random Forest machine learning method, together with motif screening models and a feature selection combination mechanism, to develop a SUMOylation prediction system referred to as SUMOgo. In the prediction method, PTM sites were coded as new functional features in addition to structural features such as sequence-based binary coding, encoded chemical features of proteins, and encoded secondary structure information that is important for PTM. Twenty cycles of prediction were conducted with a 1:1 combination of positive test data and random negative data. The Matthews correlation coefficient (MCC) of SUMOgo reached 0.511, which is higher than that of current commonly used tools. This study further verified the important role of PTM in SUMOgo and includes a case study on CREB binding protein (CREBBP). The website for the final tool is http://predictor.nchu.edu.tw/SUMOgo.

Highlights

Post-translational modification (PTM) of proteins refers to the chemical modification of proteins after their translation[1,2,3].
Our research developed a SUMOylation prediction tool, named SUMOgo, which we used to explore whether such competition can affect the accuracy of SUMOylation prediction tools and whether the rules of other PTMs can be applied to SUMOylation.
The results showed that the prediction accuracy of SUMOgo is greater than that of other SUMOylation site prediction tools, with an average Matthews correlation coefficient (MCC) of up to 0.511.
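
Since MCC is the headline metric here, a minimal Python sketch of how it is computed from confusion-matrix counts may help. The function name `mcc` and the example counts are illustrative assumptions and are not taken from the paper.

```python
import math


def mcc(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Common convention: report 0.0 when a row or column of the matrix is empty.
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical counts for one balanced (1:1) evaluation round; not from the paper.
score = mcc(tp=80, tn=70, fp=30, fn=20)
print(round(score, 3))
```

MCC ranges from -1 to 1, with 0 corresponding to chance-level prediction, which makes it a reasonable single-number summary when positives and negatives are evaluated together.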

Summary

Methods

Positive and negative data sets not matching the consensus motif were named CN_P and CN_N, respectively. We refer to this procedure for motif screening models as the CNCY system. A comparison of consensus motif types and the ratio of positive to negative data (P/N ratio) is necessary before the prediction model is constructed. Positive and negative data sets with different proportions of CN and CY were used for SVM learning; the average MCC of each combination after prediction was calculated and used to construct the motif screening models.
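
The CN/CY split described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the SUMOgo implementation: it assumes the canonical SUMOylation consensus motif ψ-K-x-E/D with ψ taken as one of I, V, L, M, F, and it assumes by analogy with CN_P and CN_N that motif-matching sets would be called CY_P and CY_N. The helper names `matches_consensus` and `split_cncy` are hypothetical.

```python
import re

# Assumption: canonical psi-K-x-E/D motif, with psi restricted to I, V, L, M, F.
# The motif definition actually used by SUMOgo may differ.
CONSENSUS = re.compile(r"[IVLMF]K.[ED]")


def matches_consensus(seq: str, k: int) -> bool:
    """True if the lysine at 0-based index k sits in a psi-K-x-E/D context."""
    if k < 1 or k + 2 >= len(seq):
        return False  # not enough flanking residues to test the motif
    return CONSENSUS.fullmatch(seq[k - 1 : k + 3]) is not None


def split_cncy(seq: str, positive_sites: set) -> dict:
    """Group every lysine by motif match (CY/CN) and label (P/N)."""
    groups = {"CY_P": [], "CY_N": [], "CN_P": [], "CN_N": []}
    for k, residue in enumerate(seq):
        if residue != "K":
            continue
        motif = "CY" if matches_consensus(seq, k) else "CN"
        label = "P" if k in positive_sites else "N"
        groups[f"{motif}_{label}"].append(k)
    return groups


# Toy sequence with made-up annotated sites (0-based indices 3 and 8).
print(split_cncy("MAIKTEGGKAAVKLDK", {3, 8}))
```

Keeping the four groups separate makes it straightforward to vary the proportion of CN and CY data and the P/N ratio when assembling training sets.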

[Table: columns Feature, Total bits, Position]

Results and Discussion

[Table: Consensus motif; columns CN, CY, CN, CY]

[Table: columns No mod, FSC]

Author Contributions

Additional Information

Journal: Scientific Reports
Publication Date: Oct 19, 2018
Citations: 35
License type: open-access
