SIMLIN: a bioinformatics tool for prediction of S-sulphenylation in the\xa0human proteome based on multi-stage ensemble-learning models

Xiaochuan Wang,Chen Li,Fuyi Li,Varun S Sharma,Jiangning Song,Geoffrey I Webb

doi:10.1186/s12859-019-3178-6

Abstract

BackgroundS-sulphenylation is a ubiquitous protein post-translational modification (PTM) where an S-hydroxyl (−SOH) bond is formed via the reversible oxidation on the Sulfhydryl group of cysteine (C). Recent experimental studies have revealed that S-sulphenylation plays critical roles in many biological functions, such as protein regulation and cell signaling. State-of-the-art bioinformatic advances have facilitated high-throughput in silico screening of protein S-sulphenylation sites, thereby significantly reducing the time and labour costs traditionally required for the experimental investigation of S-sulphenylation.ResultsIn this study, we have proposed a novel hybrid computational framework, termed SIMLIN, for accurate prediction of protein S-sulphenylation sites using a multi-stage neural-network based ensemble-learning model integrating both protein sequence derived and protein structural features. Benchmarking experiments against the current state-of-the-art predictors for S-sulphenylation demonstrated that SIMLIN delivered competitive prediction performance. The empirical studies on the independent testing dataset demonstrated that SIMLIN achieved 88.0% prediction accuracy and an AUC score of 0.82, which outperforms currently existing methods.ConclusionsIn summary, SIMLIN predicts human S-sulphenylation sites with high accuracy thereby facilitating biological hypothesis generation and experimental validation. The web server, datasets, and online instructions are freely available at http://simlin.erc.monash.edu/ for academic purposes.

Highlights

S-sulphenylation is a ubiquitous protein post-translational modification (PTM) where an S-hydroxyl (−SOH) bond is formed via the reversible oxidation on the Sulfhydryl group of cysteine (C)
We propose a novel bioinformatics tool for improved prediction of protein S-sulphenylation sites, named SIMLIN, integrating a number of protein sequencederived and protein structural features based on the sequence motifs previously identified in [6, 7]
Proteome-wide prediction and functional enrichment analysis In order to more effectively portray the distribution of predicted S-sulphenylation sites and their potential molecular functions, we performed human proteome-wide S-sulphenylation site prediction using the protein sequences collected from the UniProt database (Version Sep 2017) and our proposed SIMLIN framework

Summary

Results

We have proposed a novel hybrid computational framework, termed SIMLIN, for accurate prediction of protein S-sulphenylation sites using a multi-stage neural-network based ensemble-learning model integrating both protein sequence derived and protein structural features. Benchmarking experiments against the current state-of-the-art predictors for S-sulphenylation demonstrated that SIMLIN delivered competitive prediction performance. The empirical studies on the independent testing dataset demonstrated that SIMLIN achieved 88.0% prediction accuracy and an AUC score of 0.82, which outperforms currently existing methods

Conclusions

Background

Results and discussion

Method SOHPRED

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC bioinformatics	Publication Date: Nov 21, 2019
Citations: 11	License type: open-access

R Discovery Prime

R Discovery Prime

SIMLIN: a bioinformatics tool for prediction of S-sulphenylation in the\xa0human proteome based on multi-stage ensemble-learning models

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC bioinformatics

Lead the way for us

Similar Papers

SysPTM: A Systematic Resource for Proteomic Research on Post-translational Modifications
Hong Li ... Yixue Li
Molecular & Cellular Proteomics | VOL. 8
Hong Li, et. al.Hong Li ... Yixue Li
01 Aug 2009
Molecular & Cellular Proteomics | VOL. 8

An ensemble deep learning model for exhaust emissions prediction of heavy oil-fired boiler combustion
Zhezhe Han ... Chuanlong Xu
Fuel | VOL. 308
Zhezhe Han, et. al.Zhezhe Han ... Chuanlong Xu
15 Sep 2021
Fuel | VOL. 308

DeepDN_iGlu: prediction of lysine glutarylation sites based on attention residual learning method and DenseNet.
Jianhua Jia ... Mingwei Sun
Mathematical biosciences and engineering : MBE | VOL. 20
Jianhua Jia, et. al.Jianhua Jia ... Mingwei Sun
01 Jan 2021
Mathematical biosciences and engineering : MBE | VOL. 20

Novel Oxidative Modifications in Redox-Active Cysteine Residues
Jaeho Jeong ... Kong-Joo Lee
Molecular & Cellular Proteomics | VOL. 10
Jaeho Jeong, et. al.Jaeho Jeong ... Kong-Joo Lee
01 Mar 2011
Molecular & Cellular Proteomics | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SIMLIN: a bioinformatics tool for prediction of S-sulphenylation in the\xa0human proteome based on multi-stage ensemble-learning models

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC bioinformatics