Interpretable machine learning identification of arginine methylation sites

Syed Danish Ali,Hilal Tayara,Kil To Chong

doi:10.1016/j.compbiomed.2022.105767

Abstract

Protein methylation is one of the most prominent posttranslation modifications that essentially regulates several biological processes in eukaryotes. Therefore, identification of the arginine methylation site is crucial in deciphering its characteristics and functions in cell biology, disease mechanisms, and guided drug development. The computation methods address the long-term bottleneck together with the cost, time, and labor required in experimental methods for large-scale identification of protein arginine methylation sites. In this study, we proposed a robust machine learning-based computational tool known as iIRMethyl, employing the primary sequence and physicochemical properties of protein along with a two-step feature selection method for optimal selection of feature descriptors. Moreover, the performance of iIRMethyl was comprehensively evaluated via k-fold cross-validation on a benchmark dataset and independent test dataset. iIRMethyl demonstrated a remarkably greater performance than the state-of-the-art method and achieved an average area under the curve value of 0.99 for both k-fold cross-validation and an independent test set in the identification of protein arginine methylation sites. Furthermore, the outcomes reveal that iIRMethyl is a robust and accurate computational tool for large-scale identification of arginine methylation sites and would facilitate the understanding of their functional mechanisms and accelerating their application in drug development and clinical therapy. Additionally, the prediction mechanism of the proposed model iIRMethyl is interpreted using the SHapley Additive exPlanation algorithm.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Interpretable machine learning identification of arginine methylation sites

Abstract

Talk to us

Similar Papers

More From: Computers in Biology and Medicine

Lead the way for us

Journal: Computers in Biology and Medicine	Publication Date: Jun 21, 2022
Citations: 6

Similar Papers

N‐Ace: Using solvent accessibility and physicochemical properties to identify protein N‐acetylation sites
Tzong‐Yi Lee ... Wen‐Chi Chang
Journal of Computational Chemistry | VOL. 31
Tzong‐Yi Lee, et. al.Tzong‐Yi Lee ... Wen‐Chi Chang
08 Sep 2010
Journal of Computational Chemistry | VOL. 31

Identifying protein arginine methylation sites using global features of protein sequence coupled with support vector machine optimized by particle swarm optimization algorithm
Yan Zhang ... Ruqin Yu
Chemometrics and Intelligent Laboratory Systems | VOL. 146
Yan Zhang, et. al.Yan Zhang ... Ruqin Yu
18 May 2015
Chemometrics and Intelligent Laboratory Systems | VOL. 146

Renal tumor segmentation, visualization, and segmentation confidence using ensembles of neural networks in patients undergoing surgical resection.
Sophie Bachanek ... Tanja Yani Janssen
European radiology | VOL. -
Sophie Bachanek, et. al.Sophie Bachanek ... Tanja Yani Janssen
23 Aug 2024
European radiology | VOL. -

Evaluating the robustness of models developed from field spectral data in predicting African grass foliar nitrogen concentration using WorldView-2 image as an independent test dataset
Onisimo Mutanga ... Elfatih M Abdel-Rahman
International Journal of Applied Earth Observation and Geoinformation | VOL. 34
Onisimo Mutanga, et. al.Onisimo Mutanga ... Elfatih M Abdel-Rahman
06 Sep 2014
International Journal of Applied Earth Observation and Geoinformation | VOL. 34

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Interpretable machine learning identification of arginine methylation sites

Abstract

Talk to us

Similar Papers

More From: Computers in Biology and Medicine