Abstract

S-Adenosyl methionine (SAM), a universal methyl group donor, plays a vital role in biosynthesis and acts as an inhibitor to many enzymes. Due to protein interaction-dependent biological role, SAM has become a favorite target in various therapeutical and clinical studies such as treating cancer, Alzheimer’s, epilepsy, and neurological disorders. Therefore, the identification of the SAM interacting proteins and their interaction sites is a biologically significant problem. However, wet-lab techniques, though accurate, to identify SAM interactions and interaction sites are tedious and costly. Therefore, efficient and accurate computational methods for this purpose are vital to the design and assist such wet-lab experiments. In this study, we present machine learning-based models to predict SAM interacting proteins and their interaction sites by using only primary structures of proteins. Here we modeled SAM interaction prediction through whole protein sequence features along with different classifiers. Whereas, we modeled SAM interaction site prediction through overlapping sequence windows and ranking with multiple instance learning that allows handling imprecisely annotated SAM interaction sites. Through a series of simulation studies along with biological significant evaluation, we showed that our proposed models give a state-of-the-art performance for both SAM interaction and interaction site prediction. Through data mining in this study, we have also identified various characteristics of amino acid sub-sequences and their relative position to effectively locate interaction sites in a SAM interacting protein. Python code for training and evaluating our proposed models together with a webserver implementation as SIP (Sam Interaction Predictor) is available at the URL: https://sites.google.com/view/wajidarshad/software.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call