IFF: Identifying key residues in intrinsically disordered regions of proteins using machine learning.

Wen-Lin Ho,Hsuan-Cheng Huang,Jie-Rong Huang

doi:10.1002/pro.4739

Abstract

Conserved residues in protein homolog sequence alignments are structurally or functionally important. For intrinsically disordered proteins or proteins with intrinsically disordered regions (IDRs), however, alignment often fails because they lack a steric structure to constrain evolution. Although sequences vary, the physicochemical features of IDRs may be preserved in maintaining function. Therefore, a method to retrieve common IDR features may help identify functionally important residues. We applied unsupervised contrastive learning to train a model with self-attention neuronal networks on human IDR orthologs. Parameters in the model were trained to match sequences in ortholog pairs but not in other IDRs. The trained model successfully identifies previously reported critical residues from experimental studies, especially those with an overall pattern (e.g., multiple aromatic residues or charged blocks) rather than short motifs. This predictive model can be used to identify potentially important residues in other proteins, improving our understanding of their functions. The trained model can be run directly from the Jupyter Notebook in the GitHub repository using Binder (mybinder.org). The only required input is the primary sequence. The training scripts are available on GitHub (https://github.com/allmwh/IFF). The training datasets have been deposited in an Open Science Framework repository (https://osf.io/jk29b).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Protein science : a publication of the Protein Society	Publication Date: Aug 22, 2023
Citations: 1	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

IFF: Identifying key residues in intrinsically disordered regions of proteins using machine learning.

Abstract

Talk to us

Similar Papers

More From: Protein science : a publication of the Protein Society

Lead the way for us

Similar Papers

Co-Evolution of Intrinsically Disordered Proteins with Folded Partners Witnessed by Evolutionary Couplings.
Rita Pancsa ... Fruzsina Zsolyomi
International Journal of Molecular Sciences | VOL. 19
Rita Pancsa, et. al.Rita Pancsa ... Fruzsina Zsolyomi
25 Oct 2018
International Journal of Molecular Sciences | VOL. 19

IDRMutPred: predicting disease-associated germline nonsynonymous single nucleotide variants (nsSNVs) in intrinsically disordered regions.
Jing-Bo Zhou ... Yao Xiong
Bioinformatics (Oxford, England) | VOL. 36
Jing-Bo Zhou, et. al.Jing-Bo Zhou ... Yao Xiong
05 Aug 2020
Bioinformatics (Oxford, England) | VOL. 36

IDPs: Less Disordered and More Ordered than Expected
Robert Konrat
Biophysical Journal | VOL. 109
Robert KonratRobert Konrat
01 Oct 2015
Biophysical Journal | VOL. 109

Identification of potential short linear motifs (SLiMs) in intrinsically disordered sequences of proteins by fast time-scale backbone dynamics
Snigdha Maiti ... Soumya De
Journal of Magnetic Resonance Open | VOL. 10-11
Snigdha Maiti, et. al.Snigdha Maiti ... Soumya De
20 Dec 2021
Journal of Magnetic Resonance Open | VOL. 10-11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

IFF: Identifying key residues in intrinsically disordered regions of proteins using machine learning.

Abstract

Talk to us

Similar Papers

More From: Protein science : a publication of the Protein Society