Abstract
A need exists to develop bioinformatics for predicting differences in protein function, especially for members of a domain family who share a common fold, yet are found in a diverse array of proteins. Many domain families have been conserved over large evolutionary spans and representative genomic data during these periods are now available. This allows a simple method for grouping domain sequences to reveal common and unique/specific binding residues. As such, we hypothesize that sequence alignment analysis of the yeast SH3 domain family across ancestral species in the fungal kingdom can determine whether each member encodes specific information to bind unique peptide targets. With this approach, we identify important specific residues for a given domain as those that show little conservation within an alignment of yeast domain family members (paralogs) but are conserved in an alignment of its direct relatives (orthologs). We find most of the yeast SH3 domain family members have maintained unique amino acid conservation patterns that suggest they bind peptide targets with high intrinsic specificity through varying degrees of non-canonical recognition. For a minority of domains, we predict a less diverse binding surface, likely requiring additional factors to bind targets specifically. We observe that our predictions are consistent with high throughput binding data, which suggests our approach can probe intrinsic binding specificity in any other interaction domain family that is maintained during evolution.
Highlights
Signals are transmitted through cellular pathways via relays of protein-protein interactions resulting in specific outputs, such as cell growth, differentiation, or apoptosis
The 60 canonical SH3 domain residues are defined per numbering from a previous study [46] and for these positions the Specific Conservation (SC) value is calculated
The specific conservation (SC) value indicates for a position, how many times more “conserved” it is amongst the orthologs compared to the paralogs
Summary
Signals are transmitted through cellular pathways via relays of protein-protein interactions resulting in specific outputs, such as cell growth, differentiation, or apoptosis. To achieve the correct responses from signaling pathways, the protein-protein interactions involved must be specific, and not potentiate inappropriate activation of off-target pathways. This requisite precision can be readily achieved by proteins that possess high “intrinsic specificity”, directly binding their intended targets much more tightly than any other protein. For protein-DNA interactions, this can involve differences of three orders of magnitude or more in Kd value between target and non-target binding [1].
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have