Abstract

Proteins adapt to novel environments and/or gain function by substitution in amino acid sequences. Therefore, mutations in protein-coding genes are subject to selection pressure. The strength and character of selection pressure may vary among the regions of the protein. Thus, the spatial distribution of selection pressure provides information on the adaptive evolution of the protein. We developed a hierarchical Bayesian model that detects the spatial distribution of selection pressure on a protein. We expressed selection pressure by the substitution rate ratio of nonsynonymous to synonymous substitutions in the DNA sequence. The Potts model describes the prior distribution of spatial aggregation of selection pressure. The hyperparameters that define the strength and range of spatial clustering are estimated by maximizing the marginal likelihood. Because our prior distribution is un-normalized, we calculated the log marginal likelihood by "thermodynamic integration." We applied the method to historical data on the influenza hemagglutinin protein, comparing the estimated spatial distribution of the substitution rate ratio with that of antigenic sites A-E. The amino acid residues with higher substitution rate ratios, representing diversifying selection pressure, overlapped the antigenic sites.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.