Abstract

The env gene of human immunodeficiency virus (HIV) is a functionally important gene responsible for the production of protein products (gp120 and gp41) involved in host cell recognition, binding, and entry. This occurs through a complex and, as yet, not fully understood process of protein-protein interaction and within and between protein functional communication. Exposure on the surface of active HIV virions means the gp120-gp41 complexes are subjected to intense immune system pressure and have, therefore, evolved mechanisms to avoid neutralization. Using protein-coding sequences representing all the HIV type-1 (HIV-1) group M subtypes, we have identified amino acids within the env gene whose evolution is inextricably linked over the entire HIV-1 group M epidemic. We identified 848 pairs of coevolving residues (involving 263 out of 764 amino acid sites), which represent 0.29% of all possible pairs. Of the coevolving pairs, 68% were significantly correlated by hydrophobicity, molecular weight, or both hydrophobicity and molecular weight. Subsequent grouping of coevolving pairs resulted in the identification of 290 groups of amino acid residues, with the size of these groups ranging from 2 to 10 amino acid residues. Many of these dependencies are correlated by function including CD4 binding, coreceptor binding, glycosylation, and protein-protein interaction. This analysis provides important information regarding the functional dependencies observed within all the HIV-1 group M subtypes and may assist in the identification of functional protein domains and therapeutic targets within the HIV-1 env gene.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call