HIV-1 RNA dimerization is a critical step in viral life cycle. It is a prerequisite for genome packaging and plays an important role in reverse transcription and recombination. Dimerization is promoted by the DIS (dimerization initiation site) hairpin located in the 5′ leader of HIV-1 genome. Despite the high genetic diversity in HIV-1 group M, only five apical loops (AAGCGCGCA, AAGUGCGCA, AAGUGCACA, AGGUGCACA and AGUGCAC) are commonly found in DIS hairpins. We refer to the parent DISes with these apical loops as DISLai, DISTrans, DISF, DISMal, and DISC, respectively. Based on identity or similarity of DIS hairpins to parent DISes, we distributed HIV-1 M genomes into five dimerization groups. Comparison of the primary and secondary structures of DIS, SD and Psi hairpins in about 3000 HIV-1 M genomes showed that the mutation frequencies at particular nucleotide positions of these hairpins differ among the dimerization groups, and DISF may be an origin of other parent DISes. We found that DIS, SD and Psi hairpins have hundreds of variants, only some of them occurring rather frequently. The lower part of DIS hairpin with G x AGG internal loop is highly conserved in both HIV-1 and SIV genomes. We supposed that the G-quadruplex, located 56 nts downstream of the Gag start codon, may participate in switching of HIV-1 leader RNA from BMH (branched multiple hairpins) to LDI (long distance interaction) conformation.
Read full abstract