Eukaryotic retrotransposons encode a reverse transcriptase that binds RNA to template DNA synthesis. The ancestral non-long terminal repeat (non-LTR) retrotransposons encode a protein that performs target-primed reverse transcription (TPRT), in which the nicked genomic target site initiates complementary DNA (cDNA)synthesis directly into the genome. The best understood model system for biochemical studies of TPRT is the R2 protein from the silk moth Bombyx mori. The R2 protein selectively binds the 3' untranslated region of its encoding RNA as template for DNA insertion to its target site in 28S ribosomal DNA. Here, binding and TPRT assays define RNA contributions to RNA-protein interaction, template use for TPRT and the fidelity of template positioning for TPRT cDNA synthesis. We quantify both sequence and structure contributions to protein-RNA interaction. RNA determinants of binding affinity overlap but are not equivalent to RNA features required for TPRT and its fidelity of template positioning for full-length TPRT cDNA synthesis. Additionally, we show that a previously implicated RNA-binding protein surface of R2 protein makes RNA binding affinity dependent on the presence of two stem-loops. Our findings inform evolutionary relationships across R2 retrotransposon RNAs and are a step toward understanding the mechanism and template specificity of non-LTR retrotransposon mobility.
Read full abstract