Harnessing DNA as a high-density storage medium for information storage and molecular recording of signals has been of increasing interest in the biotechnology field. Recently, progress in enzymatic DNA synthesis, DNA digital data storage, and DNA-based molecular recording has been made by leveraging the activity of the template-independent DNA polymerase, terminal deoxynucleotidyl transferase (TdT). TdT adds deoxyribonucleotides to the 3' end of single-stranded DNA, generating random sequences of single-stranded DNA. TdT can use several divalent cations for its enzymatic activity and exhibits shifts in deoxyribonucleotide incorporation frequencies in response to changes in its reaction environment. However, there is limited understanding of sequence-structure-function relationships regarding these properties, which in turn limits our ability to modulate TdT to further advance TdT-based tools. Most TdT literature to-date explores the activity of murine, bovine or human TdTs; studies probing TdT sequence and structure largely focus on strictly conserved residues that are functionally critical to TdT activity. Here, we explore non-conserved TdT sequence space by surveying the natural diversity of TdT. We characterize a diverse set of TdT homologs from different organisms and identify several TdT residues/regions that confer differences in TdT behavior between homologs. The observations in this study can design rules for targeted TdT libraries, in tandem with a screening assay, to modulate TdT properties. Moreover, the data can be useful in guiding further studies of potential residues of interest. Overall, we characterize TdTs that have not been previously studied in the literature, and we provide new insights into TdT sequence-function relationships.
Read full abstract