Accelerate Literature Icon
Want to do a literature review? Try our new Literature Review workflow

Single-strand DNA processing: phylogenomics and sequence diversity of a superfamily of potential prokaryotic HuH endonucleases

  • TL;DR
  • Abstract
  • Highlights & Summary
  • PDF
  • Literature Map
  • Similar Papers
TL;DR

This study classifies the diverse TnpAY1 family of HuH endonucleases involved in single-stranded DNA processing across prokaryotes into three classes and 12 subclasses, revealing widespread distribution, domain variability, and potential roles in DNA editing and adaptive responses, supported by genomic and structural analyses.

Abstract
Translate article icon Translate Article Star icon

BackgroundSome mobile genetic elements target the lagging strand template during DNA replication. Bacterial examples are insertion sequences IS608 and ISDra2 (IS200/IS605 family members). They use obligatory single-stranded circular DNA intermediates for excision and insertion and encode a transposase, TnpAIS200, which recognizes subterminal secondary structures at the insertion sequence ends. Similar secondary structures, Repeated Extragenic Palindromes (REP), are present in many bacterial genomes. TnpAIS200-related proteins, TnpAREP, have been identified and could be responsible for REP sequence proliferation. These proteins share a conserved HuH/Tyrosine core domain responsible for catalysis and are involved in processes of ssDNA cleavage and ligation. Our goal is to characterize the diversity of these proteins collectively referred as the TnpAY1 family.ResultsA genome-wide analysis of sequences similar to TnpAIS200 and TnpAREP in prokaryotes revealed a large number of family members with a wide taxonomic distribution. These can be arranged into three distinct classes and 12 subclasses based on sequence similarity. One subclass includes sequences similar to TnpAIS200. Proteins from other subclasses are not associated with typical insertion sequence features. These are characterized by specific additional domains possibly involved in protein/DNA or protein/protein interactions. Their genes are found in more than 25% of species analyzed. They exhibit a patchy taxonomic distribution consistent with dissemination by horizontal gene transfers followed by loss. The tnpAREP genes of five subclasses are flanked by typical REP sequences in a REPtron-like arrangement. Four distinct REP types were characterized with a subclass specific distribution. Other subclasses are not associated with REP sequences but have a large conserved domain located in C-terminal end of their sequence. This unexpected diversity suggests that, while most likely involved in processing single-strand DNA, proteins from different subfamilies may play a number of different roles.ConclusionsWe established a detailed classification of TnpAY1 proteins, consolidated by the analysis of the conserved core domains and the characterization of additional domains. The data obtained illustrate the unexpected diversity of the TnpAY1 family and provide a strong framework for future evolutionary and functional studies. By their potential function in ssDNA editing, they may confer adaptive responses to host cell physiology and metabolism.

Similar Papers
  • PDF Download Icon
  • Research Article
  • Cite Count Icon 8
  • 10.1074/jbc.m109.072520
DNA Structure Specificity Conferred on a Replicative Helicase by Its Loader
  • Jan 1, 2010
  • Journal of Biological Chemistry
  • Milind K Gupta + 2 more

Prokaryotic and eukaryotic replicative helicases can translocate along single-stranded and double-stranded DNA, with the central cavity of these multimeric ring helicases being able to accommodate both forms of DNA. Translocation by such helicases along single-stranded DNA results in the unwinding of forked DNA by steric exclusion and appears critical in unwinding of parental strands at the replication fork, whereas translocation over double-stranded DNA has no well-defined role. We have found that the accessory factor, DnaC, that promotes loading of the Escherichia coli replicative helicase DnaB onto single-stranded DNA may also act to confer DNA structure specificity on DnaB helicase. When present in excess, DnaC inhibits DnaB translocation over double-stranded DNA but not over single-stranded DNA. Inhibition of DnaB translocation over double-stranded DNA requires the ATP-bound form of DnaC, and this inhibition is relieved during translocation over single-stranded DNA indicating that stimulation of DnaC ATPase is responsible for this DNA structure specificity. These findings demonstrate that DnaC may provide the DNA structure specificity lacking in DnaB, limiting DnaB translocation to bona fide replication forks. The ability of other replicative helicases to translocate along single-stranded and double-stranded DNA raises the possibility that analogous regulatory mechanisms exist in other organisms.

  • Research Article
  • Cite Count Icon 103
  • 10.1093/emboj/cdg246
A novel type of replicative enzyme harbouring ATPase, primase and DNA polymerase activity.
  • May 15, 2003
  • The EMBO Journal
  • G Lipps

Although DNA replication is a process common in all domains of life, primase and replicative DNA polymerase appear to have evolved independently in the bacterial domain versus the archaeal/eukaryal branch of life. Here, we report on a new type of replication protein that constitutes the first member of the DNA polymerase family E. The protein ORF904, encoded by the plasmid pRN1 from the thermoacidophile archaeon Sulfolobus islandicus, is a highly compact multifunctional enzyme with ATPase, primase and DNA polymerase activity. Recombinant purified ORF904 hydrolyses ATP in a DNA-dependent manner. Deoxynucleotides are preferentially used for the synthesis of primers approximately 8 nucleotides long. The DNA polymerase activity of ORF904 synthesizes replication products of up to several thousand nucleotides in length. The primase and DNA polymerase activity are located in the N-terminal half of the protein, which does not show homology to any known DNA polymerase or primase. ORF904 constitutes a new type of replication enzyme, which could have evolved independently from the eubacterial and archaeal/eukaryal proteins of DNA replication.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 32
  • 10.1186/s13100-018-0134-3
Insertion sequence elements-mediated structural variations in bacterial genomes
  • Aug 29, 2018
  • Mobile DNA
  • Etienne Nzabarushimana + 1 more

Mobile genetic elements (MGEs) impact the evolution and stability of their host genomes. Insertion sequence (IS) elements are the most common MGEs in bacterial genomes and play a crucial role in mediating large-scale variations in bacterial genomes. It is understood that IS elements and MGEs in general coexist in a dynamical equilibrium with their respective hosts. Current studies indicate that the spontaneous movement of IS elements does not follow a constant rate in different bacterial genomes. However, due to the paucity and sparsity of the data, these observations are yet to be conclusive. In this paper, we conducted a comparative analysis of the IS-mediated genome structural variations in ten mutation accumulation (MA) experiments across eight strains of five bacterial species containing IS elements, including four strains of the E. coli. We used GRASPER algorithm, a denovo structural variation (SV) identification algorithm designed to detect SVs involving repetitive sequences in the genome. We observed highly diverse rates of IS insertions and IS-mediated recombinations across different bacterial species as well as across different strains of the same bacterial species. We also observed different rates of the elements from the same IS family in different bacterial genomes, suggesting that the distinction in rates might not be due to the different composition of IS elements across bacterial genomes.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 40
  • 10.1371/journal.pcbi.1003680
Large-scale genomic analysis suggests a neutral punctuated dynamics of transposable elements in bacterial genomes.
  • Jun 26, 2014
  • PLoS Computational Biology
  • Jaime Iranzo + 3 more

Insertion sequences (IS) are the simplest and most abundant form of transposable DNA found in bacterial genomes. When present in multiple copies, it is thought that they can promote genomic plasticity and genetic exchange, thus being a major force of evolutionary change. The main processes that determine IS content in genomes are, though, a matter of debate. In this work, we take advantage of the large amount of genomic data currently available and study the abundance distributions of 33 IS families in 1811 bacterial chromosomes. This allows us to test simple models of IS dynamics and estimate their key parameters by means of a maximum likelihood approach. We evaluate the roles played by duplication, lateral gene transfer, deletion and purifying selection. We find that the observed IS abundances are compatible with a neutral scenario where IS proliferation is controlled by deletions instead of purifying selection. Even if there may be some cases driven by selection, neutral behavior dominates over large evolutionary scales. According to this view, IS and hosts tend to coexist in a dynamic equilibrium state for most of the time. Our approach also allows for a detection of recent IS expansions, and supports the hypothesis that rapid expansions constitute transient events—punctuations—during which the state of coexistence of IS and host becomes perturbated.

  • Research Article
  • Cite Count Icon 105
  • 10.1101/gr.089615.108
Inference of the impact of insertion sequence (IS) elements on bacterial genome diversification through analysis of small-size structural polymorphisms in Escherichia coli O157 genomes
  • Jun 29, 2009
  • Genome Research
  • Tadasuke Ooka + 7 more

Mobile genetic elements play important roles in the evolution and diversification of bacterial genomes. In enterohemorrhagic Escherichia coli O157, a major factor that affects genomic diversity is prophages, which generate most of the large-size structural polymorphisms (LSSPs) observed in O157 genomes. Here, we describe the results of a systematic analysis of numerous small-size structural polymorphisms (SSSPs) that were detected by comparing the genomes of eight clinical isolates with a sequenced strain, O157 Sakai. Most of the SSSPs were generated by genetic events associated with only two insertion sequence (IS) elements, IS629 and ISEc8, and a number of genes that were inactivated or deleted by these events were identified. Simple excisions of IS629 and small deletions (footprints) formed by the excision of IS629, both of which are rarely described in bacteria, were also detected. In addition, the distribution of IS elements was highly biased toward prophages, prophage-like integrative elements, and plasmids. Based on these and our previous results, we conclude that, in addition to prophages, these two IS elements are major contributors to the genomic diversification of O157 strains and that LSSPs have been generated mainly by bacteriophages and SSSPs by IS elements. We also suggest that IS elements possibly play a role in the inactivation and immobilization of incoming phages and plasmids. Taken together, our results reveal the true impact of IS elements on the diversification of bacterial genomes and highlight their novel role in genome evolution.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 16
  • 10.1074/jbc.m800856200
Catalysis of Strand Annealing by Replication Protein A Derives from Its Strand Melting Properties
  • Aug 1, 2008
  • Journal of Biological Chemistry
  • Jeremy D Bartos + 4 more

Eukaryotic DNA-binding protein replication protein A (RPA) has a strand melting property that assists polymerases and helicases in resolving DNA secondary structures. Curiously, previous results suggested that human RPA (hRPA) promotes undesirable recombination by facilitating annealing of flaps produced transiently during DNA replication; however, the mechanism was not understood. We designed a series of substrates, representing displaced DNA flaps generated during maturation of Okazaki fragments, to investigate the strand annealing properties of RPA. Until cleaved by FEN1 (flap endonuclease 1), such flaps can initiate homologous recombination. hRPA inhibited annealing of strands lacking secondary structure but promoted annealing of structured strands. Apparently, both processes primarily derive from the strand melting properties of hRPA. These properties slowed the spontaneous annealing of unstructured single strands, which occurred efficiently without hRPA. However, structured strands without hRPA displayed very slow spontaneous annealing because of stable intramolecular hydrogen bonding. hRPA appeared to transiently melt the single strands so that they could bind to form double strands. In this way, melting ironically promoted annealing. Time course measurements in the presence of hRPA suggest that structured single strands achieve an equilibrium with double strands, a consequence of RPA driving both annealing and melting. Promotion of annealing reached a maximum at a specific hRPA concentration, presumably when all structured single-stranded DNA was melted. Results suggest that displaced flaps with secondary structure formed during Okazaki fragment maturation can be melted by hRPA and subsequently annealed to a complementary ectopic DNA site, forming recombination intermediates that can lead to genomic instability.

  • Research Article
  • Cite Count Icon 22
  • 10.1074/jbc.m503244200
Gly-103 in the N-terminal Domain of Saccharomyces cerevisiae Rad51 Protein Is Critical for DNA Binding
  • Jul 1, 2005
  • Journal of Biological Chemistry
  • Xiao-Ping Zhang + 4 more

Rad51 is a homolog of the bacterial RecA protein and is central for recombination in eukaryotes performing homology search and DNA strand exchange. Rad51 and RecA share a core ATPase domain that is structurally similar to the ATPase domains of helicases and the F1 ATPase. Rad51 has an additional N-terminal domain, whereas RecA protein has an additional C-terminal domain. Here we show that glycine 103 in the N-terminal domain of Saccharomyces cerevisiae Rad51 is important for binding to single-stranded and duplex DNA. The Rad51-G103E mutant protein is deficient in DNA strand exchange and ATPase activity due to a primary DNA binding defect. The N-terminal domain of Rad51 is connected to the ATPase core through an extended elbow linker that ensures flexibility of the N-terminal domain. Molecular modeling of the Rad51-G103E mutant protein shows that the negatively charged glutamate residue lies on the surface of the N-terminal domain facing a positively charged patch composed of Arg-260, His-302, and Lys-305 on the ATPase core domain. A possible structural explanation for the DNA binding defect is that a charge interaction between Glu-103 and the positive patch restricts the flexibility of the N-terminal domain. Rad51-G103E was identified in a screen for Rad51 interaction-deficient mutants and was shown to ablate the Rad54 interaction in two-hybrid assays (Krejci, L., Damborsky, J., Thomsen, B., Duno, M., and Bendixen, C. (2001) Mol. Cell. Biol. 21, 966-976). Surprisingly, we found that the physical interaction of Rad51-G103E with Rad54 was not affected. Our data suggest that the two-hybrid interaction defect was an indirect consequence of the DNA binding defect.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 7
  • 10.3389/fmicb.2022.901762
Toxic/Bioactive Peptide Synthesis Genes Rearranged by Insertion Sequence Elements Among the Bloom-Forming Cyanobacteria Planktothrix.
  • Jul 28, 2022
  • Frontiers in microbiology
  • Elisabeth Entfellner + 6 more

It has been generally hypothesized that mobile elements can induce genomic rearrangements and influence the distribution and functionality of toxic/bioactive peptide synthesis pathways in microbes. In this study, we performed in depth genomic analysis by completing the genomes of 13 phylogenetically diverse strains of the bloom-forming freshwater cyanobacteria Planktothrix spp. to investigate the role of insertion sequence (IS) elements in seven pathways. Chromosome size varied from 4.7–4.8 Mbp (phylogenetic Lineage 1 of P. agardhii/P. rubescens thriving in shallow waterbodies) to 5.4–5.6 Mbp (Lineage 2 of P. agardhii/P. rubescens thriving in deeper physically stratified lakes and reservoirs) and 6.3–6.6 Mbp (Lineage 3, P. pseudagardhii/P. tepida including planktic and benthic ecotypes). Although the variation in chromosome size was positively related to the proportion of IS elements (1.1–3.7% on chromosome), quantitatively, IS elements and other paralogs only had a minor share in chromosome size variation. Thus, the major part of genomic variation must have resulted from gene loss processes (ancestor of Lineages 1 and 2) and horizontal gene transfer (HGT). Six of seven peptide synthesis gene clusters were found located on the chromosome and occurred already in the ancestor of P. agardhii/P. rubescens, and became partly lost during evolution of Lineage 1. In general, no increased IS element frequency in the vicinity of peptide synthesis gene clusters was observed. We found a higher proportion of IS elements in ten breaking regions related to chromosomal rearrangements and a tendency for colocalization of toxic/bioactive peptide synthesis gene clusters on the chromosome.

  • Research Article
  • Cite Count Icon 106
  • 10.1046/j.1365-2958.2000.02161.x
Evidence of recent lateral gene transfer among hyperthermophilic archaea.
  • Nov 1, 2000
  • Molecular Microbiology
  • Jocelyne Diruggiero + 8 more

A total of 153 nucleotide differences were found over a contiguous 16 kb region between two hyperthermophilic Archaea, Pyrococcus furiosus and Thermococcus litoralis. The 16 kb region in P. furiosus is flanked by insertion sequence (IS) elements with inverted and direct repeats. Both IS elements contain a single open reading frame (ORF) encoding a putative protein of 233 amino acids identified as a transposase. This 16 kb region has the features of a typical bacterial composite transposon and represents a possible mechanism for lateral gene transfer between Archaea or possibly between Archaea and Bacteria. A total of 23 homologous IS elements was found in the genome sequence of P. furiosus, whereas no full-length IS elements were identified in the genomes of Pyrococcus abyssi and Pyrococcus horikoshii. Only one IS element was found in T. litoralis. In P. furiosus and T. litoralis, the 16 kb region contains an ABC transport system for maltose and trehalose that was characterized biochemically for T. litoralis. Regulation of expression studies showed that the malE gene, located on the transposon, and the encoded trehalose/maltose-binding protein (TMBP) are induced in the presence of maltose and trehalose in both P. furiosus and T. litoralis. The implications of transposition as a mechanism for lateral gene transfer among Archaea are discussed.

  • Research Article
  • Cite Count Icon 16
  • 10.1016/j.chom.2024.03.005
A metagenomics pipeline reveals insertion sequence-driven evolution of the microbiota
  • Apr 1, 2024
  • Cell host & microbe
  • Joshua M Kirsch + 2 more

A metagenomics pipeline reveals insertion sequence-driven evolution of the microbiota

  • Research Article
  • 10.1101/2023.10.06.561241
A metagenomics pipeline reveals insertion sequence-driven evolution of the microbiota
  • Mar 9, 2024
  • bioRxiv
  • Joshua M Kirsch + 2 more

Insertion sequence (IS) elements are mobile genetic elements in bacterial genomes that support adaptation. We developed a database of IS elements coupled to a computational pipeline that identifies IS element insertions in the microbiota. We discovered that diverse IS elements insert into the genomes of intestinal bacteria regardless of human host lifestyle. These insertions target bacterial accessory genes that aid in their adaptation to unique environmental conditions. Using IS expansion in Bacteroides, we show that IS activity leads to insertion “hot spots” in accessory genes. We show that IS insertions are stable and can be transferred between humans. Extreme environmental perturbations force IS elements to fall out of the microbiota and many fail to rebound following homeostasis. Our work shows that IS elements drive bacterial genome diversification within the microbiota and establishes a framework for understanding how strain level variation within the microbiota impacts human health.

  • Book Chapter
  • Cite Count Icon 3
  • 10.1007/978-981-13-3411-5_8
Bacterial Transposable Elements and IS-Excision Enhancer (IEE)
  • Jan 1, 2019
  • Masahiro Kusumoto + 1 more

Insertion sequence (IS) elements are the simplest transposable elements (or mobile genetic elements) and are widely distributed in bacteria. The transposition and proliferation of IS elements induce not only insertional gene inactivation and modification of gene expression but also a wide range of genomic rearrangements, such as deletions, inversions, and duplications. IS-mediated bacterial genome diversification has been extensively studied in enterohemorrhagic Escherichia coli (EHEC). Excision of IS elements occurs frequently in O157, the major serogroup of EHEC isolated from humans, and IS-excision enhancer (IEE) promotes IS excision from the O157 genome in a transposase-dependent manner. IEE promotes the excision of IS elements belonging to several IS families, and various types of genomic deletions are also generated via IEE-promoted IS excision in O157. In addition, IEE has been found in specific lineages of enterotoxigenic E. coli (ETEC) strains isolated from swine, in which the iee genes are located on integrative elements that are similar to SpLE1 of EHEC O157. iee-positive ETEC lineages also contain multiple copies of IS elements at genomic locations that exhibit significant variations between strains, as observed in O157. These data and the phylogeny of IEE homologs found in a broad range of bacteria suggest that IEE has coevolved with IS elements and plays pivotal roles in bacterial genome evolution by inducing IS removal and genomic deletion.

  • Research Article
  • Cite Count Icon 23
  • 10.1016/j.ijantimicag.2018.09.020
Insertion sequences in the CRISPR-Cas system regulate horizontal antimicrobial resistance gene transfer in Shigella strains
  • Oct 2, 2018
  • International Journal of Antimicrobial Agents
  • Shuaiyin Chen + 7 more

Insertion sequences in the CRISPR-Cas system regulate horizontal antimicrobial resistance gene transfer in Shigella strains

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 19
  • 10.1074/jbc.m607749200
Structural Reorganization and the Cooperative Binding of Single-stranded Telomere DNA in Sterkiella nova
  • Dec 1, 2006
  • The Journal of biological chemistry
  • Pawel Buczek + 1 more

In Sterkiella nova, alpha and beta telomere proteins bind cooperatively with single-stranded DNA to form a ternary alpha.beta.DNA complex. Association of telomere protein subunits is DNA-dependent, and alpha-beta association enhances DNA affinity. To further understand the molecular basis for binding cooperativity, we characterized several possible stepwise assembly pathways using isothermal titration calorimetry. In one path, alpha and DNA first form a stable alpha.DNA complex followed by the addition of beta in a second step. Binding energy accumulates with nearly equal free energy of association for each of these steps. Heat capacity is nonetheless dramatically different, with DeltaCp = -305 +/- 3 cal mol(-1) K(-1) for alpha binding with DNA and DeltaCp = -2010 +/- 20 cal mol(-1) K(-1) for the addition of beta to complete the alpha.beta.DNA complex. By examining alternate routes including titration of single-stranded DNA with a preformed alpha.beta complex, a significant portion of binding energy and heat capacity could be assigned to structural reorganization involving protein-protein interactions and repositioning of the DNA. Structural reorganization probably affords a mechanism to regulate high affinity binding of telomere single-stranded DNA with important implications for telomere biology. Regulation of telomere complex dissociation is thought to involve post-translational modifications in the lysine-rich C-terminal portion of beta. We observed no difference in binding energetics or crystal structure when comparing complexes prepared with full-length beta or a C-terminally truncated form, supporting interesting parallels between the intrinsically disordered regions of histones and this portion of beta.

  • Research Article
  • Cite Count Icon 86
  • 10.1038/297159a0
Structural analysis of insertion sequence IS5.
  • May 1, 1982
  • Nature
  • Manfred Kröger + 1 more

Among the transposable DNA elements that have been detected in bacterial genomes and plasmids1–3, the small insertion sequence (IS) elements have been recognized as a class of autonomous, basic units of transposition. IS elements are not only fully competent of transposition and able to cause several other related chromosomal aberrations such as deletion of adjacent DNA segments4,5 and integration of circular plasmid DNAs6, but have also proved to be the active units of transposition in the larger composite transposon (Tn) elements1–3,7–9. In addition, IS and Tn elements interfere with local transcription patterns in the host genome by causing polarity 10,11. Structural and functional analysis of the IS elements should therefore elucidate both the basis for autonomy of a genetic unit as small as 1,000 base pairs (bp), and the transposition mechanism, including its inherent step of DNA duplication12. We have now determined the 1,195 bp sequence of the transposable DNA element IS5 in phage λKH100. Near-terminal signal structures appear to protect the central region from outside-in transcriptions and translations, and render IS polarity as a direct consequence of IS autonomy. The nucleotide structure shows an unusual, highly compact organization of two completely overlapping, antiparallel genes and correlated control sequences.

Save Icon
Up Arrow
Open/Close
Setting-up Chat
Loading Interface