Abstract

One of the first steps towards elucidating the biological function of a putative transcriptional regulator is to ascertain its preferred DNA-binding sequences. This may be rapidly and effectively achieved through the application of a combinatorial approach, one involving very large numbers of randomized oligonucleotides and reiterative selection and amplification steps to enrich for high-affinity nucleic acid-binding sequences. Previously, we had developed the novel combinatorial approach Restriction Endonuclease Protection, Selection and Amplification (REPSA), which relies not on the physical separation of ligand-nucleic acid complexes but instead selects on the basis of ligand-dependent inhibition of enzymatic template inactivation, specifically cleavage by type IIS restriction endonucleases. Thus, no prior knowledge of the ligand is required for REPSA, making it more amenable for discovery purposes. Here we describe using REPSA, massively parallel sequencing, and bioinformatics to identify the preferred DNA-binding sites for the transcriptional regulator SbtR, encoded by the TTHA0167 gene from the model extreme thermophile Thermus thermophilus HB8. From the resulting position weight matrix, we can identify multiple operons potentially regulated by SbtR and postulate a biological role for this protein in regulating extracellular transport processes. Our study provides a proof-of-concept for the application of REPSA for the identification of preferred DNA-binding sites for orphan transcriptional regulators and a first step towards determining their possible biological roles.

Highlights

  • Genome projects have yielded considerable information since the sequencing of the first whole microorganism genome, Haemophilus influenza, in 1995 [1,2]

  • We describe the application of REPSA to determine the preferred DNA-binding sequences for the T. thermophilus HB8 transcription regulatory protein encoded by the TTHA0167 gene, SbtR

  • This study provides a proof-of-concept that REPSA may be used to initiate our understanding of transcription regulatory proteins through the definition of their preferred DNA binding sequences, thereby leading to postulates of their potential biological functions

Read more

Summary

Introduction

Genome projects have yielded considerable information since the sequencing of the first whole microorganism genome, Haemophilus influenza, in 1995 [1,2]. Of these, detailed DNA binding information (e.g., position-specific scoring matrices or sequence logos) is available for less than half [20] This is even more apparent for a less well-characterized organism such as T. thermophilus, where of its 2173 identified protein-coding genes, only ~70 are predicted to be transcription factors and detailed DNA binding information is only available for a handful [21,22,23,24,25,26,27,28,29,30]. Over 10k selected DNAs were sequenced and a 14-mer sequence logo with extremely high significance E = 1.7 x 10−2443 was determined Mapping these sequences to the T. thermophilus HB8 genome identified several promoter regions that should bind SbtR, these directing expression of genes encoding proteins with presumed transport, nucleic acid modification, and regulatory functions. This study provides a proof-of-concept that REPSA may be used to initiate our understanding of transcription regulatory proteins through the definition of their preferred DNA binding sequences, thereby leading to postulates of their potential biological functions

Results
59 Desalt SbtR mutant 7 DNA probe precursor
Discussion
TTHA0733 Transcriptional regulator MarR family
Materials and Methods
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call