Abstract

Human genome sequencing has resulted in a great body of data, including a stunningly large number of single nucleotide polymorphisms (SNPs) with unknown phenotypic manifestations. Identification and comprehensive analysis of regulatory SNPs in human gene promoters will help quantify the effects of these SNPs on human health. Based on our experimental and computer-aided study of SNPs in TATA boxes and the use of literature data, we have derived an equation for TBP/TATA equilibrium binding in three successive steps: TATA-binding protein (TBP) sliding along DNA due to their nonspecific affinity for each other ↔ recognition of the TATA box ↔ stabilization of the TBP/TATA complex. Using this equation, we have analyzed TATA boxes containing SNPs associated with human diseases and made in silico predictions of changes in TBP/TATA affinity. An electrophoretic mobility shift assay (EMSA)-based experimental study performed under the most standardized conditions demonstrates that the experimentally measured values are highly correlated with the predicted values: the coefficient of linear correlation, r, was 0.822 at a significance level of α<10−7 for equilibrium K D values, (-ln K D), and 0.785 at a significance level of α<10−3 for changes in equilibrium K D (δ) due to SNPs in the TATA boxes (). It has been demonstrated that the SNPs associated with increased risk of human diseases such as α-, β- and δ-thalassemia, myocardial infarction and thrombophlebitis, changes in immune response, amyotrophic lateral sclerosis, lung cancer and hemophilia B Leyden cause 2–4-fold changes in TBP/TATA affinity in most cases. The results obtained strongly suggest that the TBP/TATA equilibrium binding equation derived can be used for analysis of TATA-box sequences and identification of SNPs with a potential of being functionally important.

Highlights

  • Single nucleotide polymorphisms (SNPs) represent the commonest type of genetic variation in man

  • The ODNs are identical to TATA boxes with the flanking regions of the promoters of the genes being studied in healthy individuals and patients with single nucleotide polymorphisms (SNPs) in the TATA boxes in the promoters of the genes being studied

  • Most commonly affected is the synthesis of a- and b-globin chains, which corresponds to a- and b-thalassemia, respectively

Read more

Summary

Introduction

Single nucleotide polymorphisms (SNPs) represent the commonest type of genetic variation in man. Most of more than 15 million annotated polymorphisms are located in DNA coding regions, which makes the mechanism of their action on the phenotype perfectly clear: a certain protein will be deficient [1]. Polymorphisms in regulatory regions and the effects of these polymorphisms on gene expression remain to be much less well studied. Many of non-coding SNPs lie within regulatory DNA motifs, altering their affinity for transcription factors and altering the expression levels of genes cis-targeted by those motifs, which accounts for differences in responses to external and internal signals, propensity to certain diseases, and sensitivity to therapy, to mention a few. The core promoter is the term for the DNA region spanning ,100 nucleotides to the left (in the 59-region) and to the right (in the 39-region) from the transcription start site, with a variable number of regulatory regions [2] such as the TATA box, BRE (TFIIB recognition element), Inr (initiator element), MTE (motif ten element), DPE (downstream promoter element), DCE (downstream core element), and XCPE1 (X core promoter element 1) [3] and others

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call