Abstract

BackgroundThe binding of transcription factors to DNA plays an essential role in the regulation of gene expression. Numerous experiments elucidated binding sequences which subsequently have been used to derive statistical models for predicting potential transcription factor binding sites (TFBS). The rapidly increasing number of genome sequence data requires sophisticated computational approaches to manage and query experimental and predicted TFBS data in the context of other epigenetic factors and across different organisms.ResultsWe have developed D-Light, a novel client-server software package to store and query large amounts of TFBS data for any number of genomes. Users can add small-scale data to the server database and query them in a large scale, genome-wide promoter context. The client is implemented in Java and provides simple graphical user interfaces and data visualization. Here we also performed a statistical analysis showing what a user can expect for certain parameter settings and we illustrate the usage of D-Light with the help of a microarray data set.ConclusionsD-Light is an easy to use software tool to integrate, store and query annotation data for promoters. A public D-Light server, the client and server software for local installation and the source code under GNU GPL license are available at http://biwww.che.sbg.ac.at/dlight.

Highlights

  • The binding of transcription factors to DNA plays an essential role in the regulation of gene expression

  • The specific transcription of genes is largely controlled by the interplay of transcription factors (TFs) attached to their specific binding sites (TFBSs)

  • We first show a statistical analysis of the promoters from the human genome regarding the distribution of predicted TFBS with respect to different parameters

Read more

Summary

Introduction

The binding of transcription factors to DNA plays an essential role in the regulation of gene expression. Numerous experiments elucidated binding sequences which subsequently have been used to derive statistical models for predicting potential transcription factor binding sites (TFBS). The rapidly increasing number of genome sequence data requires sophisticated computational approaches to manage and query experimental and predicted TFBS data in the context of other epigenetic factors and across different organisms. The specific transcription of genes is largely controlled by the interplay of transcription factors (TFs) attached to their specific binding sites (TFBSs). It is commonly accepted that for higher organisms the concurrent binding of two or more TFs is required to change the transcriptional state of a gene. A number of computational tools have been developed to process experimental data for subsequent prediction of potential TFBSs and affected pathways.

Objectives
Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.