Abstract

BackgroundThe understanding of the mechanisms of transcriptional regulation remains a challenge for molecular biologists in the post-genome era. It is hypothesized that the regulatory regions of genes expressed in the same tissue or cell type share a similar structure. Though several studies have analyzed the promoters of genes expressed in specific metazoan tissues or cells, little research has been done in plants. Hence finding specific patterns of motifs to explain the promoter architecture of co-expressed genes in plants could shed light on their transcription mechanism.ResultsWe identified novel patterns of sets of motifs in promoters of genes co-expressed in four different plant structures (PSs) and in the entire plant in Arabidopsis thaliana. Sets of genes expressed in four PSs (flower, seed, root, shoot) and housekeeping genes expressed in the entire plant were taken from a database of co-expressed genes in A. thaliana. PS-specific motifs were predicted using three motif-discovery algorithms, 8 of which are novel, to the best of our knowledge. A support vector machine was trained using the average upstream distance of the identified motifs from the translation start site on both strands of binding sites. The correctly classified promoters per PS were used to construct specific patterns of sets of motifs to describe the promoter architecture of those co-expressed genes. The discovered PS-specific patterns were tested in the entire A. thaliana genome, correctly identifying 77.8%, 81.2%, 70.8% and 53.7% genes expressed in petal differentiation, synergid cells, root hair and trichome, as well as 88.4% housekeeping genes.ConclusionsWe present five patterns of sets of motifs which describe the promoter architecture of co-expressed genes in five PSs with the ability to predict them from the entire A. thaliana genome. Based on these findings, we conclude that the positioning and orientation of transcription factor binding sites at specific distances from the translation start site is a reliable measure to differentiate promoters of genes expressed in different A. thaliana structures from background genomic promoters. Our method can be used to predict novel motifs and decipher a similar promoter architecture for genes co-expressed in A. thaliana under different conditions.

Highlights

  • The understanding of the mechanisms of transcriptional regulation remains a challenge for molecular biologists in the post-genome era

  • We have worked with promoter sets of genes expressed in four different A. thaliana structures and in the whole plant

  • Five distinct patterns of sets of PSspecific motifs able to describe the promoter region of co-expressed genes were built and shown to be useful in predicting genes expressed in specific biological processes from the entire A. thaliana genome

Read more

Summary

Introduction

The understanding of the mechanisms of transcriptional regulation remains a challenge for molecular biologists in the post-genome era. Regarding the dependency among TFBSs and the hypothesis that genes showing similar expression profiles could share common structural characteristics in their regulatory regions, Vandenbon et al proposed a simple Markov chain-based promoter architecture model as an alternative to the CRM approaches. Their method included characteristics such as orientation, position with respect to the translation start site (TLS) and order of predicted occurrences of over-represented motifs [11]. They did not focus on the combination of their predicted motifs to identify patterns of sets of motifs within promoters of co-expressed genes in specific structures of this organism

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.