Abstract

BackgroundFrom initial seed germination through reproduction, plants continuously reprogram their transcriptional repertoire to facilitate growth and development. This dynamic is mediated by a diverse but inextricably-linked catalog of regulatory proteins called transcription factors (TFs). Statistically quantifying TF binding site (TFBS) abundance in promoters of differentially expressed genes can be used to identify binding site patterns in promoters that are closely related to stress-response. Output from today’s transcriptomic assays necessitates statistically-oriented software to handle large promoter-sequence sets in a computationally tractable fashion.ResultsWe present Marina, an open-source software for identifying over-represented TFBSs from amongst large sets of promoter sequences, using an ensemble of 7 statistical metrics and binding-site profiles. Through software comparison, we show that Marina can identify considerably more over-represented plant TFBSs compared to a popular software alternative.ConclusionsMarina was used to identify over-represented TFBSs in a two time-point RNA-Seq study exploring the transcriptomic interplay between soybean (Glycine max) and soybean rust (Phakopsora pachyrhizi). Marina identified numerous abundant TFBSs recognized by transcription factors that are associated with defense-response such as WRKY, HY5 and MYB2. Comparing results from Marina to that of a popular software alternative suggests that regardless of the number of promoter-sequences, Marina is able to identify significantly more over-represented TFBSs.

Highlights

  • From initial seed germination through reproduction, plants continuously reprogram their transcriptional repertoire to facilitate growth and development

  • In order for the plant to respond to these signals, plants must utilize regulatory proteins known as transcription factors (TFs) to facilitate transcriptional reprogramming in a dynamic, tissue-dependent manner

  • We show its capability to identify overrepresented TF binding site (TFBS) regardless of input size

Read more

Summary

Results

We present Marina, an open-source software for identifying over-represented TFBSs from amongst large sets of promoter sequences, using an ensemble of 7 statistical metrics and binding-site profiles. We show that Marina can identify considerably more over-represented plant TFBSs compared to a popular software alternative

Conclusions
Background
Results and discussion
20. Horspool NR
28. Jaccard P
32. Cramer H
34. Mosteller F
41. Eulgem T
46. Dong X
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call