Abstract

Transcription factors are proteins involved in converting DNA to RNA by binding to specific regions of DNA. Many computational methods developed for predicting transcription factor binding sites in DNA are either tissue-specific or species-specific methods, so cannot be used without prior knowledge of tissue or species. Some prediction methods are limited to short DNA sequences only, so cannot be used to find potential transcription factor binding sites in long DNA sequences. In this study, we developed a new method that predicts transcription factor binding sites in DNA sequences of any length without prior knowledge of tissue or species. In independent testing with datasets that were not used in training the method, it achieved reasonably good performances (accuracy of 81.84 % and MCC of 0.634 in one testing, and accuracy of 71.16 % and MCC of 0.403 in another testing). Our method will be useful for finding putative transcription factor binding sites in the absence of prior knowledge of tissue or species.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call