Abstract

Deep Learning techniques (DL) significantly improved the accuracy of predictions and classifications of deoxyribonucleic acid (DNA). On the other hand, identifying and predicting splice sites in eukaryotes is difficult due to many erroneous discoveries. To address this issue, we propose a deep learning model for recognizing and anticipating splice sites in eukaryotic DNA sequences based on a bidirectional Long Short-Term Memory (LSTM) Recurrent Neural Network (RNN) and Gated recurrent unit (GRU). The non-coding introns of the gene are spliced out, and the coding exons are joined during the splicing of the original mRNA transcript. This bidirectional LSTM-RNN-GRU model incorporates intron features in order of their length constraints, beginning with splice site donor (GT) and ending with splice site acceptor (AG). The performance of the model improves as the number of training epochs grows. The best level of accuracy for this model is 96.1 percent.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call