Exploring variable-length features (motifs) for predicting binding sites through interpretable deep neural networks

Chandra Mohan Dasari,Santhosh Amilpur,Raju Bhukya

doi:10.1016/j.engappai.2021.104485

Abstract

Transcription factor binding sites (TFBS) and RNA-binding proteins (RBP) plays a key role in gene regulation, transcription, RNA editing. Identifying and locating these potential sites is essential for detecting pathogenic variation in many biological processes. Some portions of binding sites are recognized by biological experiments that are time-intensive and expensive. Many computational approaches are considered as possible alternative solutions and few deep learning methods are recently developed for fast and accurate prediction of binding sites. Although existing approaches achieve competent performance, many methods requires specialized feature set and moreover interpretability remains challenging. To overcome these problems, we propose an interpretable deep learning technique called protein binding variable pattern predictor (PBVPP), which uses a wide variety of experimental data and performance metrics to predict binding sites. The novelty of our proposed method is based on three key factors: (i) PBVPP along with its variant has the capability to extract vital features from large-scale genomic sequences obtained by high throughput technology to predict the occurrence of TFBS and RBP sites. (ii) The proposed interpretable model reveals how to mine vital features, and also extract variable length patterns for accurate prediction of binding sites. (iii) The obtained motifs are validated against the TFBSshape DNA (JASPAR) database’s known target motifs. The proposed model has shown an improvement of 5.88%, 5.01% over state-of-the-art methods in terms of receiver operating curve for TFBS, RBP and also shown tremendous improvement of 60% in precision recall curve for TFBS prediction.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Exploring variable-length features (motifs) for predicting binding sites through interpretable deep neural networks

Abstract

Talk to us

Similar Papers

More From: Engineering Applications of Artificial Intelligence

Lead the way for us

Journal: Engineering Applications of Artificial Intelligence	Publication Date: Oct 7, 2021
Citations: 11

Similar Papers

A Review About RNA–Protein-Binding Sites Prediction Based on Deep Learning
Jianrong Yan ... Min Zhu
IEEE Access | VOL. 8
Jianrong Yan, et. al.Jianrong Yan ... Min Zhu
01 Jan 2020
IEEE Access | VOL. 8

PAR-CliP - A Method to Identify Transcriptome-wide the Binding Sites of RNA Binding Proteins
Markus Hafner ... Jean Hausser
Journal of Visualized Experiments | VOL. 8
Markus Hafner, et. al.Markus Hafner ... Jean Hausser
02 Jul 2010
Journal of Visualized Experiments | VOL. 8

PAR-CliP - A Method to Identify Transcriptome-wide the Binding Sites of RNA Binding Proteins
Alexander Ulrich ... Thomas Tuschl
Journal of Visualized Experiments | VOL. -
Alexander Ulrich, et. al.Alexander Ulrich ... Thomas Tuschl
02 Jul 2010
Journal of Visualized Experiments | VOL. -

Computational Prediction of RNA-Binding Proteins and Binding Sites.
Jingna Si ... Rongling Wu
International Journal of Molecular Sciences | VOL. 16
Jingna Si, et. al.Jingna Si ... Rongling Wu
03 Nov 2015
International Journal of Molecular Sciences | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Exploring variable-length features (motifs) for predicting binding sites through interpretable deep neural networks

Abstract

Talk to us

Similar Papers

More From: Engineering Applications of Artificial Intelligence