SAResNet: self-attention residual network for predicting DNA-protein binding.

Long-Chen Shen,Yan Liu,Dong-Jun Yu,Jiangning Song

doi:10.1093/bib/bbab101

Abstract

Knowledge of the specificity of DNA-protein binding is crucial for understanding the mechanisms of gene expression, regulation and gene therapy. In recent years, deep-learning-based methods for predicting DNA-protein binding from sequence data have achieved significant success. Nevertheless, the current state-of-the-art computational methods have some drawbacks associated with the use of limited datasets with insufficient experimental data. To address this, we propose a novel transfer learning-based method, termed SAResNet, which combines the self-attention mechanism and residual network structure. More specifically, the attention-driven module captures the position information of the sequence, while the residual network structure guarantees that the high-level features of the binding site can be extracted. Meanwhile, the pre-training strategy used by SAResNet improves the learning ability of the network and accelerates the convergence speed of the network during transfer learning. The performance of SAResNet is extensively tested on 690 datasets from the ChIP-seq experiments with an average AUC of 92.0%, which is 4.4% higher than that of the best state-of-the-art method currently available. When tested on smaller datasets, the predictive performance is more clearly improved. Overall, we demonstrate that the superior performance of DNA-protein binding prediction on DNA sequences can be achieved by combining the attention mechanism and residual structure, and a novel pipeline is accordingly developed. The proposed methodology is generally applicable and can be used to address any other sequence classification problems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SAResNet: self-attention residual network for predicting DNA-protein binding.

Abstract

Talk to us

Similar Papers

More From: Briefings in bioinformatics

Lead the way for us

Journal: Briefings in bioinformatics	Publication Date: Apr 9, 2021
Citations: 34

Similar Papers

A Macaque Brain Extraction Model Based on U-Net Combined with Residual Structure.
Qianshan Wang ... Saddam Naji Abdu Nasher
Brain Sciences | VOL. 12
Qianshan Wang, et. al.Qianshan Wang ... Saddam Naji Abdu Nasher
12 Feb 2022
Brain Sciences | VOL. 12

Multi-scale Visual Aggregation Residual Network for Super-Resolution
Boxiang Xue ... Zhenghua Zhou
-
Boxiang Xue, et. al.Boxiang Xue ... Zhenghua Zhou
26 Jul 2022
26 Jul 2022

Athlete Training Sensor Data Detection Research Based on Optimized Convolutional Neural Network
Qiang Qian ... Yuanyuan Gao
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. -
Qiang Qian, et. al.Qiang Qian ... Yuanyuan Gao
26 Jun 2023
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. -

Sports Risk Analysis Based on Knowledge Discovery and Data Driven
Jinling Zheng ... Chunyan Fan
Security and Communication Networks | VOL. 2022
Jinling Zheng, et. al.Jinling Zheng ... Chunyan Fan
27 May 2022
Security and Communication Networks | VOL. 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SAResNet: self-attention residual network for predicting DNA-protein binding.

Abstract

Talk to us

Similar Papers

More From: Briefings in bioinformatics