An Integrative Framework for Combining Sequence and Epigenomic Data to Predict Transcription Factor Binding Sites Using Deep Learning

Fang Jing,Zhen Cao,Shao-Wu Zhang,Shihua Zhang

doi:10.1109/tcbb.2019.2901789

Abstract

Knowing the transcription factor binding sites (TFBSs) is essential for modeling the underlying binding mechanisms and follow-up cellular functions. Convolutional neural networks (CNNs) have outperformed methods in predicting TFBSs from the primary DNA sequence. In addition to DNA sequences, histone modifications and chromatin accessibility are also important factors influencing their activity. They have been explored to predict TFBSs recently. However, current methods rarely take into account histone modifications and chromatin accessibility using CNN in an integrative framework. To this end, we developed a general CNN model to integrate these data for predicting TFBSs. We systematically benchmarked a series of architecture variants by changing network structure in terms of width and depth, and explored the effects of sample length at flanking regions. We evaluated the performance of the three types of data and their combinations using 256 ChIP-seq experiments and also compared it with competing machine learning methods. We find that contributions from these three types of data are complementary to each other. Moreover, the integrative CNN framework is superior to traditional machine learning methods with significant improvements.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Integrative Framework for Combining Sequence and Epigenomic Data to Predict Transcription Factor Binding Sites Using Deep Learning

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Computational Biology and Bioinformatics

Lead the way for us

Journal: IEEE/ACM Transactions on Computational Biology and Bioinformatics	Publication Date: Jan 1, 2021
Citations: 53

Similar Papers

Combining Sequence and Epigenomic Data to Predict Transcription Factor Binding Sites Using Deep Learning
Fang Jing ... Zhen Cao
-
Fang Jing, et. al.Fang Jing ... Zhen Cao
01 Jan 2018
01 Jan 2018

Deep convolutional neural networks for predicting leukemia-related transcription factor binding sites from DNA sequence data
Jian He ... Yanzhi Guo
Chemometrics and Intelligent Laboratory Systems | VOL. 199
Jian He, et. al.Jian He ... Yanzhi Guo
18 Feb 2020
Chemometrics and Intelligent Laboratory Systems | VOL. 199

MaxATAC: Genome-scale transcription-factor binding prediction from ATAC-seq with deep neural networks.
Tareian A Cazares ... Teresa M Przytycka
PLOS Computational Biology | VOL. 19
Tareian A Cazares, et. al.Tareian A Cazares ... Teresa M Przytycka
31 Jan 2023
PLOS Computational Biology | VOL. 19

DeepD2V: A Novel Deep Learning-Based Framework for Predicting Transcription Factor Binding Sites from Combined DNA Sequence.
Lei Deng ... Hui Wu
International journal of molecular sciences | VOL. 22
Lei Deng, et. al.Lei Deng ... Hui Wu
24 May 2021
International journal of molecular sciences | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Integrative Framework for Combining Sequence and Epigenomic Data to Predict Transcription Factor Binding Sites Using Deep Learning

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Computational Biology and Bioinformatics