FactorNet: A deep learning framework for predicting cell type specific transcription factor binding from nucleotide-resolution sequential data.

Daniel Quang,Xiaohui Xie

doi:10.1016/j.ymeth.2019.03.020

Daniel Quang, Xiaohui Xie

Open Access

https://doi.org/10.1016/j.ymeth.2019.03.020

Copy DOI

Journal: Methods	Publication Date: Mar 26, 2019
Citations: 159	License type: cc-by

Affiliation: University of California, Irvine

Abstract

Due to the large numbers of transcription factors (TFs) and cell types, querying binding profiles of all valid TF/cell type pairs is not experimentally feasible. To address this issue, we developed a convolutional-recurrent neural network model, called FactorNet, to computationally impute the missing binding data. FactorNet trains on binding data from reference cell types to make predictions on testing cell types by leveraging a variety of features, including genomic sequences, genome annotations, gene expression, and signal data, such as DNase I cleavage. FactorNet implements several convenient strategies to reduce runtime and memory consumption. By visualizing the neural network models, we can interpret how the model predicts binding. We also investigate the variables that affect cross-cell type accuracy, and offer suggestions to improve upon this field. Our method ranked among the top teams in the ENCODE-DREAM in vivo Transcription Factor Binding Site Prediction Challenge, achieving first place on six of the 13 final round evaluation TF/cell type pairs, the most of any competing team. The FactorNet source code is publicly available, allowing users to reproduce our methodology from the ENCODE-DREAM Challenge.

Highlights

Due to the large numbers of transcription factors (TFs) and cell types, querying binding profiles of all TF/cell type pairs is not experimentally feasible, owing to constraints in time and resources
We investigate the variables that affect cross-cell type predictive performance to explain why the model performs better on some TF/cell types than others, and offer insights to improve upon this field
Final rankings in the Challenge are based on performances over 13 TF/cell type pairs

Summary

Introduction

Due to the large numbers of transcription factors (TFs) and cell types, querying binding profiles of all TF/cell type pairs is not experimentally feasible, owing to constraints in time and resources. With FactorNet, a researcher can perform a single sequencing assay, such as DNase-seq, on a cell type and computationally impute dozens of TF binding profiles. These methods require a collection of motifs and DNase-seq data to predict TF binding sites in a single tissue or cell type.

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

FactorNet: A deep learning framework for predicting cell type specific transcription factor binding from nucleotide-resolution sequential data.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Methods

Lead the way for us

Similar Papers

Sequence and chromatin determinants of transcription factor binding and the establishment of cell type-specific binding patterns
Divyanshi Srivastava ... Shaun Mahony
Biochimica et Biophysica Acta (BBA) - Gene Regulatory Mechanisms | VOL. 1863
Divyanshi Srivastava, et. al.Divyanshi Srivastava ... Shaun Mahony
19 Oct 2019
Biochimica et Biophysica Acta (BBA) - Gene Regulatory Mechanisms | VOL. 1863

MixChIP: a probabilistic method for cell type specific protein-DNA binding analysis.
Sini Rautio ... Harri Lähdesmäki
BMC Bioinformatics | VOL. 16
Sini Rautio, et. al.Sini Rautio ... Harri Lähdesmäki
01 Dec 2015
BMC Bioinformatics | VOL. 16

Decision letter: Promoter sequence and architecture determine expression variability and confer robustness to genetic variants
George H Perry
-
George H PerryGeorge H Perry
07 Sep 2022
07 Sep 2022

Author response: Promoter sequence and architecture determine expression variability and confer robustness to genetic variants
Hjörleifur Einarsson ... Marco Salvatore
-
Hjörleifur Einarsson, et. al.Hjörleifur Einarsson ... Marco Salvatore
03 Nov 2022
03 Nov 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

FactorNet: A deep learning framework for predicting cell type specific transcription factor binding from nucleotide-resolution sequential data.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Methods