Statistical Approach for Functionally Validating Transcription Factor Bindings Using Population SNP and Gene Expression Data

Jing Xiang

doi:10.1184/r1/9964028.v1

Abstract

Understanding transcriptional gene regulation is an important step to understanding how essential mechanisms are controlled in biological systems. Functional assayssuch as ChIP-seq and DNase I have been used to obtain a binding map of transcription factor (TF) binding sites on DNA and to determine the transcriptional regulatory network of TFs and their target genes. However, binding alone may notresult in a change in target gene expression. Experimental approaches to identifying functional binding events involve performing artificial TF knockdown experimentsor genome editing [31, 45, 70] and then declaring the differentially expressed genes as functionally validated target genes. Instead of artificial perturbation, in order to functionally validate the TF binding map, we propose to leverage the naturally occurring genetic variations as the source of perturbations that vary gene expressions and to analyze population single nucleotide polymorphism (SNP) and gene expression data. Experimental approaches typically target either a single TF or a family of TFs. In addition, in a single experiment, you must choose whether to perturb TF concentration through RNA interference or CRISPR interference, or TF binding affinity through genome editing. However, our approach is potentially more powerfulbecause any aspects of the TF-target interaction, including TF concentration and TF binding affinity, can be perturbed by a large number of SNPs found across the genome simultaneously and the effects are learned in a single analysis. In this thesis, we first introduce a statistical approach, based on conditional Gaussian Bayesian networks, that integrates population SNP and gene expression data with TF binding data to validate the TF binding map. We developed an efficientlearning algorithm for learning the gene regulatory network by using TF binding data as prior knowledge, and selecting the TF-target interactions that are validated based on population SNP and gene-expression data. Given the estimated network, we perform inference on the estimated probabilistic graphical models to determine downstream genes that are differentially expressed due to the effect of the TF-target interactions. We apply our method to learn transcriptional regulatory networks in lymphoblastoidcell lines (LCLs) and breast cancer tumours. First, we demonstrate our approach for validation of the TF binding map derived from ENCODE DNase I and ChIPseqdata from 71 TFs in LCLs, with SNP and gene expression data from the 1000 genomes and HapMap 3 projects respectively. We examined functional target genesthat were validated under perturbation of TF concentration and TF binding affinity. Finally, we apply our method to perform TF binding map validation for ER and itscoregulators which include 38 TFs obtained from Cistrome TF binding data, by using The Cancer Genome Atlas SNP and expression data from breast cancer tumors.We identified many previously known interactions between ER and its coregulators. We also found expression quantitative trait loci (eQTLs) in local binding regions oftarget genes that are potential super enhancers and eQTLs in coding regions that may affect the protein structure of important regulators.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Statistical Approach for Functionally Validating Transcription Factor Bindings Using Population SNP and Gene Expression Data

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Rational gRNA design based on transcription factor binding data.
David Bergenholm ... Jens Nielsen
Synthetic biology (Oxford, England) | VOL. 6
David Bergenholm, et. al.David Bergenholm ... Jens Nielsen
27 Jul 2021
Synthetic biology (Oxford, England) | VOL. 6

Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity.
Yutong Jin ... Zhaohui S Qin
Frontiers in genetics | VOL. 12
Yutong Jin, et. al.Yutong Jin ... Zhaohui S Qin
09 Sep 2021
Frontiers in genetics | VOL. 12

Thematic review series: Systems Biology Approaches to Metabolic and Cardiovascular Disorders. Multi-organ whole-genome measurements and reverse engineering to uncover gene networks underlying complex traits
Jesper Tegnér ... Johan Björkegren
Journal of Lipid Research | VOL. 48
Jesper Tegnér, et. al.Jesper Tegnér ... Johan Björkegren
01 Feb 2007
Journal of Lipid Research | VOL. 48

Learning gene regulatory networks based on Dempster-Shafer evidence theory
Hongxia Zhang ... Ying-Fei Sun
-
Hongxia Zhang, et. al. Hongxia Zhang ... Ying-Fei Sun
01 Aug 2010
01 Aug 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Statistical Approach for Functionally Validating Transcription Factor Bindings Using Population SNP and Gene Expression Data

Abstract

Talk to us

Similar Papers