Application of alternative de novo motif recognition models for analysis of structural heterogeneity of transcription factor binding sites: a case study of FOXA2 binding sites.

A V Tsukanov, V G Levitsky, T I Merkulova

doi:10.18699/vj21.002

Open Access

https://doi.org/10.18699/vj21.002

Abstract

The most popular model for searching ChIP-seq data for transcription factor binding sites (TFBS) is the positional weight matrix (PWM). However, this model does not take into account dependencies between nucleotide occurrences at different site positions. Two recently proposed models, BaMM and InMoDe, do account for such dependencies. However, their application has usually been limited to comparing their recognition accuracy with that of PWMs; the co-prediction and relative positioning of hits of different models within peaks have not yet been analyzed. To close this gap, we propose a pipeline called MultiDeNA. The pipeline includes the stages of model training, assessment of recognition accuracy, scanning of ChIP-seq peaks, and classification of peaks based on the scan results. We applied the pipeline to 22 ChIP-seq datasets for the TF FOXA2 and considered the PWM, dinucleotide PWM (diPWM), BaMM and InMoDe models. Combining these four models significantly increased the fraction of recognized peaks compared with the PWM model alone: the increase was 26.3 %. The BaMM model made the main contribution to site recognition. Although the major fraction of predicted peaks contained TFBS of different models at coinciding positions, the medians of the fraction of peaks containing predictions of only a single model were 1.08, 0.49, 4.15 and 1.73 % for PWM, diPWM, BaMM and InMoDe, respectively. Thus, FOXA2 BSs were not fully described by any single model, which indicates their heterogeneity. We assume that the BaMM model is the most successful in describing the structure of FOXA2 BSs in the ChIP-seq datasets under study.
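To make the PWM's independence assumption concrete, the following minimal Python sketch scores a candidate site as a plain sum of per-position weights, so no term couples two positions. It is an illustration only, not the MultiDeNA implementation: the count matrix, the pseudocount, the uniform background and the function names (`counts_to_pwm`, `score_site`, `best_hit`) are all assumptions made for this example, and the toy counts are not FOXA2 data.

```python
import math

# Hypothetical toy count matrix for a 4-bp motif (rows: positions; columns: A, C, G, T).
# The counts are invented for illustration and are not FOXA2 data.
COUNTS = [
    [8, 1, 0, 1],
    [0, 9, 1, 0],
    [1, 0, 9, 0],
    [0, 1, 0, 9],
]
ALPHABET = "ACGT"


def counts_to_pwm(counts, pseudocount=1.0, background=0.25):
    """Convert position counts to log-odds weights against a uniform background."""
    pwm = []
    for row in counts:
        total = sum(row) + pseudocount * len(row)
        pwm.append([math.log2((c + pseudocount) / total / background) for c in row])
    return pwm


def score_site(pwm, site):
    """PWM score: a sum of per-position weights, with no cross-position terms."""
    return sum(pwm[i][ALPHABET.index(base)] for i, base in enumerate(site))


def best_hit(pwm, sequence):
    """Slide the PWM along the sequence and return (start, score) of the best window."""
    width = len(pwm)
    hits = [
        (start, score_site(pwm, sequence[start:start + width]))
        for start in range(len(sequence) - width + 1)
    ]
    return max(hits, key=lambda hit: hit[1])


pwm = counts_to_pwm(COUNTS)
start, score = best_hit(pwm, "TTACGTGG")
```

Because the score is additive over positions, a model of this form cannot reward or penalize specific combinations of nucleotides at different positions; that limitation is what the dependency-aware models named in the abstract are meant to address.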

Highlights

Transcription factors (TFs) are proteins that can recognize certain regions of genomic DNA, the transcription factor binding sites (TFBS) (Lambert et al, 2018)
Classification of ChIP-seq peaks based on the results of TFBS recognition by different models
The first one takes into account the intersection of positions of TFBS predicted by different models; the second one does not
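The difference between classifying a peak with and without regard to hit positions can be sketched as follows. This is a hedged illustration, not the classification scheme used by the authors: the half-open `(start, end)` hit representation, the function names and the example coordinates are all assumptions introduced here.

```python
from itertools import combinations


def overlaps(hit_a, hit_b):
    """True if two half-open intervals (start, end) share at least one position."""
    return hit_a[0] < hit_b[1] and hit_b[0] < hit_a[1]


def classify_without_positions(hits_by_model):
    """Label a peak by the set of models that have at least one hit in it."""
    return frozenset(model for model, hits in hits_by_model.items() if hits)


def classify_with_positions(hits_by_model):
    """Return the pairs of models that have at least one pair of overlapping hits."""
    models = sorted(model for model, hits in hits_by_model.items() if hits)
    pairs = set()
    for a, b in combinations(models, 2):
        if any(overlaps(ha, hb) for ha in hits_by_model[a] for hb in hits_by_model[b]):
            pairs.add((a, b))
    return pairs


# Hypothetical hits within one peak, as (start, end) offsets from the peak start.
peak_hits = {
    "PWM": [(10, 22)],
    "diPWM": [],
    "BaMM": [(12, 24), (80, 92)],
    "InMoDe": [(150, 162)],
}

models_present = classify_without_positions(peak_hits)
overlapping_pairs = classify_with_positions(peak_hits)
```

In this toy peak, three models have hits, but only the PWM and BaMM hits coincide in position; the InMoDe hit and the second BaMM hit mark locations that no other model predicts.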

Summary

Introduction

Transcription factors (TFs) are proteins that can recognize certain regions of genomic DNA (TF binding sites, TFBS) (Lambert et al, 2018). The main function of TFs is to increase or decrease the level of gene transcription (Latchman, 2001). The key stage in the regulation of gene expression is TF binding to DNA. This binding initiates a chain of molecular events that ensure the assembly and regulate the activity of the pre-initiation complex of RNA polymerase II, both through direct or indirect contacts with the components of this complex and through the recruitment of various chromatin-modifying and chromatin-remodeling proteins. One of the most important tasks of modern molecular biology is to identify genomic TFBSs.

Journal: Vavilovskii zhurnal genetiki i selektsii
Publication Date: Feb 1, 2021
Citations: 3
License type: cc-by
