Abstract

Statistical features of the distribution of transcription factor binding sites in the mouse genome that are obtained by ChIP-seq experiments in embryonic stem cells have been considered. Clusters of sites that contain four or more different transcription factor binding sites in the mouse genome have been defined, also their location relatively to the regulatory regions of genes has been described. The presence of two types of site co-localization has been shown: clusters containing binding sites for factors Oct4, Nanog, Sox2, located in the distal regions, and clusters containing binding sites n-Myc, c-Myc, mainly located in the promoter regions of mouse genes. Analysis of new ChIPseq data about binding of transcription factors Nr5a2, Tbx3 in the same cell type has confirmed the division of clusters of transcription factors binding sites into two types: those containing the binding sites of regulators of pluripotency (Oct4, Nanog, and others) and those not. The computer program of the statistical data processing of gene location and chromatin domains that analyzes experimental data of site localization obtained by ChIP-seq in the mouse genome and the human genome has been developed. The presence of preferences at position of transcription factor binding sites of various types has been revealed, the distances between the nearest groups of TF binding sites Oct4, Nanog, Sox2 and TF binding sites n-Myc and c-Myc have been calculated using this program. The presence of nucleotide motifs of transcription factor binding sites in the selected areas of ChIP-seq has been estimated, nucleotide motifs have been refined. A correlation between the presence of motifs and the intensity of ChIPseq binding has been shown. Computer methods for estimating the clustering of different transcription factors binding sites for new data ChIP-seq have been developed. Programs are available upon the request to the authors.

Highlights

  • Разработана компьютерная программа расчета кластеров сайтов связывания различных транскрипционных факторов (ТФ) по дан­ ным геномных координат пиков профиля ChIP-seq (Chromatin ImmunoPrecipitation-sequencing)

  • Statistical features of the distribution of transcription factor binding sites in the mouse genome that are obtained by ChIP-seq experiments in embryonic stem cells have been considered

  • Analysis of new ChIPseq data about binding of transcription factors Nr5a2, Tbx3 in the same cell type has confirmed the division of clusters of transcription factors binding sites into two types: those containing the binding sites of regulators of pluripotency (Oct4, Nanog, and others) and those not

Read more

Summary

ОРИГИНАЛЬНОЕ ИССЛЕДОВАНИЕ

С помощью компьютерного анализа данных ChIP-seq, доступных в GEONCBI, построены полногеномные карты сайтов связывания транскрипционных факторов в эмбриональных стволовых клетках в геноме мыши для факторов c-Myc, Oct, Nanog, Sox, E2f1, n-Myc, Tbx, Eset, Nr5a2, Smad (Chen et al, 2008; Han et al, 2010; Heng et al, 2010; Lee et al, 2011), а также Cep, SRF, USF1 (Sirito et al, 1998; Xu et al, 2014; Kuzniewska et al, 2015). Материалы и методы В работе были использованы полногеномные карты сайтов связывания транскрипционных факторов в эмбриональных стволовых клетках, построенных по данным ChIP-seq для c-Myc, Oct, Nanog, Sox, E2f1, n-Myc, Tbx, Eset, Nr5a2, Smad в геноме мыши (Chen et al, 2008; Han et al, 2010; Heng et al, 2010; Lee et al, 2011). Chen с коллегами (2008) показал совместную локализацию связывания Tbx с группой Oct4-Sox2-Nanog (Доп. материалы 1)

Статистика расположения сайтов связывания транскрипционных факторов
Search for patterns of site locations using GeneDiscovery
Number of clusters
Findings
Percentage of present motifs
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call