Major cell-types in multiomic single-nucleus datasets impact statistical modeling of links between regulatory sequences and target genes

Francis J A Leblanc,Guillaume Lettre

doi:10.1038/s41598-023-31040-w

Francis J A Leblanc, Guillaume Lettre

Open Access

https://doi.org/10.1038/s41598-023-31040-w

Copy DOI

Abstract

Epigenomic profiling, including ATACseq, is one of the main tools used to define enhancers. Because enhancers are overwhelmingly cell-type specific, inference of their activity is greatly limited in complex tissues. Multiomic assays that probe in the same nucleus both the open chromatin landscape and gene expression levels enable the study of correlations (links) between these two modalities. Current best practices to infer the regulatory effect of candidate cis-regulatory elements (cCREs) in multiomic data involve removing biases associated with GC content by generating null distributions of matched ATACseq peaks drawn from different chromosomes. This strategy has been broadly adopted by popular single-nucleus multiomic workflows such as Signac. Here, we uncovered limitations and confounders of this approach. We found a strong loss of power to detect a regulatory effect for cCREs with high read counts in the dominant cell-type. We showed that this is largely due to cell-type-specific trans-ATACseq peak correlations creating bimodal null distributions. We tested alternative models and concluded that physical distance and/or the raw Pearson correlation coefficients are the best predictors for peak-gene links when compared to predictions from Epimap (e.g. CD14 area under the curve [AUC] = 0.51 with the method implemented in Signac vs. 0.71 with the Pearson correlation coefficients) or validation by CRISPR perturbations (AUC = 0.63 vs. 0.73).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: Mar 9, 2023
Citations: 3	License type: open-access

R Discovery Prime

R Discovery Prime

Major cell-types in multiomic single-nucleus datasets impact statistical modeling of links between regulatory sequences and target genes

Abstract

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

An ultra high-throughput method for single-cell joint analysis of open chromatin and transcriptome
Chenxu Zhu ... Armen Abnousi
Nature Structural & Molecular Biology | VOL. 26
Chenxu Zhu, et. al.Chenxu Zhu ... Armen Abnousi
01 Nov 2019
Nature Structural & Molecular Biology | VOL. 26

Identification and characterization of differentially expressed exosomal microRNAs in bovine milk infected with Staphylococcus aureus
Shaoyang Ma ... Eveline M Ibeagha-Awemu
BMC Genomics | VOL. 20
Shaoyang Ma, et. al.Shaoyang Ma ... Eveline M Ibeagha-Awemu
01 Dec 2019
BMC Genomics | VOL. 20

Evolution of genome base composition and genome size in bacteria
Hiromi Nishida
Frontiers in Microbiology | VOL. 3
Hiromi NishidaHiromi Nishida
01 Jan 2012
Frontiers in Microbiology | VOL. 3

Profiling plant histone modification at single-cell resolution using snCUT&Tag.
Weizhi Ouyang ... Guoliang Li
Plant Biotechnology Journal | VOL. 20
Weizhi Ouyang, et. al.Weizhi Ouyang ... Guoliang Li
16 Jan 2022
Plant Biotechnology Journal | VOL. 20

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Major cell-types in multiomic single-nucleus datasets impact statistical modeling of links between regulatory sequences and target genes

Abstract

Talk to us

Similar Papers

More From: Scientific Reports