HPRep: Quantifying Reproducibility in HiChIP and PLAC-Seq Datasets

Jonathan D Rosen, Jiawen Chen, Ian R Jones, Yin Shen, Armen Abnousi, Michael Song, Ming Hu, Yun Li, Yuchen Yang

doi:10.3390/cimb43020082

Abstract

HiChIP and PLAC-Seq are emerging technologies for studying genome-wide long-range chromatin interactions mediated by a protein of interest, enabling more sensitive and cost-efficient interrogation of protein-centric chromatin conformation. However, due to the unbalanced read distribution introduced by protein immunoprecipitation, existing reproducibility measures developed for Hi-C data are not appropriate for the analysis of HiChIP and PLAC-Seq data. Here, we present HPRep, a stratified and weighted correlation metric derived from normalized contact counts, to quantify reproducibility in HiChIP and PLAC-Seq data. We applied HPRep to multiple real datasets and demonstrated that it outperforms existing reproducibility measures developed for Hi-C data. Specifically, we applied HPRep to H3K4me3 PLAC-Seq data from mouse embryonic stem cells and mouse brain tissues, as well as H3K27ac HiChIP data from the human lymphoblastoid cell line GM12878 and the leukemia cell line K562, showing that HPRep can more clearly distinguish among pseudo-replicates, real replicates, and non-replicates. Furthermore, in an H3K4me3 PLAC-Seq dataset consisting of 11 samples from four human brain cell types, HPRep demonstrated the expected clustering of data that could not be achieved by existing methods developed for Hi-C data, highlighting the need for a reproducibility metric tailored to HiChIP and PLAC-Seq data.

Highlights

Chromatin spatial organization plays a critical role in genome structure and transcriptional regulation [1,2,3].
Although quantification of data reproducibility is critical to ensuring scientific rigor, methods tailored for HiChIP and PLAC-Seq data are still lacking.
To fill this gap, we propose a novel method, HPRep, to measure the similarity or reproducibility between two HiChIP or PLAC-Seq (HP) datasets.

Summary

Introduction

Chromatin spatial organization plays a critical role in genome structure and transcriptional regulation [1,2,3]. HiCRep [6] first performs 2D smoothing to reduce the stochastic noise resulting from the sparsity of Hi-C data, and then quantifies reproducibility by calculating a stratified correlation: a weighted average of correlation coefficients between contact frequencies, computed separately within specific one-dimensional (1D) genomic distance bands. QuASAR-Rep [9] determines a local correlation matrix by comparing observed interaction counts to background signal–distance values within a specified distance. This local correlation matrix is subsequently transformed by element-wise multiplication with a matrix of scaled interaction counts. The reproducibility between two samples is defined as the Pearson correlation coefficient between the corresponding transformed matrices.
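To make the idea of a stratified correlation concrete, the following is a minimal sketch in Python. It is not the HiCRep or HPRep implementation: the function name `stratified_correlation`, the parameter `max_band`, and the choice to weight each distance band simply by its number of bin pairs are all assumptions made for illustration, and the 2D smoothing step is omitted entirely.

```python
import numpy as np


def stratified_correlation(a, b, max_band):
    """Weighted average of per-band Pearson correlations.

    `a` and `b` are square contact matrices of the same shape.
    Band k holds all bin pairs separated by k bins (the k-th diagonal).
    ASSUMPTION: each band is weighted by its number of bin pairs; the
    published methods use their own, more refined weighting schemes.
    """
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    if a.ndim != 2 or a.shape[0] != a.shape[1] or a.shape != b.shape:
        raise ValueError("a and b must be square matrices of equal shape")

    rhos, weights = [], []
    for k in range(1, max_band + 1):
        x = np.diagonal(a, offset=k)
        y = np.diagonal(b, offset=k)
        # Skip bands that are too short or constant: correlation is undefined.
        if x.size < 2 or x.std() == 0 or y.std() == 0:
            continue
        rhos.append(np.corrcoef(x, y)[0, 1])
        weights.append(x.size)

    if not weights:
        return float("nan")
    return float(np.average(rhos, weights=weights))
```

Stratifying by distance matters because contact frequency decays strongly with genomic distance; correlating all entries at once would mostly measure that shared decay rather than agreement between the two samples.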



Journal: Current Issues in Molecular Biology
Publication Date: Sep 17, 2021
Citations: 6
License type: CC BY 4.0


