The ENmix DNA methylation analysis pipeline for Illumina BeadChip and comparisons with seven other preprocessing pipelines

Zongli Xu,Jack A Taylor,Liang Niu

doi:10.1186/s13148-021-01207-1

Abstract

BackgroundIllumina DNA methylation arrays are high-throughput platforms for cost-effective genome-wide profiling of individual CpGs. Experimental and technical factors introduce appreciable measurement variation, some of which can be mitigated by careful “preprocessing” of raw data.MethodsHere we describe the ENmix preprocessing pipeline and compare it to a set of seven published alternative pipelines (ChAMP, Illumina, SWAN, Funnorm, Noob, wateRmelon, and RnBeads). We use two large sets of duplicate sample measurements with 450 K and EPIC arrays, along with mixtures of isogenic methylated and unmethylated cell line DNA to compare raw data and that preprocessed via different pipelines.ResultsOur evaluations show that the ENmix pipeline performs the best with significantly higher correlation and lower absolute difference between duplicate pairs, higher intraclass correlation coefficients (ICC) and smaller deviations from expected methylation level in mixture experiments. In addition to the pipeline function, ENmix software provides an integrated set of functions for reading in raw data files from mouse and human arrays, quality control, data preprocessing, visualization, detection of differentially methylated regions (DMRs), estimation of cell type proportions, and calculation of methylation age clocks. ENmix is computationally efficient, flexible and allows parallel computing. To facilitate further evaluations, we make all datasets and evaluation code publicly available.ConclusionCareful selection of robust data preprocessing methods is critical for DNA methylation array studies. ENmix outperformed other pipelines in our evaluations to minimize experimental variation and to improve data quality and study power.

Highlights

Illumina Infinium Methylation BeadChip are being widely utilized to measure individual CpG methylation on an epigenome-wide scale
In addition to the preprocessing pipeline function, the Exponential–normal mixture model (ENmix) R software provides a set of functions to facilitate large-scale epigenetic analyses including direct import of IDAT files and Illumina manifest files, quality control measures, imputation, surrogate variable analysis for batch effects using internal control probes, intraclass correlation coefficients (ICC) calculation, epigenetic clocks, differential methylated region (DMR) analysis, and estimation of blood cell proportions
Evaluation results We applied each of the preprocessing pipelines listed in the methods to the technical duplicate datasets using each pipeline’s recommended default parameter values to evaluate how concordance between duplicates were improved (See evaluation R code in the Additional file 1)

Summary

Introduction

Illumina Infinium Methylation BeadChip are being widely utilized to measure individual CpG methylation on an epigenome-wide scale. We describe the combination of these methods into the ENmix preprocessing pipeline, named after our original background correction method, and describe features of the extended ENmix methylation analysis software. It is difficult for even experienced investigators to select from among diverse methods and implement them in their own array analysis. Experimental and technical factors introduce appreciable measurement variation, some of which can be mitigated by careful “preprocessing” of raw data

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Clinical Epigenetics	Publication Date: Dec 1, 2021
Citations: 45	License type: open-access

R Discovery Prime

R Discovery Prime

The ENmix DNA methylation analysis pipeline for Illumina BeadChip and comparisons with seven other preprocessing pipelines

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Clinical Epigenetics

Lead the way for us

Similar Papers

NKX3.1 immunohistochemistry and methylome profiling in mesenchymal chondrosarcoma: additional diagnostic value for a well-defined disease?
Salomé Glauser ... Daniel Baumhoer
Pathology | VOL. 55
Salomé Glauser, et. al.Salomé Glauser ... Daniel Baumhoer
05 May 2023
Pathology | VOL. 55

Abstract C060: DNA Methylation analysis of African American colorectal cancers reveal race-specific alterations
David N Buckley ... Mary K Yagle
Cancer Epidemiology, Biomarkers & Prevention | VOL. 32
David N Buckley, et. al.David N Buckley ... Mary K Yagle
01 Dec 2023
Cancer Epidemiology, Biomarkers & Prevention | VOL. 32

Abstract 7004: DNA Methylation analysis of African American colorectal cancers reveal race-specific alterations
Seeta Rajpara ... Nathan Ellis
Cancer Research | VOL. 84
Seeta Rajpara, et. al.Seeta Rajpara ... Nathan Ellis
22 Mar 2024
Abstract 7004: DNA Methylation analysis of African American colorectal cancers reveal race-specific alterations
Seeta Rajpara ... Nathan Ellis

Reconstructing Denisovan Anatomy Using DNA Methylation Maps.
David Gokhman ... Tomas Marques-Bonet
Cell | VOL. 179
David Gokhman, et. al.David Gokhman ... Tomas Marques-Bonet
01 Sep 2019
Cell | VOL. 179

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The ENmix DNA methylation analysis pipeline for Illumina BeadChip and comparisons with seven other preprocessing pipelines

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Clinical Epigenetics