Abstract

DNA methylation-based predictors of various biological metrics have been widely published and are becoming valuable tools in epidemiologic studies of epigenetics and personalized medicine. However, generating these predictors from original source software and web servers is complex and time consuming. Furthermore, different predictors were often derived based on data from different types of arrays, where array differences and batch effects can make predictors difficult to compare across studies. We integrate these published methods into a single R function to produce 158 previously published predictors for chronological age, biological age, exposures, lifestyle traits and serum protein levels using both classical and principal component-based methods. To mitigate batch and array differences, we also provide a modified RCP method (ref-RCP) that normalize input DNA methylation data to reference data prior to estimation. Evaluations in real datasets show that this approach improves estimate precision and comparability across studies. The function was included in software package ENmix, and is freely available from Bioconductor website (https://www.bioconductor.org/packages/release/bioc/html/ENmix.html). Supplementary data are available at Bioinformatics online.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call