Evaluating deep learning for predicting epigenomic profiles.

Shushan Toneyan,Peter K Koo,Ziqi Tang

doi:10.1038/s42256-022-00570-9

Shushan Toneyan, Peter K Koo + Show 1 more

Open Access

PDF Available

https://doi.org/10.1038/s42256-022-00570-9

Copy DOI

Export

Save

Cite

Journal: Nature Machine Intelligence	Publication Date: Dec 5, 2022
Citations: 36	License type: cc-by-nc-nd

Affiliation: Cold Spring Harbor Laboratory

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

Deep learning has been successful at predicting epigenomic profiles from DNA sequences. Most approaches frame this task as a binary classification relying on peak callers to define functional activity. Recently, quantitative models have emerged to directly predict the experimental coverage values as a regression. As new models continue to emerge with different architectures and training configurations, a major bottleneck is forming due to the lack of ability to fairly assess the novelty of proposed models and their utility for downstream biological discovery. Here we introduce a unified evaluation framework and use it to compare various binary and quantitative models trained to predict chromatin accessibility data. We highlight various modeling choices that affect generalization performance, including a downstream application of predicting variant effects. In addition, we introduce a robustness metric that can be used to enhance model selection and improve variant effect predictions. Our empirical study largely supports that quantitative modeling of epigenomic profiles leads to better generalizability and interpretability.

Full Text