A comparison of methods for interpreting random forest models of genetic association in the presence of non-additive interactions

Alena Orlenko,Jason H Moore

doi:10.1186/s13040-021-00243-0

Alena Orlenko, Jason H Moore

Open Access

https://doi.org/10.1186/s13040-021-00243-0

Copy DOI

Journal: BioData mining	Publication Date: Jan 29, 2021
Citations: 21	License type: open-access

Affiliation: University of Pennsylvania

Abstract

BackgroundNon-additive interactions among genes are frequently associated with a number of phenotypes, including known complex diseases such as Alzheimer’s, diabetes, and cardiovascular disease. Detecting interactions requires careful selection of analytical methods, and some machine learning algorithms are unable or underpowered to detect or model feature interactions that exhibit non-additivity. The Random Forest method is often employed in these efforts due to its ability to detect and model non-additive interactions. In addition, Random Forest has the built-in ability to estimate feature importance scores, a characteristic that allows the model to be interpreted with the order and effect size of the feature association with the outcome. This characteristic is very important for epidemiological and clinical studies where results of predictive modeling could be used to define the future direction of the research efforts. An alternative way to interpret the model is with a permutation feature importance metric which employs a permutation approach to calculate a feature contribution coefficient in units of the decrease in the model’s performance and with the Shapely additive explanations which employ cooperative game theory approach. Currently, it is unclear which Random Forest feature importance metric provides a superior estimation of the true informative contribution of features in genetic association analysis.ResultsTo address this issue, and to improve interpretability of Random Forest predictions, we compared different methods for feature importance estimation in real and simulated datasets with non-additive interactions. As a result, we detected a discrepancy between the metrics for the real-world datasets and further established that the permutation feature importance metric provides more precise feature importance rank estimation for the simulated datasets with non-additive interactions.ConclusionsBy analyzing both real and simulated data, we established that the permutation feature importance metric provides more precise feature importance rank estimation in the presence of non-additive interactions.

Highlights

Non-additive interactions among genes are frequently associated with a number of phenotypes, including known complex diseases such as Alzheimer’s, diabetes, and cardiovascular disease
permutation feature importance (PFI), SHAP, and built-in feature importance coefficients (BIC) metrics were estimated for the fitted random forest (RF) models and further compared to the real feature importances retrieved with the HIBACHI sensitivity analysis
For all combinations of factors that were considered in HIBACHI simulations, PFI metric consistently outperformed BIC and SHAP metrics in the ability to determine feature importance order

Summary

Introduction

Non-additive interactions among genes are frequently associated with a number of phenotypes, including known complex diseases such as Alzheimer’s, diabetes, and cardiovascular disease. An alternative way to interpret the model is with a permutation feature importance metric which employs a permutation approach to calculate a feature contribution coefficient in units of the decrease in the model’s performance and with the Shapely additive explanations which employ cooperative game theory approach It is unclear which Random Forest feature importance metric provides a superior estimation of the true informative contribution of features in genetic association analysis. A group of methods provide a graphical description of the model’s global behavior (i.e. partial dependency plots [1] and decision tree surrogate models) Several methods such as individual conditional expectation (ICE) plots [2], local interpretable model-agnostic explanations or LIME [3] and Shapley additive explanations or SHAP [4] focus on explaining individual model predictions. Interpretability (along with performance) is the key quality of the machine learning model, when it is applied to biomedical research goals such as biomarkers discovery and patient diagnostics

Objectives

Methods

Results

Conclusion