Abstract
In many data classification problems, a number of methods will give similar accuracy. However, when working with people who are not experts in data science, such as doctors, lawyers, and judges, finding interpretable algorithms can be a critical success factor. Practitioners have a deep understanding of the individual input variables but far less insight into how those variables interact with each other. For example, there may be ranges of an input variable for which the observed outcome is significantly more or less likely. This paper describes an algorithm for the automatic detection of such thresholds, called the Univariate Flagging Algorithm (UFA). The algorithm searches for a separation that optimizes the difference between the separated regions while maintaining a high level of support. We evaluate its performance using six sample datasets and demonstrate that the thresholds identified by the algorithm align well with published results and known physiological boundaries. We also introduce two classification approaches that use UFA and show that the performance attained on unseen test data is comparable to or better than that of traditional classifiers when confidence intervals are considered. We identify conditions under which UFA performs well, including applications with large amounts of missing or noisy data, applications with a large number of inputs relative to observations, and applications where the incidence of the target is low. We argue that ease of explanation of the results, robustness to missing data and noise, and detection of low-incidence adverse outcomes are desirable features for clinical applications that can be achieved with a relatively simple classifier such as UFA.
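The abstract does not give pseudocode, but the threshold search it describes can be illustrated with a short sketch. The code below is an assumption-based illustration, not the published UFA procedure: it scores each candidate threshold on a single variable by the absolute difference in observed target rates on either side (a stand-in for "the difference between the separated regions") and requires a minimum fraction of observations on each side as a stand-in for "support". The function name find_threshold and the min_support parameter are invented for this example; the paper's exact objective and support criterion may differ.

```python
import numpy as np

def find_threshold(x, y, min_support=0.1):
    """Illustrative univariate threshold search (not the published UFA objective).

    Scans candidate cut points on a single input variable and returns the split
    that maximizes the difference in target rate between the two sides, subject
    to a minimum support (fraction of observations) on each side.
    """
    x, y = np.asarray(x, dtype=float), np.asarray(y, dtype=float)
    keep = ~np.isnan(x)                     # ignore missing values in this variable
    x, y = x[keep], y[keep]
    n = len(x)
    best_threshold, best_score = None, 0.0
    for t in np.unique(x)[:-1]:             # candidate cut points (exclude max so both sides are non-empty)
        left, right = y[x <= t], y[x > t]
        support = min(len(left), len(right)) / n
        if support < min_support:           # require enough observations on both sides
            continue
        score = abs(left.mean() - right.mean())
        if score > best_score:
            best_threshold, best_score = t, score
    return best_threshold, best_score

# Example: a variable whose high range carries an elevated outcome rate.
rng = np.random.default_rng(0)
x = rng.normal(size=500)
y = (rng.random(500) < np.where(x > 1.0, 0.6, 0.1)).astype(int)
print(find_threshold(x, y))                 # expected: threshold near 1.0 with a large rate difference
```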
Highlights
Classifiers can be evaluated on multiple criteria, including accuracy, robustness, sensitivity to missing data, and ease of interpretability
Donoho and Jin [1] demonstrated that very simple univariate discriminant analysis, making no use of covariance matrices, performs comparably to much more sophisticated popular machine learning methods on a standard series of datasets [2]
We present results for datasets that vary greatly in complexity and target/non-target ratio, allowing us to identify the conditions for which the Univariate Flagging Algorithm (UFA) is well suited
Summary
Classifiers can be evaluated on multiple criteria, including accuracy, robustness, sensitivity to missing data, and ease of interpretability. Good predictive accuracy is often by far the most important evaluation metric. Donoho and Jin [1] demonstrated that very simple univariate discriminant analysis, making no use of covariance matrices, performs comparably to much more sophisticated popular machine learning methods (including boosted decision trees, Random Forests, SVM, KNN, PAM, and DLDA) on a standard series of datasets [2]. The authors of the Mas-o-Menos algorithm [3] compared their simplified approach to more sophisticated algorithms for treatment prediction in bladder, breast, and ovarian cancers, and concluded that model interpretation and validation were more important than complexity.