Detecting the impact of subject characteristics on machine learning-based diagnostic applications

Elias Chaibub Neto,Meghasyam Tummalacherla,Andrew D Trister,Lara Mangravite,Larsson Omberg,Brian M Bot,Phil Snyder,Abhishek Pratap,Stephen H Friend,Thanneer M Perumal

doi:10.1038/s41746-019-0178-x

Elias Chaibub Neto, Meghasyam Tummalacherla + Show 8 more

Open Access

https://doi.org/10.1038/s41746-019-0178-x

Copy DOI

Abstract

Collection of high-dimensional, longitudinal digital health data has the potential to support a wide-variety of research and clinical applications including diagnostics and longitudinal health tracking. Algorithms that process these data and inform digital diagnostics are typically developed using training and test sets generated from multiple repeated measures collected across a set of individuals. However, the inclusion of repeated measurements is not always appropriately taken into account in the analytical evaluations of predictive performance. The assignment of repeated measurements from each individual to both the training and the test sets (“record-wise” data split) is a common practice and can lead to massive underestimation of the prediction error due to the presence of “identity confounding.” In essence, these models learn to identify subjects, in addition to diagnostic signal. Here, we present a method that can be used to effectively calculate the amount of identity confounding learned by classifiers developed using a record-wise data split. By applying this method to several real datasets, we demonstrate that identity confounding is a serious issue in digital health studies and that record-wise data splits for machine learning- based applications need to be avoided.

Highlights

The development of clinically actionable digital health assessments derived from high fidelity data obtained from wearables, smartphones, and in-home monitoring systems can transform the early diagnosis and treatment of health complications and diseases
Few digital health measures have made it into clinical practice, early research have shown the clinical potential of digital biomarkers for assessment and remote management of diseases.[1,2,3,4,5,6]
As the sensors embedded in consumer grade devices get increasingly sophisticated at capturing data of high fidelity and frequency, machine learning models trained on such data are often able to uniquely identify the individual from whom each data stream is collected,[7] illustrating the high sensitivity of sensor data to capture individualized “digital fingerprints” of the data contributors

Summary

INTRODUCTION

The development of clinically actionable digital health assessments derived from high fidelity data obtained from wearables, smartphones, and in-home monitoring systems can transform the early diagnosis and treatment of health complications and diseases. Saeb et al.[8] demonstrated via simulation studies that (for diagnostic applications) splitting the data into training and test sets in a record-wise fashion can lead to massive underestimation of prediction error achieved by the machine learning algorithm. Since the easier task of subject identification replaces the harder task of disease recognition, classifiers trained on data split in a record-wise manner end up achieving overly optimistic prediction accuracy estimates,[8] that are not disease relevant This practice is extremely common—a literature review by Saeb et al.[8] found out that 28 out of 62 papers using repeated measurements for diagnostic purposes employed the record-wise data split. The ability to correctly assess the predictive performance of machine learning diagnostic systems has important practical implications since overestimated predictive accuracies can negatively impact decisions made by funding agencies, and might stimulate the deployment of untested health related apps, a current serious concern in the mobile health field.[10]

RESULTS

DISCUSSION

CODE AVAILABILITY

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: npj Digital Medicine	Publication Date: Oct 11, 2019
Citations: 54	License type: open-access

R Discovery Prime

R Discovery Prime

Detecting the impact of subject characteristics on machine learning-based diagnostic applications

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: npj Digital Medicine

Lead the way for us

Similar Papers

Transparent sharing of digital health data: A call to action
David J Slotwiner ... Debbe Mccall
Heart Rhythm | VOL. 16
David J Slotwiner, et. al.David J Slotwiner ... Debbe Mccall
08 May 2019
Heart Rhythm | VOL. 16

Has traditional medicine had its day? The need to redefine academic medicine
Victor J Dzau ... Celynne A Balatbat
The Lancet | VOL. 400
Victor J Dzau, et. al.Victor J Dzau ... Celynne A Balatbat
20 Sep 2022
The Lancet | VOL. 400

Digital Public Health: Quality, Interoperability and Capability Maturity
Siaw-Teng Liaw
-
Siaw-Teng LiawSiaw-Teng Liaw
30 Jan 2024
30 Jan 2024

6.F. Workshop: Digital public health in Europe - What is the new normal?
...
European Journal of Public Health | VOL. 33
, et. al. ...
24 Oct 2023
European Journal of Public Health | VOL. 33

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Detecting the impact of subject characteristics on machine learning-based diagnostic applications

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: npj Digital Medicine