Not just “big” data: Importance of sample size, measurement error, and uninformative predictors for developing prognostic models for digital interventions

Mary E Mcnamara,Mackenzie Zisser,Christopher G Beevers,Jason Shumake

doi:10.1016/j.brat.2022.104086

Mary E Mcnamara, Mackenzie Zisser + Show 2 more

Open Access

https://doi.org/10.1016/j.brat.2022.104086

Copy DOI

Journal: Behaviour Research and Therapy	Publication Date: Apr 14, 2022
Citations: 22	License type: publisher-specific-oa

Affiliation: The University of Texas at Austin

Abstract

There is strong interest in developing a more efficient mental health care system. Digital interventions and predictive models of treatment prognosis will likely play an important role in this endeavor. This article reviews the application of popular machine learning models to the prediction of treatment prognosis, with a particular focus on digital interventions. Assuming that the prediction of treatment prognosis will involve modeling a complex combination of interacting features with measurement error in both the predictors and outcomes, our simulations suggest that to optimize complex prediction models, sample sizes in the thousands will be required. Machine learning methods capable of discovering complex interactions and nonlinear effects (e.g., decision tree ensembles such as gradient boosted machines) perform particularly well in large samples when the predictors and outcomes have virtually no measurement error. However, in the presence of moderate measurement error, these methods provide little or no benefit over regularized linear regression, even with very large sample sizes (N = 100,000) and a non-linear ground truth. Given these sample size requirements, we argue that the scalability of digital interventions, especially when used in combination with optimal measurement practices, provides one of the most effective ways to study treatment prediction models. We conclude with suggestions about how to implement these algorithms into clinical practice.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Not just “big” data: Importance of sample size, measurement error, and uninformative predictors for developing prognostic models for digital interventions

Abstract

Talk to us

Similar Papers

More From: Behaviour Research and Therapy

Lead the way for us

Similar Papers

Digital Technology in Cardiovascular Health: Role and Evidence Supporting Its Use.
Pamela Martyn-Nemeth ... Laura L Hayman
Journal of Cardiovascular Nursing | VOL. 38
Pamela Martyn-Nemeth, et. al.Pamela Martyn-Nemeth ... Laura L Hayman
31 Mar 2023
Journal of Cardiovascular Nursing | VOL. 38

Importance of sample size in clinical trials
Ganeshs Kumar
International Journal of Clinical and Experimental Physiology | VOL. 1
Ganeshs KumarGaneshs Kumar
01 Jan 2014
International Journal of Clinical and Experimental Physiology | VOL. 1

Digital Mental Health Interventions for Alleviating Depression and Anxiety During Psychotherapy Waiting Lists: Systematic Review.
Sijia Huang ... Thomas J Nyman
JMIR mental health | VOL. 11
Sijia Huang, et. al.Sijia Huang ... Thomas J Nyman
10 Sep 2024
JMIR mental health | VOL. 11

Commentary: The reliability of telomere length measurements
Simon Verhulst ... Jeremy D Kark
International Journal of Epidemiology | VOL. 44
Simon Verhulst, et. al.Simon Verhulst ... Jeremy D Kark
24 Sep 2015
International Journal of Epidemiology | VOL. 44

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Not just “big” data: Importance of sample size, measurement error, and uninformative predictors for developing prognostic models for digital interventions

Abstract

Talk to us

Similar Papers

More From: Behaviour Research and Therapy