Abstract

Population health is multidimensional in nature, having complex relationships with the various health determinants. However, most previous studies investigate a single dimension of population health using linear models, failing to capture the nonlinearity in the data and interdependence of multiple dimensions in health outcomes. In this paper, we propose a data-driven multivariate statistical learning approach to simultaneously model various aspects of population health—characterizing the length and quality of life—as a function of health behaviors, clinical care, socioeconomic factors, physical environment, and demographics. We also propose a novel percentile-based variable selection for multivariate regression, without compromising the model’s generalization performance. We demonstrate the applicability of our proposed data-driven methodological framework using the New York State as a case study. Leveraging cross-validation techniques and statistical hypothesis tests, the results indicate that multivariate tree boosting method outperforms the traditionally-used univariate linear regression model and random forest in modeling multidimensional population health. The variable importance heat-map illustrates the relative influence of the key health determinants on the various dimensions of population health. Partial dependence plots are used to quantify the marginal effects and the nonlinear relationships between the health outcomes and health inputs. Our results show that teen birth rate is strongly associated with both length of life (e.g., child mortality) and quality of life (e.g., physically unhealthy days). Socioeconomic status is the key indicator to predict child and infant mortality. Our proposed framework can be used as a decision support tool for accurately assessing and predicting multivariate population health.

Highlights

  • Human health and wellbeing is the key to a thriving and equitable society [1]

  • RESULTS we present the results from our case study to illustrate the applicability of our proposed data-driven multivariate framework for modeling the multidimensional population health outcomes

  • We propose a datadriven multivariate framework to simultaneously model nine dimensions of population health— characterizing the length of life and quality of life—as a nonlinear function of health behaviors, clinical care, socioeconomic factors, physical environment and demographics

Read more

Summary

Introduction

Human health and wellbeing is the key to a thriving and equitable society [1]. Population health is often conceptualized as the health status and health outcomes within a group of people, instead of considering the individual health at a time [2]. Efforts have been made to enhance the overall population health by improving the overall or mean population health of a community, and eliminating health disparities within that population [3]–[5]. The “Healthy People 2030”, developed by the U.S Department of Health and Human Services Advisory Committee on National Health Promotion and Disease Prevention for 2030, sets data-driven national objectives to improve health and well-being over the decade [1]. Even though increasing attention has been paid to enhancing population health, several challenges still exist towards quantitatively assessing population health. A number of studies focus on assessing population health, most of them consider only one of the dimensions such as health-related quality of life

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.