Abstract

Cross-sectional studies are widely prevalent since they are more feasible to conduct compared with longitudinal studies. However, cross-sectional data lack the temporal information required to study the evolution of the underlying dynamics. This temporal information is essential to develop predictive computational models, which is the first step towards causal modelling. We propose a method for inferring computational models from cross-sectional data using Langevin dynamics. This method can be applied to any system where the data-points are influenced by equal forces and are in (local) equilibrium. The inferred model will be valid for the time span during which this set of forces remains unchanged. The result is a set of stochastic differential equations that capture the temporal dynamics, by assuming that groups of data-points are subject to the same free energy landscape and amount of noise. This is a ‘baseline’ method that initiates the development of computational models and can be iteratively enhanced through the inclusion of domain expert knowledge as demonstrated in our results. Our method shows significant predictive power when compared against two population-based longitudinal datasets. The proposed method can facilitate the use of cross-sectional datasets to obtain an initial estimate of the underlying dynamics of the respective systems.

Highlights

  • Longitudinal studies require a huge investment in terms of time, money and effort, depending on the system studied

  • We present a baseline method for inferring predictive computational models from cross-sectional data based on Langevin dynamics

  • We compare our model predictions against two longitudinal datasets: the College study dataset [16] and the Hoorn study dataset [17,18,19], where we consider the first timepoint as the cross-sectional data and the subsequent time-points are compared with the model predictions

Read more

Summary

Introduction

Longitudinal studies require a huge investment in terms of time, money and effort, depending on the system studied. Concentrations; population-based cohort studies in public health involve asking each of the participants 2 to visit the hospital, measuring various physiological variables, and assessing psychological well-being through interviews and questionnaires. This leads to a relative abundance of cross-sectional datasets in these fields. The price to pay is that cross-sectional data lack the temporal information needed to study the evolution of the underlying dynamics. This hampers the development of models that can make predictions ( predictive models) or even simulate the effects of interventions (causal models) in these fields. In order to use the abundant cross-sectional data to study the dynamics of system behaviour it is important to design methods that aim to infer the temporal dynamics from these data

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call