Abstract
There is growing interest in multivariate dependent outcome models that include a mixture of different kinds of discrete and continuous variables. This may be attributed to at least two reasons. The first is the ability to generate multivariate distributions through the use of relatively flexible copula-based methods and/or effective factorization techniques for the covariance matrices. The second is the development of computationally efficient ways to estimate models based on variational methods for Bayesian inference or maximum approximate composite marginal likelihood methods for frequentist inference. However, there are two important assumptions in earlier mixed data models: (i) marginal normality of unobserved factors that generate jointness among the main outcome variables of interest, and (ii) independence between the unobserved factors and the propensity equations underlying the main outcomes of interest. In the current paper, we simultaneously relax both these assumptions and develop a flexible Generalized Heterogeneous Data Model (GHDM) for mixed data modeling. We then propose a hybrid MSL-MACML inference approach for estimation. We demonstrate an application of our proposed model in the context of individuals’ high-density residential neighborhood living choice and monthly bicycling frequency. The empirical results highlight the benefits of our proposed methodology, both from a policy standpoint as well as a predictive data fit standpoint.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.