Abstract

Finite mixtures of regressions are used to analyze data that come from a heterogeneous population. When more than one response is observed, it can be useful to model the responses jointly as a multivariate response. In this article, we go a step further and introduce a multivariate extension that includes a latent overlapping cluster indicator variable, which allows for potential overdispersion. We present a generalized mixture of multivariate regressions connected to the proposed model, together with a new EM algorithm for fitting it. In addition, we accommodate high-dimensional predictors via shrinkage estimation. The model proves particularly useful for analyzing complex data, such as those arising in the search for cancer therapeutic biomarkers. We demonstrate this using the Genomics of Drug Sensitivity in Cancer (GDSC) resource.
