Abstract

Summary Motivated by the Australian National University poll, we consider a situation where survey data have been collected from respondents for several categorical variables and a primary geographic classification, e.g. postcode. Here, a common and important problem is to obtain estimates for a second target geography that overlaps with the primary geography but has not been collected from the respondents. We examine this problem when areal level census information is available for both geographic classifications. Such a situation is challenging from a small area estimation perspective for several reasons: there is a misalignment between the census and survey information as well as the geographical classifications; the geographic areas are potentially small and so prediction can be difficult because of the sparse or spatially missing data issue; and there is the possibility of non-stationary spatial dependence. To address these problems we develop a Bayesian model using latent processes, underpinned by a non-stationary spatial basis that combines Moran's I and multiresolution basis functions with a small but representative set of knots. The study results based on simulated data demonstrate that the model can be highly effective and gives more accurate estimates for areas defined by the target geography than several existing models. The model also performs well for the Australian National University poll data to predict on a second geographic classification: statistical area level 2.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call