Abstract

This paper presents an optimal spatial sampling (OSS) design for fielding the demographic and health. The proposed design (a) develops a context specific sampling frame at a fine spatial resolution, (b) captures maximum spatial autocorrelation-controlled semivariance in the selected attribute (a composite index of population concentration and socio-economic characteristics in the context of this paper) of the sampling domain, (c) ensures spatial coverage and representation, (d) minimizes sample size, and (e) minimizes redundancy in the selection of sample sites. OSS was tested for drawing a sample for fielding a pilot General Social Survey (GSS) in Chicago metropolitan area (MSA) in the summer of 2010. Fine resolution LandScan population data, coupled with the U.S. Census data, were used to develop a multivariate contextual sampling frame. Our analysis suggests that a set of 97 sample sites captured 80% of the total spatial autocorrelation-controlled semivariance in the composite index used for optimizing sample sites. Maximizing spatial autocorrelation-controlled semivariance using OSS also ensured representation of the population variance.The OSS design outperformed other widely-used spatial sampling designs, such as Generalized Random Tessellation Stratified sampling (GRTS) in terms of spatial coverage and population representation. The domain (or area) of each optimal site, defined using the extent of local spatial autocorrelation, serves as a stratum and formulates bases for drawing inferences. The simulation experiment suggests that the relative efficiency of the OSS was better than that of other sampling designs. However, for a skewed quantity the efficiency of OSS drops and prediction bias (measured by percent difference between observed and predicted mean) increases. Therefore, it is important that the variable used for optimization of sample sites is normalized to achieve the best performance of the OSS.Various methods, including reverse geocoding, can be used to develop enumeration list and draw respondent(s) from each stratum. Geocoding respondent is also useful for the collection of multi-layer socio-physical contextual data at reduced cost. This, in turn, is likely to extend the scope of the survey data to a multi-level, interdisciplinary setting.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call