Abstract

AbstractEffective sample size accounts for the equivalent number of independent observations contained in a sample of correlated data. This notion has been widely studied in the context of univariate spatial variables. In that case, the effective sample size determines the reduction in the sample size due to the existing spatial correlation. In this paper, we generalize the methodology for multivariate spatial variables to provide a common effective sample size when all variables have been measured at the same locations. Together with the definition, we provide examples to investigate what an effective sample size looks like. An application for a soil contamination data set is considered. To reduce the dimensions of the process, clustering techniques are applied to obtain three bivariate vectors that are modeled using coregionalization models. Because the sample size of the data set is moderate and the locations are very unevenly distributed in the study area, the spatial analysis is challenging and interesting. We find that due to the presence of spatial autocorrelation, the sample size can be reduced by 38.53%, avoiding the duplication of information.Recommendations for Resource Managers: Before carrying out a sample survey with georeferenced data, it is essential to consider the impact of spatial correlation on sample size calculations. When the nature of the problem requires multivariate characteristics analysis, we provide a methodology to evaluate the effective sample size from a multivariate perspective. If the sample size is large, the effective sample size allows us to define the size of the subsample that should be used to preserve the theoretical properties of the estimation of the mean.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call