Abstract

Survey data collection costs have risen to a point where many survey researchers and polling companies are abandoning large, expensive probability-based samples in favor of less expensive nonprobability samples. The empirical literature suggests this strategy may be suboptimal for multiple reasons, among them that probability samples tend to outperform nonprobability samples on accuracy when assessed against population benchmarks. Nonprobability samples are nonetheless often preferred for their convenience and lower cost. Instead of forgoing probability sampling entirely, we propose a method of combining probability and nonprobability samples within a Bayesian inferential framework that exploits the strengths of each to overcome their respective weaknesses. Using simulated data, we evaluate supplementing inferences based on small probability samples with prior distributions derived from nonprobability data. We demonstrate that informative priors based on nonprobability data can reduce the variances and mean squared errors of linear model coefficient estimates. The method is also illustrated with actual probability and nonprobability survey data. We conclude with a discussion of these findings, their implications for survey practice, and possible extensions of this research.
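
To make the proposal concrete, the following Python sketch illustrates the idea; it is our own illustration under assumed data-generating and prior choices, not the authors' code. A normal prior centered at the nonprobability-sample MLE is combined, via a standard conjugate update, with the likelihood from a small probability sample; the posterior variances of the regression coefficients shrink relative to the probability-only estimates.

    import numpy as np

    rng = np.random.default_rng(7)
    beta = np.array([1.0, 2.0, -0.5])          # true coefficients

    def draw(n, shift=0.0):
        # Design matrix with an intercept; `shift` mimics selection bias.
        X = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])
        return X, X @ (beta + shift) + rng.normal(size=n)

    X_p, y_p = draw(100)                       # small probability sample
    X_n, y_n = draw(2000, shift=0.3)           # large, biased nonprobability sample

    b_p, *_ = np.linalg.lstsq(X_p, y_p, rcond=None)   # MLE, probability sample
    b_n, *_ = np.linalg.lstsq(X_n, y_n, rcond=None)   # MLE, nonprobability sample

    # Informative prior centered at the nonprobability MLE; this simple
    # diagonal covariance widens with the apparent bias (a placeholder for
    # the paper's V and k0; see the sketch under Highlights).
    prior_prec = np.linalg.inv(np.diag((b_n - b_p) ** 2 + 0.01))

    # Standard conjugate normal update with the probability-sample likelihood.
    s2 = np.sum((y_p - X_p @ b_p) ** 2) / (len(y_p) - X_p.shape[1])
    post_cov = np.linalg.inv(X_p.T @ X_p / s2 + prior_prec)
    post_mean = post_cov @ (X_p.T @ y_p / s2 + prior_prec @ b_n)

    # Posterior variances shrink relative to the probability-only MLE.
    mle_var = np.diag(s2 * np.linalg.inv(X_p.T @ X_p))
    print(np.diag(post_cov) / mle_var)         # ratios below 1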

Highlights

  • For more than a decade, the survey research industry has witnessed increasing competition between two distinct sampling paradigms: probability and nonprobability sampling

  • To construct the prior from the nonprobability data, we propose scaling factors V and k0 that depend on the potential bias in the maximum likelihood estimator (MLE) of the coefficients from the nonprobability sample, assessed against the MLE from the probability sample (see the sketch following this list)

  • We evaluate a method of integrating relatively small probability samples with nonprobability samples to improve the efficiency and reduce the mean squared error for estimated regression coefficients
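
A minimal sketch of this prior construction follows. The paper's exact forms for V and k0 are not reproduced here, so the choices below (a prior covariance that inflates the nonprobability MLE covariance by the squared discrepancy between the two MLEs, with k0 as a user-set weight) are our assumptions for illustration.

    import numpy as np

    def nonprob_prior(X_n, y_n, X_p, y_p, k0=1.0):
        # MLEs from the nonprobability and probability samples.
        b_n, *_ = np.linalg.lstsq(X_n, y_n, rcond=None)
        b_p, *_ = np.linalg.lstsq(X_p, y_p, rcond=None)

        # Potential bias of the nonprobability MLE, assessed against the
        # probability-sample MLE.
        bias = b_n - b_p

        # Assumed form of V: the nonprobability MLE covariance inflated by
        # the squared bias, so the prior loosens as the nonprobability data
        # look more biased; k0 rescales the prior's overall weight.
        s2 = np.sum((y_n - X_n @ b_n) ** 2) / (len(y_n) - X_n.shape[1])
        V = s2 * np.linalg.inv(X_n.T @ X_n) + np.diag(bias ** 2)

        return b_n, V / k0    # prior mean and covariance for the coefficients

The returned mean and covariance can then be plugged into the conjugate update sketched after the abstract.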

Introduction

For more than a decade, the survey research industry has witnessed increasing competition between two distinct sampling paradigms: probability and nonprobability sampling. Probability sampling gives every population element a known, nonzero chance of selection, which in principle supports design-unbiased estimation. Nonprobability sampling, by contrast, involves some form of arbitrary selection of elements into the sample, for which inclusion probabilities are unknowable (and possibly zero for some population elements). In practice, even probability samples do not assure unbiased estimation, as response rates can be quite low. A further challenge of probability sampling is the need for large sample sizes for robust estimation, which can be problematic for survey organizations working with small- to medium-sized budgets.
