Probability Sampling in Matched Case-Control Study in Drug Abuse

Surya Raj Niraula, Frederick A Connell

doi:10.6000/1929-6029.2018.07.01.3

Abstract

Although random sampling is generally considered the gold standard for population-based research, most drug abuse research relies on non-random sampling despite its well-known limitations. We compared the statistical properties of two surveys of drug abuse in the same community: one used snowball sampling of drug users, who then identified “friend controls”; the other used a random sample of non-drug users (controls), who then identified “friend cases”. Models to predict drug abuse from risk factors were developed for each data set using conditional logistic regression. Bootstrap analysis of the random-sample data set showed less variation and did not change the significance of the predictors when compared with the non-bootstrap analysis. When the model derived from the random-sample data set was fitted to each data set, the areas under the ROC curve were similar (0.93 for random-sample data vs. 0.91 for snowball-sample data, p=0.35); however, when the model derived from the snowball-sample data set was fitted to each data set, the areas under the curve differed significantly (0.98 vs. 0.83, p<0.001). The proposed method of random sampling of controls appears to be statistically superior to snowball sampling and may represent a viable alternative to it.
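For readers unfamiliar with the two summary tools named above, the following is a minimal sketch of how an area under the ROC curve and a bootstrap interval for it can be computed. It is not the authors' analysis code: the scores are synthetic, the function names `auc` and `bootstrap_auc_interval` are hypothetical, and resampling whole matched pairs is an assumption made here to respect the matched design.

```python
import random

def auc(case_scores, control_scores):
    """Area under the ROC curve, computed as the probability that a
    case scores higher than a control (ties count as one half)."""
    wins = 0.0
    for c in case_scores:
        for k in control_scores:
            if c > k:
                wins += 1.0
            elif c == k:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))

def bootstrap_auc_interval(pairs, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the AUC.

    `pairs` is a list of (case_score, control_score) tuples. Whole pairs
    are resampled with replacement so matched sets stay together
    (an assumption of this sketch, not a detail taken from the paper).
    """
    rng = random.Random(seed)
    stats = []
    for _ in range(n_boot):
        sample = [rng.choice(pairs) for _ in pairs]
        stats.append(auc([p[0] for p in sample], [p[1] for p in sample]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper

# Synthetic predicted risk scores for 60 matched pairs (illustrative only).
rng = random.Random(42)
pairs = [(rng.gauss(1.5, 1.0), rng.gauss(0.0, 1.0)) for _ in range(60)]

point = auc([p[0] for p in pairs], [p[1] for p in pairs])
low, high = bootstrap_auc_interval(pairs)
print(f"AUC = {point:.2f}, 95% bootstrap interval = ({low:.2f}, {high:.2f})")
```

A narrower interval indicates less sampling variation in the estimate, which is the sense in which a bootstrap analysis can be said to show "less variation" for one data set than for another.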
