Abstract

Tian et al. (2007) introduced a so-called hidden sensitivity model for evaluating the association of two sensitive questions with binary outcomes. However, in practice, we sometimes need to assess the association between one sensitive binary variable (e.g., whether or not a drug user, the number of sex partner being⩽1 or >1, and so on) and one nonsensitive binary variable (e.g., good or poor health status, with or without cervical cancer, and so on). To address this issue, by sufficiently utilizing the information contained in the non-sensitive binary variable, in this paper, we propose a new survey scheme, called combination questionnaire design/model, which consists of a main questionnaire and a supplemental questionnaire. The introduction of the supplemental questionnaire which is indeed a design of direct questioning can effectively reduce the noncompliance behavior since more respondents will not be faced with the sensitive question. Likelihood-based inferences including maximum likelihood estimates via the expectation-maximization algorithm, asymptotic confidence intervals, and bootstrap confidence intervals of parameters of interest are derived. A likelihood ratio test is provided to test the association between the two binary random variables. Bayesian inferences are also discussed. Simulation studies are performed, and a cervical cancer data set in Atlanta is used to illustrate the proposed methods.

Highlights

  • Warner [1] introduced a randomized response technique to obtain truthful answers to questions with sensitive attributes

  • By sufficiently utilizing the information contained in the non-sensitive binary variable, in this paper, we propose a new survey scheme, called combination questionnaire design/model, which consists of a main questionnaire and a supplemental questionnaire

  • The introduction of the supplemental questionnaire which is a design of direct questioning can effectively reduce the noncompliance behavior since more respondents will not be faced with the sensitive question

Read more

Summary

Introduction

Warner [1] introduced a randomized response technique to obtain truthful answers to questions with sensitive attributes. Kim and Warde [6] considered a multinomial randomized response model which can handle untruthful responses They derived the Pearson product moment correlation estimator which may be used to quantify the linear relationship between two variables when multinomial response data are observed according to a randomized response procedure. We could directly adopt the hidden sensitivity model, the information contained in the nonsensitive binary variable cannot be utilized in the design Such information can be used to enhance the degree of privacy protection, so that more respondents will not be faced with the sensitive question. A discussion and an appendix on the mode of a group Dirichlet density and a sampling method from it are presented

The Survey Design
Likelihood-Based Inferences
Bayesian Inferences
Simulation Studies
Analyzing Cervical Cancer Data in Atlanta
Findings
Discussion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call