Contribution to the analysis of complex survey data and cluster-correlated biological data using inverse sampling

Emmanuel Benhin

doi:10.22215/etd/2004-05924

Abstract

This thesis addresses two main areas of statistical application. The first part of the thesis focuses on the analysis of complex survey data using inverse sampling. In the second part of the thesis some new ideas of analyzing cluster-correlated biological data are discussed. Part I of this thesis discusses various inverse sampling schemes, some theory of inverse sampling and explore its strengths and weaknesses. We propose an estimating equations approach for handling complex parameters, such as ratios and “census” regression parameters, which naturally extends to poststratification estimation. We also explore the use of inverse sampling analyses of categorical complex survey data. Inverse sampling methods for testing hypotheses using Wald and Quasi-score tests for complex survey data are also studied. The first part of Part II of this thesis discusses analyses of cluster-correlated biological data. We present some theory of the proposed methods and explore potential areas of application. In the second part of Part II of this thesis, we discuss some methods of analyzing cluster-correlated binary response data. Conditional logistic regression (CLR) method for analyzing cluster-correlated binary response data implicitly assumes that the dependence arising from random cluster effects is a nuisance and all unmeasured cluster-specific risk factors are aggregated into a cluster-specific baseline. It is however, invalid when these assumptions fail. We propose an alternative method to rectify these shortcomings: mean conditional estimating equation. Some properties and basic theory of the proposed method are discussed. The final chapter of Part II of this thesis discusses estimation methods for analyzing data having two types of correlation: within-cluster correlation and longitudinal correlation where the cluster sizes may be ignorable or nonignorable. We first study the effect on the efficiency of the regression parameter estimators when the assumed working correlation structures depart from the true correlation structure. Secondly, we demonstrate that methods that implicitly assume ignorable cluster sizes can lead to asymptotically invalid inferences when this assumption fails. We study properties of proposed alternative approaches that lead to asymptotically valid inferences when cluster sizes are ignorable or nonignorable. (Abstract shortened by UMI.)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Contribution to the analysis of complex survey data and cluster-correlated biological data using inverse sampling

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Methods for analysis of complex survey data: an application using the Tanzanian 2015 Demographic and Health Survey and Service Provision Assessment.
Ashley Sheffel ... Scott Zeger
Journal of Global Health | VOL. 9
Ashley Sheffel, et. al.Ashley Sheffel ... Scott Zeger
01 Dec 2019
Journal of Global Health | VOL. 9

Analysis of Complex Sample Survey Data
Eun Sul Lee ... Ronald N Forthofer
Sociological Methods & Research | VOL. 15
Eun Sul Lee, et. al.Eun Sul Lee ... Ronald N Forthofer
01 Nov 1986
Sociological Methods & Research | VOL. 15

Classic Linear Mediation Analysis of Complex Survey Data Using Balanced Repeated Replication
Yujiao Mai ... Hui Zhang
-
Yujiao Mai, et. al.Yujiao Mai ... Hui Zhang
01 Jan 2020
01 Jan 2020

Analysis of complex survey data using SAS
Stephen R Cole
Computer Methods and Programs in Biomedicine | VOL. 64
Stephen R ColeStephen R Cole
15 Nov 2000
Computer Methods and Programs in Biomedicine | VOL. 64

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Contribution to the analysis of complex survey data and cluster-correlated biological data using inverse sampling

Abstract

Talk to us

Similar Papers