Regression Analysis of Data from Complex Surveys

D Holt,P D Winter,T M F Smith

doi:10.2307/2982065

Abstract

SUMMARY Three methods of carrying out a regression analysis of data collected by means of a survey of complex design are investigated. Least squares methods which ignore population structure such as clustering or stratification can give seriously misleading results. Probability weighted methods are much better and give reasonable inferences for equal probability designs. However, for designs with widely differing selection probabilities the inferences can be poor. The best results were obtained for an estimator derived from maximum likelihood theory. This estimator requires that values of a design variable be known for all units in the population. Some aspects of the robustness of this procedure are studied. REGRESSION analysis is widely used in the analysis of data derived from a sample survey of complex design. McKennell (1970) describes a survey of residents around Heathrow Airport in which a stratified design with unequal sampling fractions was employed. A regression equation was fitted to the survey data which related the respondents' subjective attitudes to noise to various measures of physical exposure. A simplified version of this equation, the Noise and Number Index, now features prominently in discussions on the siting of future airports. In a subsequent survey (HMSO, 1971) a stratified cluster sample was employed and similar regression analyses were carried out. DeMets and Halperin (1977) employ regression analysis on data from a purposive sample of patients in the Framingham Heart Study. They were interested in the effect of dietary cholesterol on serum cholesterol, both measured concurrently, and based the sample on patients with the highest and lowest values of initial serum cholesterol level. This paper is concerned with the question of what advice to give clients who wish to estimate regression parameters and to calculate the variances of their estimates using data obtained from a survey of complex design. We assume that the appropriateness of the regression model is not in question and that in practice diagnostic plots and checks would be made to confirm this. We assume further that the statistician and his client agree that the appropriate model for study is a single equation fitted to all the data rather than separate equations fitted to subsets of the data. If separate models are fitted to subsets of the data in such a way that the data may be divided into mutually exclusive groups then the analysis of each group would still fall within the scope of this paper.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Regression Analysis of Data from Complex Surveys

Abstract

Talk to us

Similar Papers

More From: Journal of the Royal Statistical Society. Series A (General)

Lead the way for us

Journal: Journal of the Royal Statistical Society. Series A (General)	Publication Date: Jan 1, 1980
Citations: 153

Similar Papers

Plasma concentration of C-reactive protein and risk of ischemic stroke and transient ischemic attack: the Framingham study.
N S Rost ... J M Massaro
Stroke | VOL. 32
N S Rost, et. al.N S Rost ... J M Massaro
01 Nov 2001
Stroke | VOL. 32

Cardiovascular risk assessment based on US cohort studies: findings from a National Heart, Lung, and Blood institute workshop.
Scott M Grundy ... Lawrence M Friedman
Circulation | VOL. 104
Scott M Grundy, et. al.Scott M Grundy ... Lawrence M Friedman
24 Jul 2001
Circulation | VOL. 104

On the Use of Repeated Measurements in Regression Analysis with Dichotomous Responses
Margaret Wu ... James H Ware
Biometrics | VOL. 35
Margaret Wu, et. al.Margaret Wu ... James H Ware
01 Jun 1979
Biometrics | VOL. 35

Effect of Dietary Protein and Cholesterol on Cholesterol Concentration and Lipoprotein Pattern in the Serum of Chickens
Maria A.E Mol ... Clive E West
The Journal of Nutrition | VOL. 112
Maria A.E Mol, et. al.Maria A.E Mol ... Clive E West
01 Jun 1982
The Journal of Nutrition | VOL. 112

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Regression Analysis of Data from Complex Surveys

Abstract

Talk to us

Similar Papers

More From: Journal of the Royal Statistical Society. Series A (General)