Comparison of generalized estimating equations and quadratic inference functions using data from the National Longitudinal Survey of Children and Youth (NLSCY) database.

Adefowope Odueyungbo, Dillon Browne, Lehana Thabane, Noori Akhtar-Danesh

doi:10.1186/1471-2288-8-28

Abstract

Background: The generalized estimating equations (GEE) technique is often used in longitudinal data modeling, where investigators are interested in population-averaged effects of covariates on responses of interest. GEE involves specifying a model relating covariates to outcomes and a plausible correlation structure between responses at different time periods. While GEE parameter estimates are consistent irrespective of the true underlying correlation structure, the method has limitations, including difficulty with model selection because it lacks absolute goodness-of-fit tests for comparing several plausible models. The quadratic inference functions (QIF) method extends the capabilities of GEE while also addressing some of its limitations.

Methods: We conducted a comparative study of GEE and QIF via an illustrative example, using data from the "National Longitudinal Survey of Children and Youth (NLSCY)" database. The NLSCY dataset consists of long-term, population-based survey data collected since 1994, and is designed to evaluate the determinants of developmental outcomes in Canadian children. We modeled the relationship between hyperactivity-inattention and gender, age, family functioning, maternal depression symptoms, household income adequacy, maternal immigration status and maternal educational level using GEE and QIF. The bases for comparison were: (1) ease of model selection; (2) sensitivity of results to different working correlation matrices; and (3) efficiency of parameter estimates.

Results: The sample included 795,858 respondents (50.3% male; 12% immigrant; 6% from dysfunctional families). QIF analysis revealed that gender (male) (odds ratio [OR] = 1.73; 95% confidence interval [CI] = 1.10 to 2.71), family dysfunction (OR = 2.84, 95% CI of 1.58 to 5.11), and maternal depression (OR = 2.49, 95% CI of 1.60 to 2.60) were significantly associated with higher odds of hyperactivity-inattention. The results remained robust under GEE modeling. Model selection was facilitated in QIF using a goodness-of-fit statistic. Overall, estimates from QIF were more efficient than those from GEE using AR(1) and exchangeable working correlation matrices (relative efficiency = 1.1117 and 1.3082, respectively).

Conclusion: QIF is useful for model selection and provides more efficient parameter estimates than GEE. QIF can help investigators obtain more reliable results when used in conjunction with GEE.
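As a reading aid for the odds ratios reported in the Results, the following minimal sketch shows how an odds ratio and its confidence interval relate to a coefficient on the log-odds scale. It assumes the intervals are symmetric Wald-type intervals on the log scale with a critical value of 1.96, which the abstract does not state; the function names are hypothetical and are not taken from the paper.

```python
import math


def log_odds_summary(odds_ratio, ci_lower, ci_upper, z=1.96):
    """Recover the log-odds coefficient and an approximate standard error
    from a reported odds ratio and its 95% confidence interval.

    Assumes a Wald-type interval, exp(beta -/+ z * se); this is an
    assumption, not something stated in the abstract.
    """
    beta = math.log(odds_ratio)
    se = (math.log(ci_upper) - math.log(ci_lower)) / (2 * z)
    return beta, se


def odds_ratio_ci(beta, se, z=1.96):
    """Map a log-odds coefficient and standard error back to an odds ratio
    and its confidence limits."""
    return (
        math.exp(beta),
        math.exp(beta - z * se),
        math.exp(beta + z * se),
    )


# Reported QIF estimate for gender (male): OR = 1.73, 95% CI 1.10 to 2.71.
beta_male, se_male = log_odds_summary(1.73, 1.10, 2.71)
or_male, lower_male, upper_male = odds_ratio_ci(beta_male, se_male)

print(f"beta = {beta_male:.3f}, se = {se_male:.3f}")
print(f"OR = {or_male:.2f}, 95% CI = {lower_male:.2f} to {upper_male:.2f}")
```

The round trip reproduces the reported interval only approximately, because the published limits are rounded to two decimals.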

Highlights

The generalized estimating equations (GEE) technique is often used in longitudinal data modeling, where investigators are interested in population-averaged effects of covariates on responses of interest
In these illustrations we model the relationship between a binary response variable and covariates such as child's age and gender, family functioning, maternal depression symptoms, household
It would be interesting to compare the goodness-of-fit tests provided by quadratic inference functions (QIF) to those provided by Barnhart and Williamson [5] and Horton et al [4] in GEE

Summary

Introduction

The generalized estimating equations (GEE) technique is often used in longitudinal data modeling, where investigators are interested in population-averaged effects of covariates on responses of interest. GEE involves specifying a model relating covariates to outcomes and a plausible correlation structure between responses at different time periods. While GEE parameter estimates are consistent irrespective of the true underlying correlation structure, the method has some limitations that include challenges with model selection due to lack of absolute goodness-of-fit tests to aid comparisons among several plausible models. Investigators often encounter situations in which plausible statistical models for observed data require an assumption of correlation between successive measurements on the same subjects (longitudinal data) or related subjects (clustered data) enrolled in clinical studies. Statistical models that fail to account for correlation between repeated measures are likely to produce invalid inferences since parameter estimates may not be consistent and standard error estimates may be wrong [1].
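For readers who want the two estimators side by side, here is a sketch of the usual textbook formulation of GEE and QIF. The notation is an assumption introduced for illustration and does not appear on this page: $y_i$ and $\mu_i(\beta)$ are the response vector and its mean for subject $i$, $D_i = \partial \mu_i / \partial \beta^\top$, $A_i$ is the diagonal matrix of marginal variances, $R(\alpha)$ is the working correlation matrix, and $M_1, \dots, M_m$ are known basis matrices.

```latex
% GEE: solve the estimating equations under a working correlation R(\alpha)
\sum_{i=1}^{N} D_i^\top V_i^{-1} \bigl( y_i - \mu_i(\beta) \bigr) = 0,
\qquad
V_i = A_i^{1/2} \, R(\alpha) \, A_i^{1/2}.

% QIF: approximate the inverse working correlation by basis matrices,
% R^{-1}(\alpha) \approx \sum_{j=1}^{m} a_j M_j,
% and stack the resulting score components into an extended score
\bar{g}_N(\beta) = \frac{1}{N} \sum_{i=1}^{N} g_i(\beta),
\qquad
g_i(\beta) =
\begin{pmatrix}
D_i^\top A_i^{-1/2} M_1 A_i^{-1/2} \bigl( y_i - \mu_i(\beta) \bigr) \\
\vdots \\
D_i^\top A_i^{-1/2} M_m A_i^{-1/2} \bigl( y_i - \mu_i(\beta) \bigr)
\end{pmatrix}.

% The QIF estimator minimizes the quadratic form
Q_N(\beta) = N \, \bar{g}_N(\beta)^\top C_N^{-1}(\beta) \, \bar{g}_N(\beta),
\qquad
C_N(\beta) = \frac{1}{N} \sum_{i=1}^{N} g_i(\beta) \, g_i(\beta)^\top.
```

In this formulation the coefficients $a_j$ never need to be estimated, and the minimized value of $Q_N$ is what serves as a goodness-of-fit statistic for comparing candidate models.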



Journal: BMC Medical Research Methodology	Publication Date: May 9, 2008
Citations: 30	License type: cc-by


Similar Papers

An evaluation of quadratic inference functions for estimating intervention effects in cluster randomized trials
Hengshi Yu ... Elizabeth L Turner
Contemporary Clinical Trials Communications | VOL. 19
05 Jul 2020

The effect of cluster size imbalance and covariates on the estimation performance of quadratic inference functions
Philip M. Westgate ... Thomas M. Braun
Statistics in Medicine | VOL. 31
13 Mar 2012

Sample size determination for quadratic inference functions in longitudinal design with dichotomous outcomes
Youna Hu ... Peter X.‐K Song
Statistics in Medicine | VOL. 31
23 Feb 2012

Parenting-by-gender interactions in child psychopathology: attempting to address inconsistencies with a Canadian national database
Dillon T Browne ... Carolyn Byrne
Child and Adolescent Psychiatry and Mental Health | VOL. 4
27 Jan 2010
