Two-Part and Related Regression Models for Longitudinal Data.

V.T Farewell,B.D.M Tom,S Yiu,L Su,D.L Long

doi:10.1146/annurev-statistics-060116-054131

V.T Farewell, B.D.M Tom + Show 3 more

Open Access

https://doi.org/10.1146/annurev-statistics-060116-054131

Copy DOI

Abstract

Statistical models that involve a two-part mixture distribution are applicable in a variety of situations. Frequently, the two parts are a model for the binary response variable and a model for the outcome variable that is conditioned on the binary response. Two common examples are zero-inflated or hurdle models for count data and two-part models for semicontinuous data. Recently, there has been particular interest in the use of these models for the analysis of repeated measures of an outcome variable over time. The aim of this review is to consider motivations for the use of such models in this context and to highlight the central issues that arise with their use. We examine two-part models for semicontinuous and zero-heavy count data, and we also consider models for count data with a two-part random effects distribution.

Highlights

Statistical analysis based on two-part models arises in a variety of contexts
The two parts are a model for the binary response variable and a model for the outcome variable that is conditioned on the binary response
In this article we focus on this specific type of two-part models, as well as models with a comparable two-part structure for a random effects distribution in longitudinal settings

Summary

INTRODUCTION

Statistical analysis based on two-part models arises in a variety of contexts. A simple, but common and useful, version of such models involves a model for a binary indicator variable and a model for another response variable given that the binary indicator takes one of the indicator’s two values. The structure was introduced by Cohen (1963) and given by Johnson & Kotz (1969), but was popularized by Lambert (1992), who provided an excellent introduction with regression formulations These models, the so-called zero-inflated Poisson (ZIP) models and their variants, combine a Poisson (or other distributions for count data) variable with a binary indicator variable for outcome, taking the value zero to accommodate the excess zeros that cannot be captured by the Poisson distribution. Other issues with twopart models, such as the interpretation of regression coefficients, may be even more problematic in the longitudinal settings We address these issues and some approaches to dealing with them, predominately in the context of specific two-part models described in this article, and for other similar models. Some important issues in the use of two-part models in longitudinal settings are highlighted and discussed (Section 8), and two primary examples from studies on psoriatic arthritis (PsA) (Sections 9, 10) and risky sexual behavior among HIV-positive individuals (Section 10) are presented to illustrate the use of the two-part models with particular emphasis on the issues raised

Quality of Life in Patients with Psoriatic Arthritis

Permanent Joint Damage in Patients with Psoriatic Arthritis

TWO-PART MIXED MODELS FOR LONGITUDINAL SEMICONTINUOUS DATA

Model Formulation

Model Estimation

ZERO-INFLATED POISSON MODELS WITH RANDOM EFFECTS FOR LONGITUDINAL COUNT DATA

Patient-Level Random Effects Models

Patient- and Observation-Level Random Effects Models

Correlated Random Effects and Potential Bias in Estimation

Marginal Inferences in Two-Part Models with Random Effects

The Concept of Two Populations

Potential Bias with a Misspecified Model for Random Effects

Marginal Covariate Effects

Marginal Covariate Effects for the Binary Part

Marginal Covariate Effects for the Continuous Part and for the Overall Mean

10. EVALUATING THE MOTIVATIONAL INTERVIEW INTERVENTION IN THE SAFETALK STUDY

10.1. Marginalized Zero-Inflated Poisson Models with Random Effects

10.2. Comparison with Traditional Zero-Inflated Poisson Models with Random Effects

11. MOVER-STAYER MODELS FOR DAMAGE IN PSORIATIC ARTHRITIS

11.1. Poisson Mover-Stayer Models

11.2. Negative Binomial Mover-Stayer Models

12. FINAL REMARKS

Findings

Methods

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Annual Review of Statistics and Its Application	Publication Date: Mar 7, 2017
Citations: 88	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Two-Part and Related Regression Models for Longitudinal Data.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Annual Review of Statistics and Its Application

Lead the way for us

Similar Papers

Bayesian Analysis of Semiparametric Mixed-Effects Models for Zero-Inflated Count Data
Chen Xue-Dong
Communications in Statistics - Theory and Methods | VOL. 38
Chen Xue-DongChen Xue-Dong
20 May 2009
Communications in Statistics - Theory and Methods | VOL. 38

Models for Overdispersion Count Data with Generalized Distribution: An Application to Parasites Intensity
Öznur İşçi̇ Güneri̇ ... Burcu Durmuş
Journal of New Theory | VOL. -
Öznur İşçi̇ Güneri̇, et. al.Öznur İşçi̇ Güneri̇ ... Burcu Durmuş
30 Jun 2021
Journal of New Theory | VOL. -

Zero-Inflated Models for Count Data: An Application to Number of Antenatal Care Service Visits
Daniel Biftu Bekalo ... Dufera Tejjeba Kebede
Annals of Data Science | VOL. 8
Daniel Biftu Bekalo, et. al.Daniel Biftu Bekalo ... Dufera Tejjeba Kebede
23 May 2021
Annals of Data Science | VOL. 8

Functional linear models for zero-inflated count data with application to modeling hospitalizations in patients on dialysis.
Damla Sentürk ... Danh V Nguyen
Statistics in medicine | VOL. 33
Damla Sentürk, et. al.Damla Sentürk ... Danh V Nguyen
19 Jun 2014
Statistics in medicine | VOL. 33

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Two-Part and Related Regression Models for Longitudinal Data.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Annual Review of Statistics and Its Application