Defining, Evaluating, and Removing Bias Induced by Linear Imputation in Longitudinal Clinical Trials with MNAR Missing Data

Ronald W Helms,Laura Helms Reece,Russell W Helms,Mary W Helms

doi:10.1080/10543406.2011.550097

Ronald W Helms, Laura Helms Reece + Show 2 more

Open Access

PDF Available

https://doi.org/10.1080/10543406.2011.550097

Copy DOI

Export

Save

Cite

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

Missing not at random (MNAR) post-dropout missing data from a longitudinal clinical trial result in the collection of “biased data,” which leads to biased estimators and tests of corrupted hypotheses. In a full rank linear model analysis the model equation, E[ Y ] = X β, leads to the definition of the primary parameter β = (X′X)−1 X′E[ Y ], and the definition of linear secondary parameters of the form θ = L β = L(X′X)−1 X′E[ Y ], including, for example, a parameter representing a “treatment effect.” These parameters depend explicitly on E[ Y ], which raises the questions: What is E[ Y ] when some elements of the incomplete random vector Y are not observed and MNAR, or when such a Y is “completed” via imputation? We develop a rigorous, readily interpretable definition of E[ Y ] in this context that leads directly to definitions of β, , , and the extent of hypothesis corruption. These definitions provide a basis for evaluating, comparing, and removing biases induced by various linear imputation methods for MNAR incomplete data from longitudinal clinical trials. Linear imputation methods use earlier data from a subject to impute values for post-dropout missing values and include “Last Observation Carried Forward” (LOCF) and “Baseline Observation Carried Forward” (BOCF), among others. We illustrate the methods of evaluating, comparing, and removing biases and the effects of testing corresponding corrupted hypotheses via a hypothetical but very realistic longitudinal analgesic clinical trial.

Full Text