Abstract

In missing data analysis, there is often a need to assess the sensitivity of key inferences to departures from untestable assumptions regarding the miss- ing data process. Such sensitivity analysis often requires specifying a missing data model that commonly assumes parametric functional forms for the predictors of missingness. In this paper, we relax the parametric assumption and investigate the use of a generalized additive missing data model. We also consider the possi- bility of a nonlinear relationship between missingness and the potentially missing outcome, whereas the existing literature commonly assumes a more restricted lin- ear relationship. To avoid computational complexity, we adopt an index approach for local sensitivity. We derive explicit formulas for the resulting semiparametric sensitivity index. The computation of the index is simple, and completely avoids the need to repeatedly fit the semiparametric nonignorable model. Only estimates from the standard software analysis are required, with a moderate amount of ad- ditional computation. Thus, the semiparametric index provides a fast and robust method to adjust the standard estimates for nonignorable missingness. An ex- tensive simulation study is conducted to evaluate the effects of misspecifying the missing data model and to compare the performance of the proposed approach with the commonly used parametric approaches. The simulation study suggests that the proposed method helps reduce bias that might arise from the misspecification of the functional forms of predictors in the missing data model. We illustrate the method in a Wage Offer dataset.

Highlights

  • Missing data arise frequently in studies across different disciplines, including public health, medicine, economics, business, and the social sciences

  • We propose a data-driven procedure to adaptively choose the functional forms of the continuous predictors

  • We propose using a semiparametric approach to adaptively choose the functional form of the continuous predictors for missingness

Read more

Summary

Introduction

Missing data arise frequently in studies across different disciplines, including public health, medicine, economics, business, and the social sciences. The direct estimation can yield valid inferences when the model is correctly specified, its computation is heavy and requires specialized programming Such a joint selection model is often weakly or non-identified (Little (1995), Troxel (1998), Troxel, Harrington and Lipsitz (1998), Chen and Ibrahim (2006), and the results can be highly sensitive to untestable model assumptions (Kenward (1998)).

Selection Model for Nonignorable Missingness
ISNI Methodology
Extending ISNI Using a Generalized Additive Model
A Comparison Using Simulated Data
An Application
Discussion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call