Income Imputation Research Articles

Abstract. The socioeconomic data, such as household income, is an important indicator of people’s well-being. However, due to the limited resource in many developing countries such as Thailand, the data obtained from household income surveys are often incomplete. As a result, the annual household survey usually contains a gap at the municipality household level. In this study, we aim to quantify the household income with K-NN imputation models at the sub-district level using satellite imageries and geospatial data as proxies to socioeconomic indicators. We examined the role of satellite and geospatial data in household income estimation, applied the K-NN imputation methods to estimate the missing income data by using various geographical and statistical variables, and quantified how these data improved the accuracy of sub-district household income estimation. Our results illustrated a significant correlation between sub-district household income and geographical data extracted from day-night satellite data, such as night light intensity (r = 0.53), urban density (r = 0.44), residential area (r = 0.68), urban area (r = 0.64), and statistical data as well as household expenditure (r = 0.97). These can be used to improve the socioeconomic indicators’ estimation as well as household income in sub-district level. The income imputation from geographical data perform better result than purely statistical variables. Especially, the night light intensity can infer the wealth of people living in large scale areas, while day-time satellite images can be interpreted for land use and land cover also implying socioeconomic status. Such socioeconomic proxy from space provides spatially explicit information in further study.

Missing data often occur in cross-sectional surveys and longitudinal and experimental studies. The purpose of this study was to compare the prediction of self-rated health (SRH), a robust predictor of morbidity and mortality among diverse populations, before and after imputation of the missing variable "yearly household income." We reviewed data from 4,162 participants of Mexican origin recruited from July 1, 2002, through December 31, 2005, and who were enrolled in a population-based cohort study. Missing yearly income data were imputed using three different single imputation methods and one multiple imputation under a Bayesian approach. Of 4,162 participants, 3,121 were randomly assigned to a training set (to derive the yearly income imputation methods and develop the health-outcome prediction models) and 1,041 to a testing set (to compare the areas under the curve (AUC) of the receiver-operating characteristic of the resulting health-outcome prediction models). The discriminatory powers of the SRH prediction models were good (range, 69-72%) and compared to the prediction model obtained after no imputation of missing yearly income, all other imputation methods improved the prediction of SRH (P < 0.05 for all comparisons) with the AUC for the model after multiple imputation being the highest (AUC = 0.731). Furthermore, given that yearly income was imputed using multiple imputation, the odds of SRH as good or better increased by 11% for each $5,000 increment in yearly income. This study showed that although imputation of missing data for a key predictor variable can improve a risk health-outcome prediction model, further work is needed to illuminate the risk factors associated with SRH.

Income Imputation Research Articles

Articles published on Income Imputation

Estimating intergenerational income mobility on sub-optimal data: a machine learning approach

SOCIOECONOMIC STATUS FROM SPACE: EXAMPLE OF ESTIMATING THAILAND’s SUB-DISTRICT HOUSEHOLD INCOME BASED ON REMOTELY SENSED AND GEOSPATIAL DATA

Estimating the value of lost recreation days from the Deepwater Horizon oil spill

Nonparametric Multiple Imputation for Questionnaires with Individual Skip Patterns and Constraints: The Case of Income Imputation in the National Educational Panel Study

Polarization and the Middle Class

Imputación de ingresos en la Gran Encuesta Integrada de Hogares (GEIH) de 2010

Eligibility and Take-up of the Medicare Part D Low-Income Subsidy

The Advantage of Imputation of Missing Income Data to Evaluate the Association Between Income and Self-Reported Health Status (SRH) in a Mexican American Cohort Study

Agriculture Income Assessment for the Purpose of Social Assistance: The Case of Ukraine

Valuation of Leasehold Interests: Their Worth, Exchange Value and Relation to Rent

The Effects of Income Imputation on Microanalyses: Evidence from the European Community Household Panel

Nonparametric Tests for the Independence of Regressors and Disturbances as Specification Tests

Do expenditures explain income? A study of variables for income imputation

Alternative Methods for CPS Income Imputation

Alternative Methods for CPS Income Imputation

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Income Imputation Research Articles

Articles published on Income Imputation

Estimating intergenerational income mobility on sub-optimal data: a machine learning approach

SOCIOECONOMIC STATUS FROM SPACE: EXAMPLE OF ESTIMATING THAILAND’s SUB-DISTRICT HOUSEHOLD INCOME BASED ON REMOTELY SENSED AND GEOSPATIAL DATA

Estimating the value of lost recreation days from the Deepwater Horizon oil spill

Nonparametric Multiple Imputation for Questionnaires with Individual Skip Patterns and Constraints: The Case of Income Imputation in the National Educational Panel Study

Polarization and the Middle Class

Imputación de ingresos en la Gran Encuesta Integrada de Hogares (GEIH) de 2010

Eligibility and Take-up of the Medicare Part D Low-Income Subsidy

The Advantage of Imputation of Missing Income Data to Evaluate the Association Between Income and Self-Reported Health Status (SRH) in a Mexican American Cohort Study

Agriculture Income Assessment for the Purpose of Social Assistance: The Case of Ukraine

Valuation of Leasehold Interests: Their Worth, Exchange Value and Relation to Rent

The Effects of Income Imputation on Microanalyses: Evidence from the European Community Household Panel

Nonparametric Tests for the Independence of Regressors and Disturbances as Specification Tests

Do expenditures explain income? A study of variables for income imputation

Alternative Methods for CPS Income Imputation

Alternative Methods for CPS Income Imputation