Abstract

Nonparametric methods are rich classes of statistical tools that have gained acceptance in most areas of statistics. They have been used in the past by researchers to fit missing values in the presence of auxiliary variables in a sampling survey. Nonparametric methods have been preferred to parametric methods because they make it possible to analyze data, estimate trends and conduct inference without having to fully specify a parametric model for the data. This study, therefore, presents some new attempts in the complex survey through the nonparametric imputation of missing values by the use of both penalized splines and neural networks. More precisely, the study adopted a neural network and penalized splines to estimate the functional relationship between the survey variable and the auxiliary variables. This complex survey data was sampled through a cluster - strata design where a population is divided into clusters which are in turn subdivided into strata. Once missing values have been imputed, this study performs a model calibration with auxiliary information assumed completely available at the cluster level. The reasoning behind model calibration is that if the calibration constraints are satisfied by the auxiliary variable, then it is expected that the fitted values of the variable of interest should satisfy such constraints too. The population total estimators are derived by treating the calibration problems at cluster level as optimization problems and solving it by the method of penalty functions. A Monte Carlo simulation was run to assess the finite sample performance of the estimators under complex survey data. The efficiency of the estimator’s performance was then checked by MSE criterion. A comparison of the penalized spline model calibration and neural network model calibration estimators was done with Horvitz Thompson estimator. From the results, the two nonparametric estimator’s performances seem closer to that of Horvitz Thompson estimator and are both unbiased and consistent.

Highlights

  • The concept of auxiliary variables in the present scholarship in statistics denotes independent or predictor variables in a regression analysis

  • This study considered modelling h(xi ) in equation (1) by way of both penalized splines and neural network and performed model calibration on h(xi ) in case when auxilliary information is available at cluster level

  • Using the fitted values in equation (9) and (10) this study proposed two types of model calibrated population total estimator based on Neural Network and based on the penalized splines; yPS for auxiliary variable available at cluster level and based on cluster-strata design with a general form as;

Read more

Summary

Introduction

The concept of auxiliary variables in the present scholarship in statistics denotes independent or predictor variables in a regression analysis. Stage one was to obtain point estimation of the missing values using penalized splines and neural network based on the auxiliary information at cluster levels. A distance measure defined on some design weights thought to be close to the inclusion probabilities is minimized subject to some calibration constraints imposed on the fitted values of the study variable. [6] proposed the use of nonparametric method to obtain the fitted values They used neural networks and local polynomial in fitting the missing values for clustered data in one-stage sampling. It is, important to consider other techniques of recovering the fitted values like penalized splines and neural networks when data is complex as discussed 2.0 of this paper

Fitting of Missing Values
Penalty Function Method of Obtaining the Weights
Empirical Analysis and Discussions
Normality Test
Results for Population total Estimates
Results of Variances and Variance Ratios for Various Sample Size
Relative Bias
Results on Mean Squared Errors for Various Sample Sizes
MSE and MSE Ratio Graphs for Various Sample Sizes
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.