Abstract

When doing multivariate data analysis, one common obstacle is the presence of incomplete observations, i.e., observations for which one or more key fields are missing data. Missing data is often countered by deleting entire observations that contain missing data. The negative effects of deleting entire observations are multiple: deleting observations reduces sample size and can also result in biased inferences even if data is missing at random. In addition, knowledge contained within incomplete observations is knowledge lost when they are deleted- and the effort spent collecting that knowledge is effort wasted. Data imputation methods, or methods of statistically “filling-in” missing data, can help combat small sample sizes by using the existing information in partially complete observations with the end goal of producing less biased and higher confidence inferences. When a sample from a multivariate normal population is only partially complete, and the missing data meets appropriate assumptions (missing at random), robust data imputation of the missing data can be implemented with monotone data augmentation (MDA) using the multivariate t distribution. Missing data imputation is applied to data from the NASA Instrument Cost Model (NICM) using the MDA algorithm under the assumption of having a multivariate t distribution with fixed degrees of freedom. A sensitivity analysis to the degrees of freedom parameter is presented to demonstrate robustness of the multivariate t distribution when dealing with small samples as compared to the multivariate normal distribution.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.