Abstract

Missing value imputation (MVI) is the major solution method for dealing with incomplete dataset problems in which the missing attribute values are replaced from a chosen set of observed data using some statistical methods, such as mean/mode, machine learning, or support vector machine methods. Although machine learning MVI approaches may produce reasonably good imputation results, they usually require larger imputation times than statistical approaches. In this paper, a Class Center based Missing Value Imputation (CCMVI) approach is introduced for producing effective imputation results more efficiently. It is based on measuring the class center of each class and then the distances between it and the other observed data are used to define a threshold for the later imputation. The experimental results based on numerical, categorical, and mixed data types of datasets show that the proposed CCMVI approach outperforms the other MVI approaches for both numerical and mixed datasets. In addition, it requires much less imputation time than the machine learning MVI methods.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.