Comparative Analysis of Incomplete Business Data Clustering

Rongxuan Wang,Longao Weng

doi:10.54097/hset.v22i.3294

Rongxuan Wang, Longao Weng

Open Access

PDF Available

https://doi.org/10.54097/hset.v22i.3294

Copy DOI

Export

Save

Cite

Journal: Highlights in Science, Engineering and Technology	Publication Date: Dec 7, 2022
Citations: 1	License type: CC BY-NC 4.0

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

Incomplete values can significantly reduce the accuracy and usability of missing data. In particular, in analyzing commercial data sets, missing values often lead to the dilemma of data selection. It means that a common way to deal with missing data is to delete the sample that contains the missing attribute. However, this can lead to biased and invalidated conclusions, as some data are too critical to be omitted. Therefore, we should use some method to fill the data set rather than delete the data with missing values. The filling of missing data is divided into supervised learning and unsupervised learning. This paper compares six benchmark business datasets by adopting several different data imputation methods and supplementing the missing data with a clustering approach (unsupervised learning). The results are guided to dealing with incomplete business data.

Full Text