Abstract

The aim of data science is to catch up with the data-intensive life style as well as the demand for decision support, which becomes common in various domains such as medical, education, and other smart solutions. As such, high quality of data analysis is greatly desired for accurate and effective downstreaming exploitations. Specific to data clustering, vast amounts of works have concentrated on modeling a distance metric and a clustering algorithm, with the assumption of a complete data. However, this might not always be the case as missing values can occur in the dataset under examination. Instead of filling in these values using an imputation method, a recent study successfully makes use of the consensus clustering to overcome the problem without committing an explicit imputation procedure. This paper extends the previous framework to link-based consensus clustering that provides a more refined summarization of cluster ensemble, hence the resulting data partition. It exhibits a promising performance on several benchmark data collections obtained from UCI repository.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.