Abstract

A common task in data mining is the visualization of multivariate objects on scatterplots, allowing human observers to perceive subtle inter-relations in the dataset such as outliers, groupings or other regularities. Multidimensional scaling (MDS) is a well known exploratory data analysis family of techniques that produce one display on which inter-object similarity relationships are preserved. The algorithm scales with the square of the number of visualized data, which limits its application to small datasets. In order to alleviate this limitation, we associate MDS with three different clustering models, namely the learning vector quantization, the k-means and the dendrograms. We propose to perform dimensionality reduction on a reduced set of cluster centers, to which the data are added using a relative MDS mapping. Our experiments show that this approach allows to obtain displays of large datasets with fairly good visualization properties, when compared with the display obtained by a direct mapping of the whole dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.