Abstract

Cyber-Physical-Social Systems (CPSS) support many analysis tasks and personalized services, and two major challenges in them are clustering large-scale multi-source data and generating multiple distinct clusterings for different applications. To address these challenges, this paper first presents two simple multiple clusterings methods that produce different clustering results according to arbitrarily selected combinations of features. The first, similarity matrices-based multiple clusterings, computes the weighted average of the similarity matrices of the selected feature spaces. The second, Euclidean distance-based multiple clusterings, fuses the selected feature spaces using a selective weighted Euclidean distance. A tensor decomposition-based multiple clusterings method is then presented for efficiently clustering high-dimensional data, and a multi-relational attribute ranking method is proposed to further improve clustering performance. The proposed methods are illustrated and evaluated on a design example and a real-world data set. Experimental results show that the proposed methods can effectively cluster big data to provide enhanced knowledge extraction and services in CPSS.
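
The two simple fusion ideas named above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the paper's implementation: the abstract does not say which similarity function is used, so a Gaussian kernel is assumed here, and the function names, the `gamma` parameter, and the dictionary layout of feature spaces and weights are all hypothetical.

```python
import numpy as np


def gaussian_similarity(X, gamma=1.0):
    """Pairwise Gaussian-kernel similarity (an assumed choice of similarity)."""
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    return np.exp(-gamma * np.maximum(d2, 0.0))


def fused_similarity(feature_spaces, weights, selected, gamma=1.0):
    """Weighted average of the similarity matrices of the selected feature spaces."""
    total = sum(weights[k] for k in selected)
    n = feature_spaces[selected[0]].shape[0]
    S = np.zeros((n, n))
    for k in selected:
        S += (weights[k] / total) * gaussian_similarity(feature_spaces[k], gamma)
    return S


def selective_weighted_distance(feature_spaces, weights, selected):
    """Euclidean distance in which each selected feature space contributes
    its squared distance scaled by that space's weight."""
    n = feature_spaces[selected[0]].shape[0]
    d2 = np.zeros((n, n))
    for k in selected:
        X = feature_spaces[k]
        diff = X[:, None, :] - X[None, :, :]
        d2 += weights[k] * np.sum(diff ** 2, axis=-1)
    return np.sqrt(d2)


# Hypothetical toy data: two objects described in two feature spaces.
spaces = {"a": np.array([[0.0], [3.0]]), "b": np.array([[0.0], [4.0]])}
w = {"a": 1.0, "b": 1.0}
D_ab = selective_weighted_distance(spaces, w, ["a", "b"])
D_a = selective_weighted_distance(spaces, w, ["a"])
S_ab = fused_similarity(spaces, w, ["a", "b"])
```

Either matrix could then be passed to any clustering routine that accepts a precomputed similarity or distance matrix. Changing `selected` or the weights yields a different matrix, and hence a different clustering of the same objects.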
