Abstract

Contemporary datasets are often comprised of multiple views of data, which provide complete and complementary information in different views, and multi-view clustering is one of the most crucial techniques in multi-view data analysis. However, traditional multi-view clustering methods are sensitive to noises and outliers, suffering from severe performance degradation when the dataset contains many outliers. Moreover, the commonly used multi-view clustering methods are restricted by high time complexity. To address these problems, we propose a robust multi-view k-means algorithm with outlier detection, i.e., Multi-View Clustering with Outlier Removal (MVCOR). This method is designed to remove the outliers and thus boosts the clustering performance on multi-view data with low time complexity. By defining two types of outliers, MVCOR uses the well-defined outlier removal strategy to categorize all the outliers into two specific clusters and performs robust clustering on the clean data at the same time. This strategy significantly improves the clustering performance as well as the model robustness, making MVCOR a more practical approach for real-world scenarios. Besides, the proposed model is efficiently optimized by a well-designed alternating minimization algorithm which is strictly proved to be convergent. Extensive experiments on both synthetic and real-world datasets demonstrate that MVCOR consistently outperforms the related clustering methods on clustering performance as well as robustness to outliers, and achieves comparable performance to the state-of-the-art multi-view outlier detection methods.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call