Abstract

In most machine learning (ML) applications, data that arrive from heterogeneous views (i.e., multiple heterogeneous sources of data) are more likely to provide complementary information than does a single view. Hence, these are known as multi-view data. In real-world applications, such as web clustering, data arrive from diverse groups (i.e., sets of features) and therefore have heterogeneous properties. Each feature group is referred to as a particular view. Although multi-view learning provides complementary information for machine learning algorithms, it results in high dimensionality. However, to reduce the dimensionality, feature selection is an efficient method that can be used to select only the representative features of the views so to reduce the dimensionality. In this paper, an unsupervised feature selection for online dynamic multi-views (UFODMV) is developed, which is a novel and efficient mechanism for the dynamic selection of features from multi-views in an unsupervised stream. UFODMV consists of a clustering-based feature selection mechanism enabling the dynamic selection of representative features and a merging process whereby both features and views are received incrementally in a streamed fashion over time. The experimental evaluation demonstrates that the UFODMV model has the best classification accuracy with values of 20% and 50% compared with well-known single-view and multi-view unsupervised feature selection methods, namely OMVFS, USSSF, and SPEC.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call