Abstract
One-class classification belongs to the one of the novel and very promising topics in contemporary machine learning. In recent years ensemble approaches have gained significant attention due to increasing robustness to unknown outliers and reducing the complexity of the learning process. In our previous works, we proposed a highly efficient one-class classifier ensemble, based on input data clustering and training weighted one-class classifiers on clustered subsets. However, the main drawback of this approach lied in difficult and time consuming selection of a number of competence areas which indirectly affects a number of members in the ensemble. In this paper, we investigate ten different methodologies for an automatic determination of the optimal number of competence areas for the proposed ensemble. They have roots in model selection for clustering, but can be also effectively applied to the classification task. In order to select the most useful technique, we investigate their performance in a number of one-class and multi-class problems. Numerous experimental results, backed-up with statistical testing, allows us to propose an efficient and fully automatic method for tuning the one-class clustering-based ensembles.
Highlights
Machine learning becomes frequently used in real-life applications, allowing to analyze massive and complex data
We investigate ten methods for automatic selection of a number of clusters, that in our case simultaneously represent a number of competence areas
We present the performance of the examined methods according to the average number of competence areas they had identified, and to their final accuracy
Summary
Machine learning becomes frequently used in real-life applications, allowing to analyze massive and complex data. The classical approach relies on a set of data with well known classes, which are used for training of a machine learning algorithm. Collecting a representative set of objects may be costly, time consuming, unethical or impossible [27]. In such cases, one needs to create a fully operational pattern classification system with the usage of objects originating only from a single class. One needs to create a fully operational pattern classification system with the usage of objects originating only from a single class This learning paradigm is known as the one-class classification (OCC) [22]
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.