Abstract
Most data stream ensemble classification algorithms use supervised learning, which requires a large amount of labeled data to train the classifier, and labeled data are expensive to obtain. Semi-supervised learning algorithms, which train the classifier on both labeled and unlabeled data, are therefore becoming increasingly popular. This article is the first to review data stream ensemble classification methods from the perspectives of both supervised and semi-supervised learning. First, base classifiers such as decision trees, neural networks, and support vector machines are introduced from these two perspectives. Second, the key technologies in data stream ensemble classification are explained in terms of incremental and online learning. Finally, majority voting and weighted voting are explained as ensemble strategies. The different ensemble methods are summarized, and the classic algorithms are quantitatively analyzed. Further research directions are given, including the handling of concept drift under supervised and semi-supervised learning, the study of homogeneous and heterogeneous ensembles, and data stream ensemble classification under unsupervised learning.
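As a rough illustration of the two voting strategies named in the abstract, the following minimal sketch combines the predictions of several ensemble members for a single instance. The function names, the example labels, and the weights are assumptions made for illustration only; they are not taken from the article, and real stream ensembles typically derive the weights from each member's recent accuracy.

```python
from collections import Counter, defaultdict


def majority_vote(predictions):
    """Return the label predicted by the largest number of ensemble members.

    Ties are broken in favor of the label that appears first in `predictions`.
    """
    return Counter(predictions).most_common(1)[0][0]


def weighted_vote(predictions, weights):
    """Return the label whose supporting members have the largest total weight.

    `weights[i]` is the (hypothetical) weight of the member that produced
    `predictions[i]`, for example its accuracy on recent labeled data.
    """
    if len(predictions) != len(weights):
        raise ValueError("predictions and weights must have the same length")
    scores = defaultdict(float)
    for label, weight in zip(predictions, weights):
        scores[label] += weight
    return max(scores, key=scores.get)


# Three hypothetical base classifiers vote on one incoming instance.
predictions = ["spam", "ham", "ham"]
weights = [0.9, 0.3, 0.4]  # assumed weights, not values from the article

print(majority_vote(predictions))           # label with the most votes
print(weighted_vote(predictions, weights))  # label with the largest total weight
```

With these assumed values the two strategies disagree: two of the three members vote "ham", but the single member voting "spam" carries more weight than the other two combined. This is the situation in which the choice of ensemble strategy affects the final prediction.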