Abstract
Multi-label classification (MLC) tasks are encountered more and more frequently in machine learning applications. While MLC methods exist for the classical batch setting, only a few methods are available for the streaming setting. In this paper, we propose a new methodology for MLC via multi-target regression in a streaming setting. Moreover, we develop a streaming multi-target regressor, iSOUP-Tree, that uses this approach. We experimentally compare two variants of the iSOUP-Tree method (building regression and model trees), as well as ensembles of iSOUP-Trees, with state-of-the-art tree and ensemble methods for MLC on data streams. We evaluate these methods on a variety of measures of predictive performance appropriate for the MLC task. The ensembles of iSOUP-Trees perform significantly better on some of these measures, especially the ones based on label ranking, and are not significantly worse than the competitors on any of the remaining measures. We identify the thresholding problem for the task of MLC on data streams as a key issue that needs to be addressed in order to obtain even better predictive performance.
Highlights
The task of multi-label classification (MLC) has recently become very prominent in the machine learning research community (Gibaja and Ventura 2015)
We only have enough evidence to conclude that the Hoeffding tree with pruned sets (HTPS) significantly outperforms model trees in terms of macro-averaged precision (Precision_macro)
HTPS uses considerably less memory when compared to model and regression trees
Summary
The task of multi-label classification (MLC) has recently become very prominent in the machine learning research community (Gibaja and Ventura 2015). It can be seen as a generalization of the ubiquitous multi-class classification task: instead of a single label, each example is associated with multiple labels. While multi-class classification requires predicting only one of the possible labels, multi-label classification requires a model to predict a combination (subset) of the possible labels. This means that for each data instance x from an input space X, a model needs to provide a prediction y from an output space Y, which is the powerset of the labelset L, i.e., Y = 2^L. Binary relevance (BR) models have often been overlooked due to their inability to account for label correlations, though some BR methods are capable of modeling label correlations during classification.
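The core idea of MLC via multi-target regression can be sketched as follows. This is a minimal illustrative sketch, not the iSOUP-Tree method itself: the label subset is encoded as a 0/1 target vector over the labelset L, a multi-target regressor predicts one real-valued score per label, and a threshold tau (the choice of tau is exactly the thresholding problem the paper highlights; tau = 0.5 here is an assumption) converts the scores back into a predicted label subset.

```python
def labels_to_targets(labels, labelset):
    """Encode a label subset y (a subset of L) as a 0/1 target vector over L."""
    return [1.0 if l in labels else 0.0 for l in labelset]

def threshold_prediction(scores, labelset, tau=0.5):
    """Turn per-label regression scores into a predicted label subset.

    tau is an assumed fixed threshold; choosing it well on a stream is
    the open thresholding problem identified in the paper.
    """
    return {l for l, s in zip(labelset, scores) if s >= tau}

labelset = ["sports", "politics", "tech"]

# Encoding an example whose true labels are {sports, tech}.
targets = labels_to_targets({"sports", "tech"}, labelset)  # [1.0, 0.0, 1.0]

# Illustrative scores a multi-target regressor might output for one instance.
scores = [0.8, 0.1, 0.6]
predicted = threshold_prediction(scores, labelset)  # {'sports', 'tech'}
```

Any streaming multi-target regressor (such as iSOUP-Tree) can be plugged in to produce the score vector; the encoding and thresholding steps are what reduce MLC to multi-target regression.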