Abstract

Nowadays, multi-label data are ubiquitous in real-world applications, in which each instance is associated with a set of labels. Multi-label learning has attracted significant attentions from researchers and plenty of algorithms have been proposed. Among those algorithms binary relevance (BR) is a widely used framework for multi-label classification. It constructs binary classifiers for each label by means of one-vs-rest style. BR approach is a simple and straight forward way of problem transformation for multi-label learning, but it ignores label correlations totally. Stacking based BR is a feasible way to tackle this problem. The key issue of stacking based BR is how to select label subset to extend the original features for each label. Existing methods of stacking based BR usually select identical label subset for all labels. It may be suboptimal as each label has its own most related label subset. In this paper, a novel stacking based method is introduced to utilize label correlations based on Pareto Optimum for improving the performance of BR. Our method builds a stack of two layers of BR classifiers. At the first layer, a group of binary classifiers are constructed, one for a label. At the second layer, for each label we employ Pareto Optimum to select most related label subset, then augment the original features by the selected label subset. The final binary classifiers for each label are constructed based on their corresponding reconstructed feature space. Comparing to other well-established stacking multi-label learning algorithms in terms of different multi-label classification criteria, experimental results on several multi-label benchmark datasets testify the superiority of the proposed methods.

Highlights

  • In many real-word applications, each instance usually exhibits multiple concepts or semantic meanings simultaneously

  • We propose a novel method for multi-label classification, which utilizes label correlations by means of label specific features

  • THE PROPOSED ALGORITHM we propose a novel algorithm for multi-label classification named SMBPO, i.e. a Stacking Model Based on Pareto Optimum

Read more

Summary

INTRODUCTION

In many real-word applications, each instance usually exhibits multiple concepts or semantic meanings simultaneously. We propose a novel method for multi-label classification, which utilizes label correlations by means of label specific features. The key contributions of our method are summarized as follows: 1) Unlike existing methods depending on rankings or thresholds to select label subset for a given label, our method translates the evaluating values of label correlations into a multi-dimensional space to deal with this challenge. To our knowledge, it is one of the first work for stacking based multi-label learning.

EXPLOITING LABEL DEPENDENCIES IN BR FRAMEWORK
PARETO OPTIMUM
THE PROPOSED ALGORITHM
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call