Abstract

Multi label classification is concerned with learning from a set of instances that are associated with a set of labels, that is, an instance could be associated with multiple labels at the same time. This task occurs frequently in application areas like text categorization, multimedia classification, bioinformatics, protein function classification and semantic scene classification. Current multi-label classification methods could be divided into two categories. The first is called problem transformation methods, which transform multi-label classification problem into single label classification problem, and then apply any single label classifier to solve the problem. The second category is called algorithm adaptation methods, which adapt an existing single label classification algorithm to handle multi-label data. In this paper, we propose a multi-label classification approach based on correlations among labels that use both problem transformation methods and algorithm adaptation methods. The approach begins with transforming multi-label dataset into a single label dataset using least frequent label criteria, and then applies the PART algorithm on the transformed dataset. The output of the approach is multi-labels rules. The approach also tries to get benefit from positive correlations among labels using predictive Apriori algorithm. The proposed approach has been evaluated using two multi-label datasets named (Emotions and Yeast) and three evaluation measures (Accuracy, Hamming Loss, and Harmonic Mean). The experiments showed that the proposed approach has a fair accuracy in comparison to other related methods.

Highlights

  • Data classification is a form of data analysis that can be used to extract models describing important data classes

  • Multi label classification is concerned with learning from set of instances that are associated with a set of labels, that is, an instance could be associated with multiple labels at the same time

  • The general structure of the proposed approach consists of three phases: (a) Transforming multi-label dataset into single label dataset and discovering correlations among labels. (b) Applying a rule-based classification algorithm on the transformed dataset. (c) Generating the multi-label rules based on the output of the rule-based classifier and the correlations among labels

Read more

Summary

A Multi-Label Classification Approach Based on Correlations Among Labels

Abstract—Multi label classification is concerned with learning from a set of instances that are associated with a set of labels, that is, an instance could be associated with multiple labels at the same time. This task occurs frequently in application areas like text categorization, multimedia classification, bioinformatics, protein function classification and semantic scene classification. We propose a multi-label classification approach based on correlations among labels that use both problem transformation methods and algorithm adaptation methods. The approach begins with transforming multi-label dataset into a single label dataset using least frequent label criteria, and applies the PART algorithm on the transformed dataset. The experiments showed that the proposed approach has a fair accuracy in comparison to other related methods

INTRODUCTION
RELATED WORK
Problem Transformation Methods
Algorithm Adaptation methods
THE PROPOSED APPROACH FOR MULTI LABEL CLASSIFICATION
Applying Rule-Based Classifier
AN ILLUSTRATIVE EXAMPLE FOR THE PROPOSED APPROACH
Approach Phases
Amaze d
EXPERIMENTS AND RESULTS
CONCLUSIONS AND FUTURE WORK
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call