Abstract

The goal of Graded Multi-label Classification (GMLC) is to assign a degree of membership or relevance of a class label to each data point. As opposed to multi-label classification tasks which can only predict whether a class label is relevant or not. The graded multi-label setting generalizes the multi-label paradigm to allow a prediction on a gradual scale. This is in agreement with practical real-world applications where the labels differ in matter of level relevance. In this paper, we propose a novel decision tree classifier (GML_DT) that is adapted to the graded multi-label setting. It fully models the label dependencies, which sets it apart from the transformation-based approaches in the literature, and increases its performance. Furthermore, our approach yields comprehensive and interpretable rules that efficiently predict all the degrees of memberships of the class labels at once. To demonstrate the model’s effectiveness, we tested it on real-world graded multi-label datasets and compared it against a baseline transformation-based decision tree classifier. To assess its predictive performance, we conducted an experimental study with different evaluation metrics from the literature. Analysis of the results shows that our approach has a clear advantage across the utilized performance measures.

Highlights

  • Multi-label classification (MLC) has become an extensively researched and prominent field in machine learning

  • We propose a novel adapted decision tree classifier (GML-DT) that is suited for the graded multi label setting

  • We present a graded multi-label decision tree classifier, GML_DT, which generalizes the multi-label setting by predicting the membership degrees of the target labels instead of the binary relevance/non relevance

Read more

Summary

INTRODUCTION

Multi-label classification (MLC) has become an extensively researched and prominent field in machine learning. The article can fully belong to the class health while it remains somewhat socioeconomical In this light, all multi-label problems are graded multi-label problems, where the membership degrees are reduced to two binary values, relevant/non relevant. The ongoing research on Graded multi-label classification aims at developing solutions for real-world problems where the multi-label learning paradigm is not applicable or not optimal For this purpose, some graded multilabel classifiers were proposed [1] [12] [13] [14]. The main advantages of this approach are its ability to fully model the label dependencies which improves the quality of its predictions This algorithm is the first adapted tree-based model which makes it the most interpretable existing approach in GML.

Graded Multi-label Classifiers
Multi-label Decision Tree Classifiers
Formal Task Description
A Toy Example
EXPERIMENTAL STUDY
Evaluation Metrics
Datasets
Results
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call