AutoML for Multi-Label Classification: Overview and Empirical Evaluation.

Marcel Wever,Alexander Tornede,Eyke Hullermeier,Felix Mohr

doi:10.1109/tpami.2021.3051276

Abstract

Automated machine learning (AutoML) supports the algorithmic construction and data-specific customization of machine learning pipelines, including the selection, combination, and parametrization of machine learning algorithms as main constituents. Generally speaking, AutoML approaches comprise two major components: a search space model and an optimizer for traversing the space. Recent approaches have shown impressive results in the realm of supervised learning, most notably (single-label) classification (SLC). Moreover, first attempts at extending these approaches towards multi-label classification (MLC) have been made. While the space of candidate pipelines is already huge in SLC, the complexity of the search space is raised to an even higher power in MLC. One may wonder, therefore, whether and to what extent optimizers established for SLC can scale to this increased complexity, and how they compare to each other. This paper makes the following contributions: First, we survey existing approaches to AutoML for MLC. Second, we augment these approaches with optimizers not previously tried for MLC. Third, we propose a benchmarking framework that supports a fair and systematic comparison. Fourth, we conduct an extensive experimental study, evaluating the methods on a suite of MLC problems. We find a grammar-based best-first search to compare favorably to other optimizers.

Highlights

AUTOMATED machine learning (AutoML) is commonly understood as the task of automating the process of engineering a “machine learning pipeline” tailored to a problem at hand, that is, to a dataset on which a model ought to be induced
We considered existing optimization approaches for automating multi-label classification and, transferred other AutoML approaches commonly used for singlelabel classification to the problem domain of MLC
Our extensive study revealed that a reduction of the AutoML problem to hyper-parameter optimization does not scale well to the problem domain of MLC out of the box

Summary

Introduction

AUTOMATED machine learning (AutoML) is commonly understood as the task of automating the process of engineering a “machine learning pipeline” tailored to a problem at hand, that is, to a dataset on which a (predictive) model ought to be induced. This includes the selection, combination, and parameterization of machine learning (ML) algorithms as basic constituents of the pipeline, which is the main output produced by an AutoML tool, and which can be used to train a concrete model on the dataset. Since an AutoML tool is a complex system consisting of several components, most importantly a search space model and an optimization method for traversing this space, one typically faces a credit assignment

Methods

Findings

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE transactions on pattern analysis and machine intelligence	Publication Date: Jan 13, 2021
Citations: 39	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

AutoML for Multi-Label Classification: Overview and Empirical Evaluation.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE transactions on pattern analysis and machine intelligence

Lead the way for us

Similar Papers

Impact analysis of SCRs using single and multi-label machine learning classification
Syed Nadeem Ahsan ... Franz Wotawa
-
Syed Nadeem Ahsan, et. al.Syed Nadeem Ahsan ... Franz Wotawa
16 Sep 2010
16 Sep 2010

Multi-label classification of research articles using Word2Vec and identification of similarity threshold
Ghulam Mustafa ... Muhammad Sulaiman
Scientific Reports | VOL. 11
Ghulam Mustafa, et. al.Ghulam Mustafa ... Muhammad Sulaiman
09 Nov 2021
Scientific Reports | VOL. 11

Natural Language Processing for Imaging Protocol Assignment: Machine Learning for Multiclass Classification of Abdominal CT Protocols Using Indication Text Data
Brian Arun Xavier ... Po-Hao Chen
Journal of Digital Imaging | VOL. 35
Brian Arun Xavier, et. al.Brian Arun Xavier ... Po-Hao Chen
02 Jun 2022
Journal of Digital Imaging | VOL. 35

Multi-Label Associative Classification
Adriano Veloso ... Wagner Meira
-
Adriano Veloso, et. al.Adriano Veloso ... Wagner Meira
01 Jan 2010
01 Jan 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

AutoML for Multi-Label Classification: Overview and Empirical Evaluation.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE transactions on pattern analysis and machine intelligence