Reweighting Forest for Extreme Multi-label Classification

Zhun-Zheng Lin, Bi-Ru Dai

doi:10.1007/978-3-319-64283-3_21

Abstract

In recent years, data volumes have grown rapidly alongside the development of Internet technologies. Some datasets contain a huge number of labels, dimensions, and data points. As a result, some cannot be loaded by typical classifiers, and others take an unacceptably long time to process. Extreme multi-label classification is designed to address these challenges. It differs from traditional multi-label classification in several ways, including the need for lower execution time and for training at an extreme scale, with millions of data points, features, and labels. To improve practicality, this paper focuses on designing an extreme multi-label classification approach that can run on a single personal computer. We devise a two-phase framework to deal with the above issues. In the reweighting phase, prediction precision is improved by paying more attention to hard-to-classify instances and by increasing the diversity of the model. In the pretesting phase, lower-quality trees are removed from the prediction model to reduce the model size and increase prediction precision. Experiments on real-world datasets verify that the proposed method generates better prediction results and successfully shrinks the model size.
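The two phases can be illustrated with a minimal sketch. The abstract does not give the actual weight-update rule or the tree-quality criterion, so everything below is an assumption made for illustration: the function names `reweight` and `pretest`, the `boost` and `keep_ratio` parameters, and the idea of ranking trees by a holdout precision score are all hypothetical and are not taken from the paper.

```python
def reweight(weights, correct, boost=2.0):
    """Hypothetical reweighting step (not the paper's rule).

    Multiplies the weight of every misclassified instance by `boost`,
    then renormalizes so the weights sum to 1. Later trees trained on
    these weights would focus more on hard-to-classify instances.
    """
    raised = [w if ok else w * boost for w, ok in zip(weights, correct)]
    total = sum(raised)
    return [w / total for w in raised]


def pretest(trees, holdout_precision, keep_ratio=0.5):
    """Hypothetical pretesting step (not the paper's criterion).

    Ranks trees by a caller-supplied holdout precision function and
    keeps only the best `keep_ratio` fraction (at least one tree),
    which shrinks the model.
    """
    ranked = sorted(trees, key=holdout_precision, reverse=True)
    n_keep = max(1, int(len(ranked) * keep_ratio))
    return ranked[:n_keep]


# Toy data: four instances, the third one was misclassified.
new_weights = reweight([0.25, 0.25, 0.25, 0.25], [True, True, False, True])

# Toy forest: tree names with made-up holdout precision scores.
scores = {"t1": 0.6, "t2": 0.3, "t3": 0.8, "t4": 0.5}
kept = pretest(list(scores), scores.get, keep_ratio=0.5)
```

In this toy run the misclassified instance ends up with twice the weight of the others, and only the two highest-scoring trees survive pretesting. A real implementation would plug in an actual tree learner and a precision measure computed on held-out data.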

