A Dual-branch Learning Model with Gradient-balanced Loss for Long-tailed Multi-label Text Classification

Yitong Yao,Peng Zhang,Yueheng Sun,Jing Zhang

doi:10.1145/3597416

Abstract

Multi-label text classification has a wide range of applications in the real world. However, the data distribution in the real world is often imbalanced, which leads to serious long-tailed problems. For multi-label classification, due to the vast scale of datasets and existence of label co-occurrence, how to effectively improve the prediction accuracy of tail labels without degrading the overall precision becomes an important challenge. To address this issue, we propose A Dual-Branch Learning Model with Gradient-Balanced Loss (DBGB) based on the paradigm of existing pre-trained multi-label classification SOTA models. Our model consists of two main long-tailed module improvements. First, with the shared text representation, the dual-classifier is leveraged to process two kinds of label distributions; one is the original data distribution and the other is the under-sampling distribution for head labels to strengthen the prediction for tail labels. Second, the proposed gradient-balanced loss can adaptively suppress the negative gradient accumulation problem related to labels, especially tail labels. We perform extensive experiments on three multi-label text classification datasets. The results show that the proposed method achieves competitive performance on overall prediction results compared to the state-of-the-art methods in solving the multi-label classification, with significant improvement on tail-label accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Dual-branch Learning Model with Gradient-balanced Loss for Long-tailed Multi-label Text Classification

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Information Systems

Lead the way for us

Journal: ACM Transactions on Information Systems	Publication Date: Sep 27, 2023
Citations: 1

Similar Papers

Supervised Machine Learning for Multi-label Classification of Bangla Articles
Dip Bhakta ... Swakkhar Shatabda
-
Dip Bhakta, et. al.Dip Bhakta ... Swakkhar Shatabda
01 Jan 2020
01 Jan 2020

Multi-label Classification for Clinical Text with Feature-level Attention
Disheng Pan ... Xizi Zheng
-
Disheng Pan, et. al.Disheng Pan ... Xizi Zheng
01 May 2020
01 May 2020

Learning Local and Global Features for Optimized Multi-Label Text Classification
Muhammad Rafi ... Fizza Abid
-
Muhammad Rafi, et. al.Muhammad Rafi ... Fizza Abid
22 Nov 2022
22 Nov 2022

EnML: Multi-label Ensemble Learning for Urdu Text Classification
Faiza Mehmood ... Muhammad Nabeel Asim
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 22
Faiza Mehmood, et. al.Faiza Mehmood ... Muhammad Nabeel Asim
22 Sep 2023
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Dual-branch Learning Model with Gradient-balanced Loss for Long-tailed Multi-label Text Classification

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Information Systems