LightXML: Transformer with Dynamic Negative Sampling for High-Performance Extreme Multi-label Text Classification

Ting Jiang,Deqing Wang,Zhengyang Zhao,Leilei Sun,Huayi Yang,Fuzhen Zhuang

doi:10.1609/aaai.v35i9.16974

Abstract

Extreme multi-label text classification(XMC) is a task for finding the most relevant labels from a large label set. Nowadays deep learning-based methods have shown significant success in XMC. However, the existing methods (e.g., AttentionXML and X-Transformer etc) still suffer from 1) combining several models to train and predict for one dataset, and 2) sampling negative labels statically during the process of training label ranking model, which will harm the performance and accuracy of model. To address the above problems, we propose LightXML, which adopts end-to-end training and dynamical negative labels sampling. In LightXML, we use GAN like networks to recall and rank labels. The label recalling part will generate negative and positive labels, and the label ranking part will distinguish positive labels from these labels. Based on these networks, negative labels are sampled dynamically during label ranking part training. With feeding both label recalling and ranking parts with the same text representation, LightXML can reach high performance. Extensive experiments show that LightXML outperforms state-of-the-art methods in five extreme multi-label datasets with much smaller model size and lower computational complexity. In particular, on the Amazon dataset with 670K labels, LightXML can reduce the model size up to 72% compared to AttentionXML. Our code is available at http://github.com/kongds/LightXML.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

LightXML: Transformer with Dynamic Negative Sampling for High-Performance Extreme Multi-label Text Classification

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: May 18, 2021
Citations: 54

Similar Papers

Extreme Multi-Label Classification with Label Masking for Product Attribute Value Extraction
...
-
, et. al. ...
16 May 2022
16 May 2022

Multi-label classification by exploiting local positive and negative pairwise label correlation
Jun Huang ... Qingming Huang
Neurocomputing | VOL. 257
Jun Huang, et. al.Jun Huang ... Qingming Huang
06 Feb 2017
Neurocomputing | VOL. 257

Consumer Reactions to Positive and Negative Front-of-Package Food Labels
Anna H Grummon ... Eric B Rimm
American Journal of Preventive Medicine | VOL. 64
Anna H Grummon, et. al.Anna H Grummon ... Eric B Rimm
04 Oct 2022
American Journal of Preventive Medicine | VOL. 64

Machine Learning-based Mineral Prospectivity Mapping: Exploring the Role of Negative Training Labels to Enhance Predictive Models
Nyah Bay ... Kyubo Noh
-
Nyah Bay, et. al.Nyah Bay ... Kyubo Noh
08 Mar 2024
08 Mar 2024

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

LightXML: Transformer with Dynamic Negative Sampling for High-Performance Extreme Multi-label Text Classification

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence