Abstract

Multi-Label Text Classification (MLTC) is one of the most important research tasks in natural language processing. Although Deep Learning (DL) models have been widely applied to MLTC, several drawbacks remain. First, traditional DL models use all the words in a document to construct the embedding vector, even though many of these words have no bearing on the classification result. Second, the labels in MLTC carry specific semantics, yet traditional DL models ignore the fine-grained matching signals between words and labels. Third, traditional DL models struggle with the data imbalance common in MLTC datasets. In addition, during training, small errors in a given epoch may be amplified as the number of iterations grows, leading to classification errors. To address these problems, an MLTC model integrating Label Attention and Historical Attention (LAHA) is proposed. First, a word filter selects important words based on the cosine similarity between words and labels. Next, Document Self-Attention (DSA) and Label Attention (LA) are computed, and DSA-attended LA co-attention (LA-co) and LA-attended DSA co-attention (DSA-co) networks are constructed. The fine-grained matching signals between words and labels are then integrated through an adaptive sum of LA-co and DSA-co. Finally, Historical Attention is integrated into LAHA, which not only avoids misclassification caused by minor errors in a single epoch but also reduces overfitting to high-frequency labels. Extensive comparative experiments on four benchmark datasets demonstrate that LAHA outperforms state-of-the-art baseline models and effectively mitigates the data imbalance issue in MLTC datasets. Our code is available at https://github.com/sgysgywaityou/LAHA.
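To make the word-filtering and label-attention steps concrete, the following is a minimal PyTorch sketch. It assumes word and label embeddings share one dimension; the function names, the top-k filtering rule, and the dot-product attention form are illustrative assumptions, not the authors' exact implementation (see the linked repository for that).

```python
# Hypothetical sketch of cosine-similarity word filtering and label attention.
# Shapes, names, and the top-k rule are assumptions for illustration only.
import torch
import torch.nn.functional as F

def filter_words(words: torch.Tensor, labels: torch.Tensor, k: int) -> torch.Tensor:
    """Keep the k words most similar (by cosine) to any label.

    words:  (seq_len, dim) word embeddings of one document
    labels: (num_labels, dim) label embeddings
    """
    # Cosine similarity between every word and every label: (seq_len, num_labels)
    sim = F.cosine_similarity(words.unsqueeze(1), labels.unsqueeze(0), dim=-1)
    # Score each word by its best-matching label, then keep the top-k words.
    scores = sim.max(dim=1).values
    idx = scores.topk(k).indices.sort().values  # sort to preserve word order
    return words[idx]

def label_attention(words: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
    """Fine-grained word-label matching: one attended vector per label."""
    attn = torch.softmax(words @ labels.t(), dim=0)  # (seq_len, num_labels)
    return attn.t() @ words                          # (num_labels, dim)

# Usage with random embeddings standing in for learned ones.
words = torch.randn(50, 128)   # 50 words, 128-dim embeddings
labels = torch.randn(10, 128)  # 10 labels
kept = filter_words(words, labels, k=20)
label_repr = label_attention(kept, labels)  # (10, 128)
```

The self-attention branch (DSA) and the adaptive sum that fuses the two co-attention outputs would follow the same pattern, with the fusion weights learned during training.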
