Description-Enhanced Label Embedding Contrastive Learning for Text Classification.

Kun Zhang, Le Wu, Guangyi Lv, Enhong Chen, Shulan Ruan, Jing Liu, Zhiqiang Zhang, Jun Zhou, Meng Wang

doi:10.1109/tnnls.2023.3282020

Abstract

Text classification is one of the fundamental tasks in natural language processing; it requires an agent to determine the most appropriate category for an input sentence. Recently, deep neural networks, especially pretrained language models (PLMs), have achieved impressive performance in this area. These methods usually concentrate on the input sentences and on generating the corresponding semantic embeddings. However, labels are another essential component, and most existing works either treat them as meaningless one-hot vectors or use vanilla embedding methods to learn label representations along with model training, underestimating the semantic information and guidance that these labels reveal. To alleviate this problem and better exploit label information, in this article we employ self-supervised learning (SSL) in the model learning process and design a novel self-supervised relation of relation ($R^2$) classification task that utilizes labels from a one-hot perspective. We then propose a novel relation of relation learning network ($R^2$-Net) for text classification, in which text classification and $R^2$ classification are treated as optimization targets. Meanwhile, triplet loss is employed to enhance the analysis of differences and connections among labels. Moreover, since one-hot usage still falls short of fully exploiting label information, we incorporate external knowledge from WordNet to obtain multiaspect descriptions for label semantic learning and extend $R^2$-Net to a novel description-enhanced label embedding network (DELE) from a label embedding perspective. Going one step further, since these fine-grained descriptions may introduce unexpected noise, we develop a mutual interaction module based on contrastive learning (CL) that selects appropriate parts from input sentences and labels simultaneously for noise mitigation. Extensive experiments on different text classification tasks reveal that $R^2$-Net can effectively improve classification performance and that DELE can make better use of label information and further improve performance. As a byproduct, we have released the code to facilitate other research.
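The abstract names two label-driven training signals, a pairwise relation-of-relation target and a triplet loss, without giving their formulas. The sketch below is one plausible reading, not the authors' implementation: it assumes the pairwise target is "do these two inputs share a class label?" and that the triplet loss is the standard margin-based form over Euclidean distance. The function names, the margin value, and the toy vectors are all hypothetical.

```python
from itertools import combinations
import math


def r2_pair_labels(labels):
    """Assumed pairwise target: 1 if two examples share a class label, else 0."""
    return [
        ((i, j), int(labels[i] == labels[j]))
        for i, j in combinations(range(len(labels)), 2)
    ]


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Standard margin-based triplet loss (assumed form, hypothetical margin)."""
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


# Toy batch: examples 0 and 2 share a label, so only that pair is marked 1.
batch_labels = ["sports", "politics", "sports"]
pairs = r2_pair_labels(batch_labels)

# Toy embeddings: the positive is already much closer to the anchor than
# the negative, so the margin is satisfied and the loss is zero.
anchor, positive, negative = [0.0, 0.0], [0.0, 1.0], [3.0, 4.0]
loss = triplet_loss(anchor, positive, negative)
```

The pairwise targets need no annotation beyond the existing class labels, which is what makes such a task self-supervised in the sense the abstract describes. Swapping the positive and negative vectors in the toy example makes the loss positive, which is the case the triplet term would penalize during training.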


Journal: IEEE Transactions on Neural Networks and Learning Systems
Publication Date: Oct 1, 2024
Citations: 2

Similar Papers

Multitask Learning as Question Answering with BERT
Shishir Roy ... Nayeem Ehtesham
18 Dec 2021

A Survey of Adversarial Defenses and Robustness in NLP
Shreya Goyal ... Mitesh M Khapra
ACM Computing Surveys | VOL. 55
17 Jul 2023

Making the Relation Matters: Relation of Relation Learning Network for Sentence Semantic Matching
Kun Zhang ... Guangyi Lv
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35
18 May 2021

Deep neural networks with pre-train model BERT for aspect-level sentiments classification
Yunxiang Zhang ... Zhuyi Rao
01 Jun 2020
