Advanced sentence-embedding method considering token importance based on explainable artificial intelligence and text summarization model

Yuho Cha,Younghoon Lee

doi:10.1016/j.neucom.2023.126987

Abstract

Although pretrained language models achieve high performance on various natural language processing tasks, they still require further improvements in the sentence embedding task. Many studies have improved performance in this task using pre-trained language models and contrastive learning, but these approaches are limited because they are based on naive average pooling and CLS tokens. Therefore, we propose an advanced sentence-embedding method based on weighted pooling that considers token importance. Specifically, the token importance is calculated by combining an explainable artificial-intelligence module with a text summarization model, and the final sentence embedding is derived through weighted pooling token embedding and token importance. Thus, we derive a sentence embedding that considers both the local information of the token embedding and the global information of the entire sentence. Experimental results reveal that our proposed sentence embedding outperforms other models on both text similarity tasks and text classification. Moreover, the proposed method’s robustness is verified through the results of an ablation study.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Advanced sentence-embedding method considering token importance based on explainable artificial intelligence and text summarization model

Abstract

Talk to us

Similar Papers

More From: Neurocomputing

Lead the way for us

Journal: Neurocomputing	Publication Date: Nov 2, 2023
Citations: 2

Similar Papers

Disentangling Semantics and Syntax in Sentence Embeddings with Pre-trained Language Models
...
-
, et. al. ...
25 May 2021
25 May 2021

Disentangling Semantics and Syntax in Sentence Embeddings with Pre-trained Language Models
James Y Huang ... Kuan-Hao Huang
-
James Y Huang, et. al.James Y Huang ... Kuan-Hao Huang
01 Jan 2020
01 Jan 2020

On the Sentence Embeddings from Pre-trained Language Models
Bohan Li ... Junxian He
-
Bohan Li, et. al.Bohan Li ... Junxian He
01 Jan 2020
01 Jan 2020

Modeling Fine-grained Information via Knowledge-aware Hierarchical Graph for Zero-shot Entity Retrieval
Taiqiang Wu ... Weijie Liu
-
Taiqiang Wu, et. al.Taiqiang Wu ... Weijie Liu
27 Feb 2023
27 Feb 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Advanced sentence-embedding method considering token importance based on explainable artificial intelligence and text summarization model

Abstract

Talk to us

Similar Papers

More From: Neurocomputing