TA-SBERT: Token Attention Sentence-BERT for Improving Sentence Representation

Jaejin Seo,Ling Liu,Wonik Choi,Sangwon Lee

doi:10.1109/access.2022.3164769

Jaejin Seo, Ling Liu + Show 2 more

Open Access

https://doi.org/10.1109/access.2022.3164769

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2022
Citations: 10	License type: CC BY 4.0

Affiliation: Inha University, Georgia Institute of Technology

Abstract

A sentence embedding vector can be obtained by connecting a global average pooling (GAP) to a pre-trained language model. The problem of such a sentence embedding vector using a GAP is that it is generated with the same weight for all words appearing in the sentence. We propose a novel sentence embedding-method-based model Token Attention-SentenceBERT (TA-SBERT) to address this problem. The rationale of TA-SBERT is to enhance the performance of sentence embedding by introducing three strategies. First, we convert the base form while preprocessing the input sentence to reduce misunderstanding. Second, we propose a novel Token Attention (TA) technique that distinguishes important words to produce more informative sentence vectors. Third, we increase stability of fine-tuning to avoid catastrophic forgetting by adding a reconstruction loss to the word embedding vector. Extensive ablation studies demonstrate that our TA-SBERT outperforms the original SentenceBERT (SBERT) in the sentence vector evaluation using semantic textual similarity (STS) tasks and the SentEval toolkit.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

TA-SBERT: Token Attention Sentence-BERT for Improving Sentence Representation

Abstract

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Sentence transition matrix: An efficient approach that preserves sentence semantics
Myeongjun Jang ... Pilsung Kang
Computer Speech & Language | VOL. 71
Myeongjun Jang, et. al.Myeongjun Jang ... Pilsung Kang
16 Jul 2021
Computer Speech & Language | VOL. 71

SupMPN: Supervised Multiple Positives and Negatives Contrastive Learning Model for Semantic Textual Similarity
Somaiyeh Dehghan ... Mehmet Fatih Amasyali
Applied Sciences | VOL. 12
Somaiyeh Dehghan, et. al.Somaiyeh Dehghan ... Mehmet Fatih Amasyali
26 Sep 2022
Applied Sciences | VOL. 12

SEBGM: Sentence Embedding Based on Generation Model with multi-task learning
Qian Wang ... Xu Wang
Computer Speech & Language | VOL. 87
Qian Wang, et. al.Qian Wang ... Xu Wang
06 Apr 2024
Computer Speech & Language | VOL. 87

ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

-

01 Aug 2021
01 Aug 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

TA-SBERT: Token Attention Sentence-BERT for Improving Sentence Representation

Abstract

Talk to us

Similar Papers

More From: IEEE Access