Exploring the interpretability of the BERT model for semantic similarity

Diana Anahí Ledesma Roque,Olga Kolesnikova,Ricardo Menchaca Méndez

doi:10.3233/jifs-219359

Abstract

This study addresses the issue of semantic similarity in sentences using the BERT model through various aggregation techniques, such as max-pooling, mean-pooling, and an LSTM network applied to the output of the BERT model. Subsequently, the linguistic interpretability of the BERT-Base transformer model is analyzed through the unsupervised learning approach, specifically through dimensionality reduction using autoencoders and clustering algorithms, utilizing the representation of the classification token CLS. The results highlight that the CLS classification token achieves better abstractions than the proposed methods. In terms of interpretability, it is observed that sequence length is relevant in the early layers, with a gradual decrease across the layers. Additionally, attention to semantic similarity is concentrated in the intermediate and upper layers, especially in layers 6, 8, 9, and 10. All these findings were obtained by addressing the semantic similarity task using the STS-Benchmark dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Exploring the interpretability of the BERT model for semantic similarity

Abstract

Talk to us

Similar Papers

More From: Journal of Intelligent & Fuzzy Systems

Lead the way for us

Similar Papers

A BERT-GRU Model for Measuring the Similarity of Arabic Text
Rakia Saidi ... Didier Schwab
JUCS - Journal of Universal Computer Science | VOL. 30
Rakia Saidi, et. al.Rakia Saidi ... Didier Schwab
28 Jun 2024
JUCS - Journal of Universal Computer Science | VOL. 30

Quantifying semantic similarity of clinical evidence in the biomedical literature to facilitate related evidence synthesis.
Hamed Hassanzadeh ... Anthony Nguyen
Journal of Biomedical Informatics | VOL. 100
Hamed Hassanzadeh, et. al.Hamed Hassanzadeh ... Anthony Nguyen
30 Oct 2019
Journal of Biomedical Informatics | VOL. 100

Evaluation of Lexical-Based Approaches to the Semantic Similarity of Malay Sentences
Shahrul Azman Noah ... Amru Yusrin Amruddin
Journal of Quantitative Linguistics | VOL. 22
Shahrul Azman Noah, et. al.Shahrul Azman Noah ... Amru Yusrin Amruddin
19 Mar 2015
Journal of Quantitative Linguistics | VOL. 22

Sentence similarity measuring by vector space model
U. L. D. N. Gunasinghe ... A. S. Perera
-
U. L. D. N. Gunasinghe, et. al.U. L. D. N. Gunasinghe ... A. S. Perera
01 Dec 2014
01 Dec 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Exploring the interpretability of the BERT model for semantic similarity

Abstract

Talk to us

Similar Papers

More From: Journal of Intelligent &amp; Fuzzy Systems

More From: Journal of Intelligent & Fuzzy Systems