Abstract

Since the rise of Transformer networks and large language models, cross-encoders have become the dominant architecture for many Natural Language Processing tasks. When processing sentence pairs, they can exploit the relationship between the two sentences directly. Bi-encoders, in contrast, produce a vector representation for a single sentence and are used in tasks such as textual similarity and information retrieval because of their low computational cost; however, their performance is inferior to that of cross-encoders. In this paper, we present Sentence-CROBI, an architecture that combines cross-encoders and bi-encoders to obtain a global representation of sentence pairs. We evaluated the proposed architecture on the paraphrase identification task using the Microsoft Research Paraphrase Corpus, the Quora Question Pairs dataset, and the PAWS-Wiki dataset. Our model obtains results competitive with state-of-the-art model ensembles while using a simple model configuration. These results demonstrate that a simple architecture that combines sentence-pair and single-sentence representations, without complex pre-training or fine-tuning algorithms, is a viable alternative for sentence pair tasks.
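The abstract only states that cross-encoder and bi-encoder representations are combined into a global sentence-pair representation; the exact combination scheme is not specified here. The following is a minimal sketch of one plausible realization, assuming BERT-style encoders, [CLS] pooling, and simple concatenation of the pair vector with the two single-sentence vectors; the class name `CrossBiEncoder`, the choice of `bert-base-uncased`, and the concatenation-plus-linear classifier are illustrative assumptions, not the authors' confirmed design.

```python
# Hypothetical sketch of a cross-encoder + bi-encoder combination (not the paper's exact model).
import torch
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

class CrossBiEncoder(nn.Module):
    """Concatenates a cross-encoder view of a sentence pair with
    bi-encoder views of each sentence, then classifies the result."""

    def __init__(self, model_name="bert-base-uncased", num_labels=2):
        super().__init__()
        self.cross_encoder = AutoModel.from_pretrained(model_name)  # encodes the pair jointly
        self.bi_encoder = AutoModel.from_pretrained(model_name)     # encodes each sentence alone
        hidden = self.cross_encoder.config.hidden_size
        self.classifier = nn.Linear(hidden * 3, num_labels)         # [pair; s1; s2] -> label

    @staticmethod
    def _cls(model, enc):
        # Use the [CLS] token embedding as the sequence representation.
        return model(**enc).last_hidden_state[:, 0]

    def forward(self, pair_enc, s1_enc, s2_enc):
        pair_vec = self._cls(self.cross_encoder, pair_enc)
        s1_vec = self._cls(self.bi_encoder, s1_enc)
        s2_vec = self._cls(self.bi_encoder, s2_enc)
        features = torch.cat([pair_vec, s1_vec, s2_vec], dim=-1)
        return self.classifier(features)

# Example usage on a candidate paraphrase pair.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
s1, s2 = "A man is playing a guitar.", "Someone plays a guitar."
pair_enc = tokenizer(s1, s2, return_tensors="pt")
s1_enc = tokenizer(s1, return_tensors="pt")
s2_enc = tokenizer(s2, return_tensors="pt")
logits = CrossBiEncoder()(pair_enc, s1_enc, s2_enc)  # paraphrase vs. non-paraphrase logits
```

Under this reading, the cross-encoder supplies the interaction-aware pair representation while the bi-encoder supplies independent single-sentence views, and the classifier operates on their concatenation.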
