BiVaSE: A bilingual variational sentence encoder with randomly initialized Transformer layers

Bence Nyéki

doi:10.1556/2062.2022.00584

Abstract

AbstractTransformer-based NLP models have achieved state-of-the-art results in many NLP tasks including text classification and text generation. However, the layers of these models do not output any explicit representations for texts units larger than tokens (e.g. sentences), although such representations are required to perform text classification. Sentence encodings are usually obtained by applying a pooling technique during fine-tuning on a specific task. In this paper, a new sentence encoder is introduced. Relying on an autoencoder architecture, it was trained to learn sentence representations from the very beginning of its training. The model was trained on bilingual data with variational Bayesian inference. Sentence representations were evaluated in downstream and linguistic probing tasks. Although the newly introduced encoder generally performs worse than well-known Transformer-based encoders, the experiments show that it was able to learn to incorporate linguistic information in the sentence representations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

BiVaSE: A bilingual variational sentence encoder with randomly initialized Transformer layers

Abstract

Talk to us

Similar Papers

More From: Acta Linguistica Academica

Lead the way for us

Journal: Acta Linguistica Academica	Publication Date: Dec 12, 2022
License type: CC BY 4.0

Similar Papers

TreeLSTM with tag-aware hypernetwork for sentence representation
Chunlin Xu ... Zhiwei Lin
Neurocomputing | VOL. 434
Chunlin Xu, et. al.Chunlin Xu ... Zhiwei Lin
28 Dec 2020
Neurocomputing | VOL. 434

Using Pre-trained Embeddings to Detect the Intent of an Email
Zaid Alibadi ... Jose M Vidal
-
Zaid Alibadi, et. al.Zaid Alibadi ... Jose M Vidal
29 May 2019
29 May 2019

Training Effective Neural Sentence Encoders from Automatically Mined Paraphrases
Slawomir Dadas
-
Slawomir DadasSlawomir Dadas
09 Oct 2022
09 Oct 2022

A comprehensive review on recent advances in Variational Bayesian inference
Rohit Sain ... Vikas Mittal
-
Rohit Sain, et. al.Rohit Sain ... Vikas Mittal
01 Mar 2015
01 Mar 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

BiVaSE: A bilingual variational sentence encoder with randomly initialized Transformer layers

Abstract

Talk to us

Similar Papers

More From: Acta Linguistica Academica