Abstract

Paraphrases are texts that convey the same meaning through different expressions. Pivot-based methods, also known as round-trip translation, have shown promising results in generating high-quality paraphrases. However, existing pivot-based methods all rely on language as the pivot, which requires large-scale, high-quality parallel bilingual texts. In this paper, we explore the feasibility of using semantic and syntactic representations as the pivot for paraphrase generation. Concretely, we transform a sentence into a variety of semantic or syntactic representations (including AMR, UD, and latent semantic representation) and then decode the sentence back from these representations. We further explore a pretraining-based approach that compresses the pipeline into an end-to-end framework. We conduct experiments comparing the approaches across different kinds of pivots. Experimental results show that taking AMR as the pivot yields paraphrases of better quality than taking language as the pivot, and that the end-to-end framework reduces semantic shift when language is used as the pivot. Moreover, several unsupervised pivot-based methods generate paraphrases of similar quality to a supervised sequence-to-sequence model, which suggests that parallel paraphrase data may not be necessary for paraphrase generation.

Highlights

  • P(Y|X) = P(Z|X)P(Y|Z), where Z denotes the pivot representation of X (see the sketch after this list)

  • Existing pivot-based methods all rely on language as the pivot, requiring large-scale, high-quality parallel bilingual texts

  • Paraphrase generation is an important and challenging task in the field of Natural Language Processing (NLP), with applications such as information retrieval (Yan et al., 2016), question answering (Fader et al., 2014; Yin et al., 2015), and machine translation (Cho et al., 2014)

  • We explore the feasibility of using different pivots for pivot-based paraphrasing models, including syntactic representation (Universal Dependencies, UD; McDonald et al., 2013), semantic representation (Abstract Meaning Representation, AMR; Banarescu et al., 2013), and latent semantic representation (LSR)
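
To make the factorization P(Y|X) = P(Z|X)P(Y|Z) concrete, here is a minimal sketch of the two-stage pivot pipeline in Python. `TextToPivot`, `PivotToText`, and `paraphrase` are hypothetical names introduced for illustration, not part of the paper's code; any trained text-to-pivot and pivot-to-text model pair (e.g., text-to-AMR and AMR-to-text) could fill these roles.

```python
# Minimal sketch of the pivot factorization P(Y|X) = P(Z|X)P(Y|Z).
# `TextToPivot` and `PivotToText` are hypothetical interfaces standing in
# for any trained encoder/decoder pair over a pivot representation
# (AMR, UD, LSR, or a foreign language).

from typing import Protocol


class TextToPivot(Protocol):
    def encode(self, sentence: str) -> str:
        """Map a sentence X to a pivot representation Z, i.e., model P(Z|X)."""
        ...


class PivotToText(Protocol):
    def decode(self, pivot: str) -> str:
        """Map a pivot representation Z back to a sentence Y, i.e., model P(Y|Z)."""
        ...


def paraphrase(sentence: str, to_pivot: TextToPivot, from_pivot: PivotToText) -> str:
    """Round-trip a sentence through the pivot to produce a paraphrase."""
    pivot = to_pivot.encode(sentence)   # first stage: P(Z|X)
    return from_pivot.decode(pivot)     # second stage: P(Y|Z)
```

Because the pivot abstracts away from surface form, decoding from it can yield a sentence that differs in wording from the input while preserving its meaning.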


Summary

Introduction

When Z is chosen to be a representation in a different language, the quality of the generated paraphrases largely depends on the pre-existing machine translation system. Choosing language as the pivot has some disadvantages: (1) the pipelined translations may incur semantic shift (Guo et al., 2019), and (2) machine translation systems are sensitive to domain, so the quality of translating out-of-domain sentences cannot be guaranteed. We explore the feasibility of using different pivots for pivot-based paraphrasing models, including syntactic representation (Universal Dependencies, UD; McDonald et al., 2013), semantic representation (Abstract Meaning Representation, AMR; Banarescu et al., 2013), and latent semantic representation (LSR). Abstract Meaning Representation (AMR) (Banarescu et al., 2013) is a rooted, labeled, acyclic graph that abstracts away from syntax and preserves semantics. Since AMR keeps only semantic information, paraphrases can share the same AMR graph.
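
As an illustration of the AMR round trip described above, the following sketch uses the open-source amrlib library to parse a sentence into an AMR graph and generate a sentence back from it. This is not the system used in the paper, only one possible realization under the assumption that amrlib and its pretrained models are installed; the exact API may differ across versions.

```python
# Hedged sketch: AMR round-trip paraphrasing with amrlib
# (https://github.com/bjascob/amrlib). Assumes the pretrained
# sentence-to-graph and graph-to-sentence models are installed.

import amrlib

stog = amrlib.load_stog_model()   # P(Z|X): sentence -> AMR graph
gtos = amrlib.load_gtos_model()   # P(Y|Z): AMR graph -> sentence

graphs = stog.parse_sents(["The boy wants the girl to believe him."])
paraphrases, _ = gtos.generate(graphs)
print(paraphrases[0])
```

Because the AMR graph discards surface word order and function words, the regenerated sentence often differs in phrasing from the input while expressing the same meaning.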

