Abstract

Paraphrase generation aims to generate semantically consistent sentences with different syntactic realizations. Most recent studies rely on the typical encoder-decoder framework, in which the generation process is deterministic. In practice, however, the ability to generate multiple syntactically different paraphrases is important. Recent work proposed to incorporate variational inference over a target-related latent variable to introduce diversity. However, the latent variable may be contaminated by semantic information from other, unrelated sentences, which in turn changes the meaning conveyed by the generated paraphrases. In this paper, we propose a semantically consistent and syntactically variational encoder-decoder framework that uses adversarial learning to ensure that the syntactic latent variable is free of semantic information. Moreover, we adopt another discriminator to improve word-level and sentence-level semantic consistency. The proposed framework can thus generate multiple semantically consistent and syntactically different paraphrases. Experiments show that our model outperforms the baseline models on metrics based on both n-gram matching and semantic similarity, and that it can generate multiple different paraphrases by assembling different syntactic variables.
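
The sketch below is only an illustration of the adversarial idea described in the abstract, not the authors' implementation: a hypothetical probe network tries to recover a semantic representation from the syntactic latent variable, and the encoder is trained to defeat it. All module names, latent dimensions, and the MSE objective are assumptions made for illustration.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    SYN_DIM, SEM_DIM = 64, 128   # hypothetical latent sizes, not taken from the paper

    class SemanticProbe(nn.Module):
        # Adversary: tries to predict the sentence's semantic embedding
        # from the syntactic latent variable z_syn alone.
        def __init__(self):
            super().__init__()
            self.net = nn.Sequential(nn.Linear(SYN_DIM, 256), nn.ReLU(),
                                     nn.Linear(256, SEM_DIM))

        def forward(self, z_syn):
            return self.net(z_syn)

    def adversarial_losses(z_syn, sem_emb, probe):
        # The probe is trained to minimize its prediction error (d_loss);
        # the encoder is trained to maximize it (g_loss), so semantic
        # information is pushed out of the syntactic latent variable.
        d_loss = F.mse_loss(probe(z_syn.detach()), sem_emb)   # update probe only
        g_loss = -F.mse_loss(probe(z_syn), sem_emb)           # update encoder only
        return d_loss, g_loss

In practice the two losses would be applied with separate optimizers so that each update step touches only the intended parameters.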

Highlights

  • Paraphrase generation, a longstanding problem in Natural Language Processing (NLP) (McKeown, 1983), aims to generate semantically consistent sentences with different syntactic realizations for a given sentence.

  • Since this integral is unavailable in closed form or requires exponential time to compute (Blei et al., 2016), it is approximated by maximizing the evidence lower bound (ELBO): log pθ(x) ≥ ELBO = E_qφ(z|x)[log pθ(x|z)] − KL(qφ(z|x) ∥ p(z)); a minimal sketch of this bound follows the list.

  • To understand what counts as a good score on each metric, we run a preliminary experiment with a copying model and a random-sampling model, which can be regarded as the upper and lower bounds for the metrics.
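
For concreteness, here is a minimal Monte-Carlo estimate of the ELBO above for a Gaussian posterior qφ(z|x) and a standard normal prior p(z). The token-level cross-entropy likelihood, the tensor shapes, and the β weight (β = 1 gives the plain ELBO; other values give the β-VAE objective mentioned in the Introduction) are assumptions for illustration, not the paper's settings.

    import torch
    import torch.nn.functional as F

    def elbo(tokens, recon_logits, mu, logvar, beta=1.0):
        # tokens:       (batch, seq_len) gold word indices
        # recon_logits: (batch, seq_len, vocab) decoder outputs for one sample z ~ q(z|x)
        # mu, logvar:   (batch, latent_dim) parameters of the Gaussian posterior
        # E_q[log p_theta(x|z)], one-sample Monte-Carlo estimate
        recon_ll = -F.cross_entropy(recon_logits.transpose(1, 2), tokens,
                                    reduction="none").sum(dim=1)
        # KL(q_phi(z|x) || N(0, I)) in closed form for a diagonal Gaussian
        kl = 0.5 * torch.sum(mu.pow(2) + logvar.exp() - logvar - 1.0, dim=1)
        return (recon_ll - beta * kl).mean()   # beta = 1 recovers the standard ELBO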

Summary

Introduction

Paraphrase generation, a longstanding problem in Natural Language Processing (NLP) (McKeown, 1983), aims to generate semantically consistent sentences with different syntactic realizations for a given sentence. Conventional encoder-decoder models select the best result through beam search but are not able to produce multiple paraphrases in a principled way (Gupta et al., 2018). Previous work proposed β-VAE (Higgins et al., 2017), which applies a weight β to the KL divergence; this approach has been used as a baseline for paraphrase generation (Fu et al., 2019). When a text generation model samples words and expects a reward from a discriminator or an evaluator, it suffers from non-differentiability caused by the discrete nature of text. We use Gumbel-softmax because it makes the model end-to-end differentiable, improving the stability and speed of training over reinforcement learning (Chen et al., 2018).
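
As a rough sketch of why Gumbel-softmax avoids the non-differentiable sampling step, the relaxation below replaces a hard argmax over vocabulary logits with a temperature-controlled softmax over noisy logits; the shapes and temperature are assumptions, not the paper's settings.

    import torch

    def gumbel_softmax_sample(logits, temperature=0.5):
        # Differentiable, approximately one-hot sample from the categorical
        # distribution defined by `logits` over the vocabulary.
        gumbel = -torch.log(-torch.log(torch.rand_like(logits) + 1e-20) + 1e-20)
        return torch.softmax((logits + gumbel) / temperature, dim=-1)

    logits = torch.randn(2, 10000)              # (batch, vocab), assumed shapes
    soft_words = gumbel_softmax_sample(logits)  # can be fed to a discriminator
                                                # while gradients reach the generator

PyTorch also provides torch.nn.functional.gumbel_softmax, which implements the same relaxation and can optionally return a straight-through hard sample.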

Methods
Results
Conclusion
