Reformulating Unsupervised Style Transfer as Paraphrase Generation

Kalpesh Krishna, Mohit Iyyer, John Wieting

doi:10.18653/v1/2020.emnlp-main.55

Abstract

Modern NLP defines the task of style transfer as modifying the style of a given sentence without appreciably changing its semantics, which implies that the outputs of style transfer systems should be paraphrases of their inputs. However, many existing systems purportedly designed for style transfer inherently warp the input's meaning through attribute transfer, which changes semantic properties such as sentiment. In this paper, we reformulate unsupervised style transfer as a paraphrase generation problem, and present a simple methodology based on fine-tuning pretrained language models on automatically generated paraphrase data. Despite its simplicity, our method significantly outperforms state-of-the-art style transfer systems on both human and automatic evaluations. We also survey 23 style transfer papers and discover that existing automatic metrics can be easily gamed and propose fixed variants. Finally, we pivot to a more real-world style transfer setting by collecting a large dataset of 15M sentences in 11 diverse styles, which we use for an in-depth analysis of our system.

Highlights

The task of style transfer on text data involves changing the style of a given sentence while preserving its semantics. Recent work in this area conflates style transfer with the related task of attribute transfer (Subramanian et al., 2019; He et al., 2020), in which modifications to attribute-specific content words warp both stylistic and semantic properties of a sentence (Preotiuc-Pietro et al., 2016).
Our contributions are: (1) a simple approach to perform lexically and syntactically diverse paraphrasing with pretrained language models; (2) a simple unsupervised style transfer method that models semantic preservation with our paraphraser and significantly outperforms prior work; (3) a critique of existing style transfer evaluation based on a naïve baseline that performs on par with prior work on poorly designed metrics; (4) a new benchmark dataset that contains 15M sentences from 11 diverse styles.
We focus exclusively on semantics-preserving style transfer tasks, which means that we do not evaluate on attribute transfer datasets such as sentiment, gender, and political transfer.

Summary

Introduction

The task of style transfer on text data involves changing the style of a given sentence while preserving its semantics. Recent work in this area conflates style transfer with the related task of attribute transfer (Subramanian et al., 2019; He et al., 2020), in which modifications to attribute-specific content words (e.g., those that carry sentiment) warp both stylistic and semantic properties of a sentence (Preotiuc-Pietro et al., 2016). Our unsupervised method (Style Transfer via Paraphrasing, or STRAP) requires no parallel data between different styles and proceeds in three simple stages, the first of which creates pseudo-parallel data by feeding sentences from different styles through a diverse paraphrase model (Figure 1, left). Our contributions are: (1) a simple approach to perform lexically and syntactically diverse paraphrasing with pretrained language models; (2) a simple unsupervised style transfer method that models semantic preservation with our paraphraser and significantly outperforms prior work; (3) a critique of existing style transfer evaluation based on a naïve baseline that performs on par with prior work on poorly designed metrics; (4) a new benchmark dataset that contains 15M sentences from 11 diverse styles.
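
The first stage described above, building pseudo-parallel data by paraphrasing sentences from each style, can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: `toy_paraphrase` is a hypothetical stand-in for the diverse paraphrase model, and the function name `build_pseudo_parallel`, the style labels, and the sample sentences are all invented here for illustration.

```python
from typing import Callable, Dict, List, Tuple


def toy_paraphrase(sentence: str) -> str:
    """Hypothetical stand-in for a diverse paraphrase model (assumption).

    It only lowercases and strips trailing punctuation, to mimic the idea
    of normalizing away stylistic surface features.
    """
    return sentence.lower().rstrip("!?.")


def build_pseudo_parallel(
    corpora: Dict[str, List[str]],
    paraphrase: Callable[[str], str],
) -> Dict[str, List[Tuple[str, str]]]:
    """Pair each styled sentence with its paraphrase, grouped by style.

    Each pair is stored as (paraphrase, original), so a later stage could
    learn to map a paraphrase back to the styled sentence.
    """
    return {
        style: [(paraphrase(s), s) for s in sentences]
        for style, sentences in corpora.items()
    }


# Illustrative corpora; labels and sentences are made up for this sketch.
corpora = {
    "formal": ["I would be most grateful for your assistance."],
    "casual": ["OMG This Is AMAZING!!!"],
}
pairs = build_pseudo_parallel(corpora, toy_paraphrase)
```

No parallel data between styles is needed here: each style's corpus is processed on its own, and the paraphrase of a sentence serves as its pseudo-parallel counterpart.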

Style Transfer via Paraphrasing

Creating pseudo-parallel training data

Style transfer via inverse paraphrasing

Paraphraser implementation with GPT-2

Promoting diversity by filtering data

Evaluating style transfer

Current state of style transfer evaluation

Aggregation of Metrics

A Naïve Style Transfer System

Datasets

Comparisons against prior work

Ablation studies

Towards Real-World Style Transfer

Related Work

Conclusion

PARANMT-50M Filtering Details

Generative Model Details

Classifier Model Details

OpenNMT Model Details

More Comparisons with Prior Work

Diverse Paraphrasing on CDS

Style Transfer Performance on CDS

A Survey of Evaluation Methods

Details on Human Evaluation

Publication Date: Jan 1, 2020
Citations: 130
License type: cc-by
