Visual Paraphrase Generation with Key Information Retained

Jiayuan Xie, Yi Cai, Qingbao Huang, Qing Li, Jiali Chen

doi:10.1145/3585010

Abstract

The visual paraphrase generation task aims to rewrite a given image-related original sentence into a new paraphrase that expresses the same meaning as the original sentence but differs in its form of expression. Existing studies mainly extract two semantic vectors, one representing the entire image and one representing the entire original sentence, for paraphrase generation. However, such whole-image and whole-sentence vectors may cause the model to overlook some key objects in the original sentence, so that it generates semantically inconsistent sentences in which key object information has been changed. In this article, we propose an object-level paraphrase generation model, which generates paraphrases by adjusting the permutation of key objects and modifying their associated descriptions. To adjust the permutation of key objects, an object-sorting module obtains new object sequences based on the key object information and the original sentences. Then, a sequence generation module generates paraphrases step by step, following the permutation of the new object sequences. Each generation step focuses on the image features associated with a different key object, so that the generated descriptions differ in expression. Furthermore, we use a semantic discriminator module to keep the generated paraphrase semantically close to the original sentence. Specifically, the loss function of the discriminator penalizes excessive distance between the paraphrase and the original sentence. Extensive experiments on the MS COCO dataset show that the proposed model outperforms the baselines.
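The abstract does not state the exact form of the discriminator loss. Purely as an illustration of what "penalizing excessive distance" can look like, the sketch below applies a hinge-style penalty to the cosine distance between two sentence embeddings. The function names, the use of cosine distance, and the `margin` value are assumptions made for this example, not details taken from the article.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cosine similarity of two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def distance_penalty(paraphrase_vec, original_vec, margin=0.2):
    """Hypothetical hinge-style penalty (not the paper's formula).

    The penalty is zero while the paraphrase embedding stays within
    `margin` of the original embedding, and grows linearly beyond it.
    """
    distance = cosine_distance(paraphrase_vec, original_vec)
    return max(0.0, distance - margin)


# Identical embeddings are not penalized; orthogonal ones are.
same = distance_penalty([1.0, 0.0], [1.0, 0.0])
far = distance_penalty([1.0, 0.0], [0.0, 1.0])
```

Under these assumptions, a paraphrase may drift from the original wording without cost as long as its embedding stays within the margin, which matches the stated goal of changing the expression while keeping the meaning.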

Journal: ACM Transactions on Multimedia Computing, Communications, and Applications
Publication Date: May 30, 2023
Citations: 1

Similar Papers

Pushing Paraphrase Away from Original Sentence: A Multi-Round Paraphrase Generation Approach
Zhe Lin ... Xiaojun Wan
01 Jan 2020

Neural-Driven Search-Based Paraphrase Generation
Betty Fabre ... Jonathan Chevelu
01 Jan 2020

Using Deep Learning for Implementing Paraphrasing in a Social Robot
Laura Martín Galván ... Enrique Fernández-Rodicio
01 Jan 2023

Quality Controlled Paraphrase Generation
...
07 May 2022
