Explicitly Representing Syntax Improves Sentence-to-Layout Prediction of Unexpected Situations

Wolf Nuyts, Marie-Francine Moens, Ruben Cartuyvels

doi:10.1162/tacl_a_00643

Abstract

Recognizing visual entities in a natural language sentence and arranging them in a 2D spatial layout require a compositional understanding of language and space. This task of layout prediction is valuable in text-to-image synthesis, as it allows localized and controlled in-painting of the image. In this comparative study, we show that layouts can be predicted from language representations that implicitly or explicitly encode sentence syntax, provided the sentences mention entity-relationships similar to those seen during training. To test compositional understanding, we collect a test set of grammatically correct sentences and layouts describing compositions of entities and relations that are unlikely to have been seen during training. Performance on this test set drops substantially, showing that current models rely on correlations in the training data and have difficulty understanding the structure of the input sentences. We propose a novel structural loss function that better enforces the syntactic structure of the input sentence and show large performance gains in the task of 2D spatial layout prediction conditioned on text. The loss has the potential to be used in other generation tasks where a tree-like structure underlies the conditioning modality. Code, trained models, and the USCOCO evaluation set are available via GitHub.

Journal: Transactions of the Association for Computational Linguistics
Publication Date: Apr 9, 2024
License type: CC BY 4.0
