Abstract

Referring expression generation (REG) presents the converse problem to visual search: given a scene and a specified target, how does one generate a description which would allow somebody else to quickly and accurately locate the target? Previous work in psycholinguistics and natural language processing has failed to find an important and integrated role for vision in this task. That previous work, which relies largely on simple scenes, tends to treat vision as a pre-process for extracting feature categories that are relevant to disambiguation. However, the visual search literature suggests that some descriptions are better than others at enabling listeners to search efficiently within complex stimuli. This paper presents a study testing whether participants are sensitive to visual features that allow them to compose such “good” descriptions. Our results show that visual properties (salience, clutter, area, and distance) influence REG for targets embedded in images from the Where's Wally? books. Referring expressions for large targets are shorter than those for smaller targets, and expressions about targets in highly cluttered scenes use more words. We also find that participants are more likely to mention non-target landmarks that are large, salient, and in close proximity to the target. These findings identify a key role for visual salience in language production decisions and highlight the importance of scene complexity for REG.

Highlights

  • Cognitive science research in the domains of vision and language faces similar challenges for modeling the way people use and integrate information

  • Existing studies at the vision-language interface have succeeded in incorporating complex visual stimuli or complex linguistic tasks, but rarely both, and the conclusions from that previous work have assigned a limited role to vision in language production

  • This paper considers the question of how the language people produce in a complex referential task is influenced by the properties of a complex visual scene

Summary

INTRODUCTION

Cognitive science research in the domains of vision and language faces similar challenges for modeling the way people use and integrate information. In previous REG studies, speakers did not appear to use an efficient, salience-based visual search strategy to check whether a candidate description (“a large blue airplane”) sufficiently identifies the target. This result is puzzling in light of the extensive literature on perception (Eckstein, 2011; Wolfe, 2012), which shows that visual search is sensitive to visually salient features, and because search and generation are in some sense converse problems: one dealing with perception, the other with production. The Where's Wally? scenes are deliberately cluttered and contain large numbers of similar-looking people as well as more and less salient objects; in this sense they represent the opposite extreme to the simplistic scenes used in previous work. Results on such images certainly leave open a range of intermediate visual complexity in which salience effects might be weaker and harder to detect. The Wally images may represent the upper range of complexity at which humans must compose descriptions, but they are probably no worse than the scenes people encounter on a day-to-day basis.


