Abstract

Eye movements are vital for human vision, and it is therefore important to understand how observers decide where to look. Meaning maps (MMs), a technique to capture the distribution of semantic information across an image, have recently been proposed to support the hypothesis that meaning rather than image features guides human gaze. MMs have the potential to be an important tool far beyond eye-movements research. Here, we examine central assumptions underlying MMs. First, we compared the performance of MMs in predicting fixations to saliency models, showing that DeepGaze II – a deep neural network trained to predict fixations based on high-level features rather than meaning – outperforms MMs. Second, we show that whereas human observers respond to changes in meaning induced by manipulating object-context relationships, MMs and DeepGaze II do not. Together, these findings challenge central assumptions underlying the use of MMs to measure the distribution of meaning in images.

Highlights

  • Human eyes resolve fine detail only in a small, central part of the visual field, with resolution dropping off rapidly in the periphery

  • We suggest that similar to saliency models, meaning maps (MMs) index the distribution of visual features rather than meaning

  • Semantic changes induced by altering object-context relationships elicited changes in distributions of human fixations, but neither MMs nor DeepGaze II (DGII) could predict them. These results suggest that both models might be sensitive to image features, which are frequently correlated with image meaning, rather than to meaning itself

Read more

Summary

Introduction

Human eyes resolve fine detail only in a small, central part of the visual field, with resolution dropping off rapidly in the periphery. If MMs measure meaning and if meaning guides human eye-movements, MMs should be better in predicting locations of fixations than saliency models because these models rely solely on image features.

Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call