Rapid Scene Categorization Research Articles

Introduction: Humans can rapidly categorize scenes (R. VanRullen & S. Thorpe, 2001), even using peripheral vision (Larson & Loschky, 2009). Various computational models have been proposed for rapid scene categorization in terms of low-level properties such as spatial envelopes (Oliva & Torralba, 2001) and texture summary statistics (TTM, Rosenholtz et al., 2012). Yet, these models do not explicitly model the foveated properties of the visual system nor the interaction between eye movements and the scene category task. We propose a model with a foveated visual system and eye movements that can predict the dependence of human categorization performance across fixations. The model combines square pooling regions with the computer vision-transformer architecture (Dosovitskiy et al., 2020, Touvron et al., 2020) and makes multiple fixations to maximize classification using the technique of self-attention (Parikh et al., 2016, Bahdanau et al., 2015). Methods: Twenty-two participants classified 360 images (Places365 database, places2.csail.mit.edu) into 30 classes. Images subtended a viewing angle of 22.7 degrees. A gaze-contingent display was used to randomly interrupt the display after 1, 2, 3, or 4 fixations with initial forced-fixation at bottom-center or top-center. Results: We show that there is no significant improvement in performance after the 2nd fixation (Δ correct categorization=0.015; p=0.4729), unlike performance for object search (Koehler and Eckstein, 2017). The model correctly predicts modest classification improvements for free-viewing fixations (Δ=0.016). The model-human correlation in classification choices was not significantly lower than human-human correlations. Our findings suggest that human categorization of scenes within a single fixation can be explained by the spatially global distribution of the visual information in the scene and their availability even through the bottlenecks of the visual periphery. The newly proposed hybrid approach using biologically based modeling and Transformers can flexibly be applied to various naturalistic tasks and stimuli.

Read full abstract

Observers can categorize a novel scene within the first 100 ms of its onset. Researchers have suggested this is accomplished through a rapid feed-forward sweep of neural activation. Consequently, researchers have focused on examining the minimal perceptual information diagnostic of a scene's semantic category, rather than investigating the role that top-down processes play in rapid scene categorization. Thus, most scene gist studies present scenes from multiple categories in randomized sequences.Conversely, in this experiment, we tested whether scene gist recognition is facilitated by sequential expectations.To do this, we created more ecologically valid spatio-temporal "narrative" sequences of images along spatially connected routes from starting points to destinations (e.g., office, hallway, stairwell, sidewalk, parking lot). Scene images were presented one-at-a-time, and based on pilot-testing, briefly flashed (24 ms) and masked (48 ms), followed by selecting the scene category from an 8-AFC array. To reduce predictability of subsequent images, we included subsequences of randomly 1-4 scenes from each category (e.g., 1-4 office images followed by 1-4 hallways, etc.). Critically, scenes were shown in either coherent or randomized narrative sequences to test two competing alternative hypotheses: 1) "Narrative coherence": accuracy is higher for images in coherent narrative sequences because scene category expectations prime representations for to-be-presented scenes; 2) "Feed-forward": accuracy does not differ between coherent and randomized image sequences because it is a purely feed-forward process. Results: images presented in randomized sequences were categorized just as accurately as images presented in coherent sequences, consistent with the "Feed-forward" hypothesis. Furthermore, accuracy did not increase as a function of the number of sequential exposures to a single scene category (e.g., 1-4 office images in a row), suggesting that the mechanisms responsible for perceiving a scene's meaning are so rapid that prior exposures do not support extracting the gist of subsequent scenes. Meeting abstract presented at VSS 2017

Read full abstract

Rapid Scene Categorization Research Articles

Related Topics

Articles published on Rapid Scene Categorization

A Foveated Vision-Transformer Model for Scene Classification

Rapid scene categorization is not purely feed-forward: An EEG investigation of scene gist facilitation by sequential predictions

The role of high-order statistics in evoking perceptual priming effect on rapid scene categorization

Rapid scene categorization: From coarse peripheral vision to fine central vision

The contributions of central and peripheral vision to scene-gist recognition with a 180° visual field.

The effects of distributed and focused attention on rapid scene categorization

Narrative priming of scene gist: The role of sequential expectations in scene gist perception

Association of rapid scene categorization with a nicotinic acetylcholine receptor gene polymorphism(Summary of Awarded Presentation at the 31st Annual Meeting)

Two scenes or not two scenes: The effects of stimulus repetition and view-similarity on scene categorization from brief displays.

Visual information representation and rapid-scene categorization are simultaneous across cortex: An MEG study

Processing context: Asymmetric interference of visual form and texture in object and scene interactions

The influence of segmentation on rapid scene categorization

Distinguishing the roles of color and other surface properties in rapid natural scene categorization: Evidence from ERPs

Does segmentation influence rapid scene categorization?

Does segmentation influence rapid scene categorization

Rapid scene categorization: Role of spatial frequency order, accumulation mode and luminance contrast

An Integrated Saliency Model with Guidance of Eye Movement in Natural Scene Classification

The importance of visual features in rapid scene categorization: evidence from repetition blindness

The importance of visual features in rapid scene categorization: Evidence from repetition blindness.

A simple rapid categorization model accounts for variations in behavioral responses across rapid scene categorization tasks

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Rapid Scene Categorization Research Articles

Related Topics

Articles published on Rapid Scene Categorization

A Foveated Vision-Transformer Model for Scene Classification

Rapid scene categorization is not purely feed-forward: An EEG investigation of scene gist facilitation by sequential predictions

The role of high-order statistics in evoking perceptual priming effect on rapid scene categorization

Rapid scene categorization: From coarse peripheral vision to fine central vision

The contributions of central and peripheral vision to scene-gist recognition with a 180° visual field.

The effects of distributed and focused attention on rapid scene categorization

Narrative priming of scene gist: The role of sequential expectations in scene gist perception

Association of rapid scene categorization with a nicotinic acetylcholine receptor gene polymorphism(Summary of Awarded Presentation at the 31st Annual Meeting)

Two scenes or not two scenes: The effects of stimulus repetition and view-similarity on scene categorization from brief displays.

Visual information representation and rapid-scene categorization are simultaneous across cortex: An MEG study

Processing context: Asymmetric interference of visual form and texture in object and scene interactions

The influence of segmentation on rapid scene categorization

Distinguishing the roles of color and other surface properties in rapid natural scene categorization: Evidence from ERPs

Does segmentation influence rapid scene categorization?

Does segmentation influence rapid scene categorization

Rapid scene categorization: Role of spatial frequency order, accumulation mode and luminance contrast

An Integrated Saliency Model with Guidance of Eye Movement in Natural Scene Classification

The importance of visual features in rapid scene categorization: evidence from repetition blindness

The importance of visual features in rapid scene categorization: Evidence from repetition blindness.

A simple rapid categorization model accounts for variations in behavioral responses across rapid scene categorization tasks