Scene Perception Research Articles

With the rise of GeoAI research, streetscape imagery has received extensive attention due to its comprehensive coverage, abundant information, and accessibility. However, obtaining a holistic spatial–temporal scene representation is difficult because places are often composed of multiple images from different angles, times and locations. This problem also exists in other types of geo-tagged imagery. To solve it, we propose a purely visual, robust, and reliable method for urban function identification at the street scale. We introduce a method based on a two-layer spatially dependent graph neural network structure, which handles sequential street view imagery as input (typically available in services such as Google Street View, Baidu Maps, and Mapillary), with full consideration of the spatial dependencies among road networks. In this paper, we construct an urban topological map network using OpenStreetMap data in Wuhan, China, and compute a semantic representation of the scene as a whole at the street scale using a large-scale pre-trained model. We construct the graph network with streets as nodes based on 28,693 mapping relationships constructed from 75,628 street view images and 5,458 streets. Only 5.3% of the node labels were required to obtain 10 categories of functions for all nodes in the study area. The results demonstrate that by using appropriate spatial weights, street encoder, and graph structure, our novel method achieves high accuracy of P@1 46.2%, P@3 73.0%, P@5 82.4%, and P@10 89.9%, fully demonstrating the effectiveness of the introduced approach. We also use the model to sense urban spatial–temporal renewal by computing time series street images. The model is also applicable to the prediction of other attributes, where only a small number of labels are required to obtain valid and reliable scene perception results. The example data and code is shared at: https://github.com/yemanzhongting/Knowledge-and-Topology.

As an important indicator of urban development capacity, vitality can be affected by the human perception of street views, which is a dynamic sensory process that can differ greatly according to different transportation modes, due to their different travel speeds, distances, and routes. However, few studies have evaluated how the dynamic spatial perceptions differ between different travel modes and how these differences can affect vitality differently, due to the limitation of city-scale quantitative data on the dynamic perception of urban scenes. To fill the gap, we propose a “dynamic through-movement perception” (DTMP) measure which integrates a streetscape quality evaluation model with a network-based movement potential model. We measure the streetscape qualities from Baidu street-view images (SVI) and compare the spatial perceptions of drivers and pedestrians in central Guangzhou, China. First, more than twenty visual elements were classified from SVIs to predict human perceptions collected from visual surveys. Second, the through-movement probability of driving and walking were calculated based on classic natural movement theory in space syntax and measured as the angular betweenness for the two travel modes. Third, we accumulate the multipliers of visual perception and through-movement probability of driving and walking as the DTMP for both modes. Lastly, the DTMPs of both modes were fitted into linear regression models to explain street vitality, which is measured using Baidu mobile phone check-in data, when other control variables such as functional density, functional diversity and amenity clustering reachability are accounted for. The results show that the dynamic perception of driving overall shows a stronger correlation with street vitality, while perceived richness is significantly positive in both travel modes. This study provides the first quantitative evidence to reveal how the movement probability of different travel modes can significantly influence people’s sense of place, while in turn increasing street vitality. Our results can explain how different types of street commerce (i.e., pedestrian-oriented, and auto-oriented) aggregate spontaneously due to the dynamic movement potential, which provides an important reference for urban planners and decision makers for improving street vitality when making urban revitalization policies.

Scene Perception Research Articles

Related Topics

Articles published on Scene Perception

Perceived similarity as a window into representations of integrated sentence meaning.

Robot trajectory planning for autonomous 3D reconstruction of cockpit in aircraft final assembly testing

Preferential signal pathways during the perception and imagery of familiar scenes: An effective connectivity study.

Outdoor mobility aid for people with visual impairment: Obstacle detection and responsive framework for the scene perception during the outdoor mobility of people with visual impairment

GeoSynth: A Photorealistic Synthetic Indoor Dataset for Scene Understanding.

Hybrid deep learning with improved Salp swarm optimization based multi-class grape disease classification model

Examining whether adults with autism spectrum disorder encounter multiple problems in theory of mind: a study based on meta-analysis.

Time Courses of Attended and Ignored Object Representations.

A dynamical scan-path model for task-dependence during scene viewing.

Pedestrian reported activity and information preference while waiting at a red light

Knowledge and topology: A two layer spatially dependent graph neural networks to identify urban functions with time-series street view image

Bi-RRNet: Bi-level recurrent refinement network for camouflaged object detection

Perception of global properties, objects, and settings in natural auditory scenes

Research on Scene Perception of Mobile Robot based on SLAM

Object Sub-Categorization and Common Framework<br /> Method using Iterative AdaBoost for Rapid Detection of Multiple Objects

Drivers or Pedestrians, Whose Dynamic Perceptions Are More Effective to Explain Street Vitality? A Case Study in Guangzhou

Mechanisms of speed encoding in the human middle temporal cortex measured by 7T fMRI.

Interactionally Embedded Gestalt Principles of Multimodal Human Communication

Predictive Processing of Scene Layout Depends on Naturalistic Depth of Field

Awareness survey on drug crime scene investigation and drug detection kits among drug-related police officers

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Scene Perception Research Articles

Related Topics

Articles published on Scene Perception

Perceived similarity as a window into representations of integrated sentence meaning.

Robot trajectory planning for autonomous 3D reconstruction of cockpit in aircraft final assembly testing

Preferential signal pathways during the perception and imagery of familiar scenes: An effective connectivity study.

Outdoor mobility aid for people with visual impairment: Obstacle detection and responsive framework for the scene perception during the outdoor mobility of people with visual impairment

GeoSynth: A Photorealistic Synthetic Indoor Dataset for Scene Understanding.

Hybrid deep learning with improved Salp swarm optimization based multi-class grape disease classification model

Examining whether adults with autism spectrum disorder encounter multiple problems in theory of mind: a study based on meta-analysis.

Time Courses of Attended and Ignored Object Representations.

A dynamical scan-path model for task-dependence during scene viewing.

Pedestrian reported activity and information preference while waiting at a red light

Knowledge and topology: A two layer spatially dependent graph neural networks to identify urban functions with time-series street view image

Bi-RRNet: Bi-level recurrent refinement network for camouflaged object detection

Perception of global properties, objects, and settings in natural auditory scenes

Research on Scene Perception of Mobile Robot based on SLAM

Object Sub-Categorization and Common Framework&lt;br /&gt; Method using Iterative AdaBoost for Rapid Detection of Multiple Objects

Drivers or Pedestrians, Whose Dynamic Perceptions Are More Effective to Explain Street Vitality? A Case Study in Guangzhou

Mechanisms of speed encoding in the human middle temporal cortex measured by 7T fMRI.

Interactionally Embedded Gestalt Principles of Multimodal Human Communication

Predictive Processing of Scene Layout Depends on Naturalistic Depth of Field

Awareness survey on drug crime scene investigation and drug detection kits among drug-related police officers

Object Sub-Categorization and Common Framework<br /> Method using Iterative AdaBoost for Rapid Detection of Multiple Objects