High-level Visual Features Research Articles

Prospect-refuge theory suggests that people prefer environments that offer both prospect, the ability to scan for resources, and refuge, a safe place to hide. Urban planners, architects and researchers alike have tended to use prospect-refuge theory research on natural scenes to inform the design of urban environments. Despite the large body of prospect-refuge theory research, the degree to which prospect and refuge impact preference in urban environments remains unclear. Here, we first aim to evaluate the relationship between prospect, refuge and preference for urban scene images. Second, we aim to evaluate the contributions of visual features and streetscape quality ratings to subjective ratings of prospect and refuge in order to create proxy values of prospect and refuge. Finally, we aim to understand how the proxy values impact preference for urban scenes, and whether they replicate the relationship between subjective measures of prospect, refuge and preference. First, we used participant ratings of prospect and refuge to predict participants' preference for 552 images of urban street scenes. Higher ratings of both prospect and refuge predicted greater image preference. We next used principal components analysis to summarize these images' low- and high-level visual features as well as participant ratings of streetscape qualities, such as walkability and disorder. Visual feature and streetscape quality principal components predicted prospect and refuge ratings in this first image set, providing “proxy measures” for prospect and refuge. In an independent set of 1119 images from Talen et al. (2022) for which prospect and refuge ratings were not available, we asked whether these proxies for prospect and refuge predicted preference. Findings replicated the effect that more refuge in an image predicts more preference. However, the proxy measure of prospect did not predict preference. In summary, our results show that refuge ratings do relate to preferences in urban environments, which extends prospect-refuge theory to more urban environments. Future work is needed to understand whether prospect has different implications in more urban environments.
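
The proxy-measure pipeline described in this abstract (summarize features with principal components, regress ratings on the components, then apply the fitted model to images without ratings) can be sketched roughly as follows. This is a minimal illustration, not the authors' analysis: the feature matrices, ratings and preference scores are synthetic, the feature count and number of components are arbitrary assumptions, and all function names are hypothetical. Only the image counts (552 and 1119) come from the abstract.

```python
import numpy as np

rng = np.random.default_rng(0)


def fit_pca(X, n_components):
    """Return the column means and the top principal axes of X (rows = images)."""
    mean = X.mean(axis=0)
    # SVD of the centered data; the rows of Vt are the principal axes.
    _, _, Vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, Vt[:n_components]


def project(X, mean, axes):
    """Project images onto the principal axes to get component scores."""
    return (X - mean) @ axes.T


def fit_ols(Z, y):
    """Ordinary least squares with an intercept; returns [intercept, slopes...]."""
    A = np.column_stack([np.ones(len(Z)), Z])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef


def predict(Z, coef):
    return coef[0] + Z @ coef[1:]


# Synthetic stand-ins (assumptions): 20 features per image, 5 components.
n_rated, n_new, n_features, n_components = 552, 1119, 20, 5
X_rated = rng.normal(size=(n_rated, n_features))  # images with refuge ratings
X_new = rng.normal(size=(n_new, n_features))      # independent images, no ratings
true_w = rng.normal(size=n_features)
refuge_rating = X_rated @ true_w + rng.normal(scale=0.5, size=n_rated)

# Step 1: summarize the rated images' features with principal components.
mean, axes = fit_pca(X_rated, n_components)

# Step 2: predict refuge ratings from the component scores.
coef = fit_ols(project(X_rated, mean, axes), refuge_rating)

# Step 3: apply the same projection and model to the unrated images.
refuge_proxy = predict(project(X_new, mean, axes), coef)

# Step 4: ask whether the proxy predicts (synthetic) preference in the new set.
preference = 0.8 * (X_new @ true_w) + rng.normal(scale=0.5, size=n_new)
slope = fit_ols(refuge_proxy[:, None], preference)[1]
```

The projection and regression coefficients are fitted only on the rated set and then reused unchanged on the independent set, which is what makes the resulting values a proxy rather than a new rating.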

Single image super-resolution (SISR) aims to recover clear high-resolution images from low-resolution images and has made great progress in recent years with the development of deep learning. Scene text image super-resolution (STISR) is a subfield of SISR with the goal of increasing the resolution of a low-resolution text image and enhancing the readability of characters in the image. Despite significant improvements in recent approaches, STISR remains a challenging task due to the diversity of backgrounds, text appearances and layouts. This paper presents a Perceiving Multiple Representations (PerMR) method for better super-resolution performance on scene text images. PerMR is a unified network that combines super-resolution with text recognition and exploits the recognizer’s feedback to facilitate super-resolution. Specifically, contextual information from the text decoder is extracted to provide sequence-specific guidance and enable the super-resolution model to pay more attention to the text region. Meanwhile, low-level and high-level visual features from the vision backbone of the recognition network are integrated to further improve visual quality. Additionally, we incorporate a frequency branch into the vanilla convolution unit, which efficiently enhances global and local feature representations. Experiments on the STISR benchmark dataset TextZoom validate that PerMR not only generates more distinguishable images but also outperforms the current state-of-the-art methods. PerMR boosts the average recognition accuracy by 5.9% using ASTER, 5.8% using MORAN and 10.6% using CRNN compared to the baseline model TSRN. PerMR outperforms the advanced method TPGSR-3 by 1.4% on ASTER, 0.1% on MORAN and 0.2% on CRNN, and improves on TATT by 0.6% on ASTER and 1.1% on MORAN. Furthermore, PerMR demonstrates good robustness and generalization when tackling low-quality text images in multiple scene text recognition datasets. The experimental results verify the capability of PerMR to boost text recognition performance.

Articles published on High-level Visual Features

Multimodal Sentiment Analysis in Natural Disaster Data on Social Media

Retinotopy drives the variation in scene responses across visual field map divisions of the occipital place area.

GAT-Based Bi-CARU with Adaptive Feature-Based Transformation for Video Summarisation

Anchor objects drive realism while diagnostic objects drive categorization in GAN generated scenes

Quantifying urban environments: Aesthetic preference through the lens of prospect-refuge theory

Testing the flexibility of ensemble coding: Limitations in cross-modal ensemble perception.

Predicting movies’ eudaimonic and hedonic scores: A machine learning approach using metadata, audio and visual features

Semantic manipulation through the lens of Geometric Algebra

Visual Features for Improving Endoscopic Bleeding Detection Using Convolutional Neural Networks.

Visual perceptual learning is effective in the illusory far but not in the near space

Estimating urban noise along road network from street view imagery

High or low? Exploring the restorative effects of visual levels on campus spaces using machine learning and street view imagery

Transductive semantic knowledge graph propagation for zero-shot learning

Perceiving Multiple Representations for scene text image super-resolution guided by text recognizer

(Retracted) Mimicking human vision systems: deep-learning-based feature fusion for semantic image retrieval

Perceptual awareness of natural scenes is limited by higher-level visual features: Evidence from deep neural networks.

Self-Supervision-Augmented Deep Autoencoder for Unsupervised Visual Anomaly Detection.

Looking more masculine among females: Spatial context modulates gender perception of face and biological motion.

Is it the best for barista robots to serve like humans? A multidimensional anthropomorphism perspective

CheXGAT: A disease correlation-aware network for thorax disease diagnosis from chest X-ray images
