Natural Scene Images Research Articles

Object detection in remote sensing images (RSIs) is one of the basic tasks in the field of remote sensing image automatic interpretation. In recent years, the deep object detection frameworks of natural scene images (NSIs) have been introduced into object detection on RSIs, and the detection performance has improved significantly because of the powerful feature representation. However, there are still many challenges concerning the particularities of remote sensing objects. One of the main challenges is the missed detection of small objects which have less than five percent of the pixels of the big objects. Generally, the existing algorithms choose to deal with this problem by multi-scale feature fusion based on a feature pyramid. However, the benefits of this strategy are limited, considering that the location of small objects in the feature map will disappear when the detection task is processed at the end of the network. In this study, we propose a subtask attention network (StAN), which handles the detection task directly on the shallow layer of the network. First, StAN contains one shared feature branch and two subtask attention branches of a semantic auxiliary subtask and a detection subtask based on the multi-task attention network (MTAN). Second, the detection branch uses only low-level features considering small objects. Third, the attention map guidance mechanism is put forward to optimize the network for keeping the identification ability. Fourth, the multi-dimensional sampling module (MdS), global multi-view channel weights (GMulW) and target-guided pixel attention (TPA) are designed for further improvement of the detection accuracy in complex scenes. The experimental results on the NWPU VHR-10 dataset and DOTA dataset demonstrated that the proposed algorithm achieved the SOTA performance, and the missed detection of small objects decreased. On the other hand, ablation experiments also proved the effects of MdS, GMulW and TPA.

Read full abstract

The estimation of image quality and noise perception still remains an important issue in various image processing applications. It has also become a hot topic in the field of photo-realistic computer graphics where noise is inherent in the calculation process. Unlike natural-scene images, however, a reference image is not available for computer-generated images. Thus, classic methods to assess noise quantity and stopping criterion during the rendering process are not usable. This is particularly important in the case of global illumination methods based on stochastic techniques: They provide photo-realistic images which are, however, corrupted by stochastic noise. This noise can be reduced by increasing the number of paths, as proved by Monte Carlo theory, but the problem of finding the right number of paths that are required in order to ensure that human observers cannot perceive any noise is still open. Until now, the features taking part in the human evaluation of image quality and the remaining perceived noise are not precisely known. Synthetic image generation tends to be very expensive and the produced datasets are high-dimensional datasets. In that case, finding a stopping criterion using a learning framework is a challenging task. In this paper, a new method for characterizing computational noise for computer generated images is presented. The noise is represented by the entropy of the singular value decomposition of each block composing an image. These Singular Value Decomposition (SVD)-entropy values are then used as input to a recurrent neural network architecture model in order to extract image noise and in predicting a visual convergence threshold of different parts of any image. Thus a new no-reference image quality assessment is proposed using the relation between SVD-Entropy and perceptual quality, based on a sequence of distorted images. Experiments show that the proposed method, compared with experimental psycho-visual scores, demonstrates a good consistency between these scores and stopping criterion measures that we obtain.

Read full abstract

Natural Scene Images Research Articles

Related Topics

Articles published on Natural Scene Images

Multi‐lingual text detection and identification using agile convolutional neural network

Subtask Attention Based Object Detection in Remote Sensing Images

Visual memorability in the absence of semantic content

Naturalness and aesthetics of colors – Preference for color compositions perceived as natural

COCO-Search18 fixation dataset for predicting goal-directed attention control

Text detection and localization in scene images: a broad review

Multi-granularity Deep Local Representations for Irregular Scene Text Recognition

Towards Accurate Scene Text Detection with Bidirectional Feature Pyramid Network

Text detection and script identification in natural scene images using deep learning

Character-based handwritten text transcription with attention networks

Sparse elastic net multi-label rank support vector machine with pinball loss and its applications

Rule-based perspective rectification for Chinese text in natural scene images

What is visible across the visual field?

Scale-Invariant Multidirectional License Plate Detection with the Network Combining Indirect and Direct Branches.

Attractiveness in the Eyes: A Possibility of Positive Loop between Transient Pupil Constriction and Facial Attraction.

Convolutional Neural Network for the Semantic Segmentation of Remote Sensing Images

Automatic Text Segmentation and Recognition in Natural Scene Images Using Msocr

Estimation of tea leaf blight severity in natural scene images

Robust license plate detection and recognition with automatic rectification

Stopping Criterion during Rendering of Computer-Generated Images Based on SVD-Entropy.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Natural Scene Images Research Articles

Related Topics

Articles published on Natural Scene Images

Multi‐lingual text detection and identification using agile convolutional neural network

Subtask Attention Based Object Detection in Remote Sensing Images

Visual memorability in the absence of semantic content

Naturalness and aesthetics of colors – Preference for color compositions perceived as natural

COCO-Search18 fixation dataset for predicting goal-directed attention control

Text detection and localization in scene images: a broad review

Multi-granularity Deep Local Representations for Irregular Scene Text Recognition

Towards Accurate Scene Text Detection with Bidirectional Feature Pyramid Network

Text detection and script identification in natural scene images using deep learning

Character-based handwritten text transcription with attention networks

Sparse elastic net multi-label rank support vector machine with pinball loss and its applications

Rule-based perspective rectification for Chinese text in natural scene images

What is visible across the visual field?

Scale-Invariant Multidirectional License Plate Detection with the Network Combining Indirect and Direct Branches.

Attractiveness in the Eyes: A Possibility of Positive Loop between Transient Pupil Constriction and Facial Attraction.

Convolutional Neural Network for the Semantic Segmentation of Remote Sensing Images

Automatic Text Segmentation and Recognition in Natural Scene Images Using Msocr

Estimation of tea leaf blight severity in natural scene images

Robust license plate detection and recognition with automatic rectification

Stopping Criterion during Rendering of Computer-Generated Images Based on SVD-Entropy.