Object Detection In Natural Scenes Research Articles

Multi-scale object detection in natural scenes is still challenging. To enhance multi-scale perception, some algorithms combine lower-level and higher-level information via multi-scale feature fusion strategies. However, the inherent spatial properties among instances and the relations between foreground and background are ignored. In addition, the human-defined “center-based” regression quality evaluation strategy, which predicts a high-to-low score based on a linear relationship with the distance to the center of the ground-truth box, is not robust to scale-variant objects. In this work, we propose a Depth-Guided Progressive Network (DGPNet) for multi-scale object detection. Specifically, besides predicting classification and localization, the network estimates depth and uses it to guide the image features in a weighted manner to obtain a better spatial representation. Depth estimation and 2D object detection are thus learned simultaneously in a unified network, where the depth features are merged as auxiliary information into the detection branch to enhance the discrimination among multi-scale objects. Moreover, to overcome the difficulty of empirically fitting the localization quality function, high-quality predicted boxes on scale-variant objects are obtained more adaptively by an IoU-aware progressive sampling strategy. We divide the sampling process into two stages, i.e., “statistical-aware” and “IoU-aware”. The former selects thresholds for positive samples based on statistical characteristics of multi-scale instances, and the latter further selects high-quality samples by IoU on the basis of the former. As a result, the final ranking scores better reflect the quality of localization. Experiments verify that our method outperforms state-of-the-art methods on the KINS and Cityscapes datasets.
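The two-stage sampling idea in the abstract above can be illustrated with a small sketch. The abstract does not give the actual formulas, so everything specific here is an assumption made for illustration: the stage-1 threshold is taken as the mean plus standard deviation of candidate IoUs (an ATSS-style rule), the stage-2 cut-off `iou_keep` is a fixed hypothetical value, and the function names are invented. It is not the authors' implementation.

```python
from statistics import mean, pstdev


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def progressive_sample(gt_box, candidate_boxes, predicted_boxes, iou_keep=0.5):
    """Return (stage1, stage2) index lists for one ground-truth box.

    candidate_boxes[i] is the anchor/candidate box of sample i and
    predicted_boxes[i] is the box regressed from it (hypothetical inputs).
    """
    # Stage 1 ("statistical-aware"). ASSUMPTION: the threshold is
    # mean + std of the candidate IoUs, so it adapts to each instance.
    cand_ious = [iou(c, gt_box) for c in candidate_boxes]
    threshold = mean(cand_ious) + pstdev(cand_ious)
    stage1 = [i for i, v in enumerate(cand_ious) if v >= threshold]

    # Stage 2 ("IoU-aware"). ASSUMPTION: among stage-1 positives, keep
    # those whose *predicted* box overlaps the ground truth by iou_keep.
    stage2 = [i for i in stage1 if iou(predicted_boxes[i], gt_box) >= iou_keep]
    return stage1, stage2
```

Calling `progressive_sample(gt, candidates, predictions)` returns the indices that survive each stage; the second list is always a subset of the first, which mirrors the "on the basis of the former" wording in the abstract.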

Selective brain responses to objects arise within a few hundred milliseconds of neural processing, suggesting that visual object recognition is mediated by rapid feed-forward activations. Yet disruption of neural responses in early visual cortex beyond feed-forward processing stages affects object recognition performance. Here, we unite these discrepant findings by reporting that object recognition involves enhanced feedback activity (recurrent processing within early visual cortex) when target objects are embedded in natural scenes that are characterized by high complexity. Human participants performed an animal target detection task on natural scenes with low, medium or high complexity as determined by a computational model of low-level contrast statistics. Three converging lines of evidence indicate that feedback was selectively enhanced for high complexity scenes. First, functional magnetic resonance imaging (fMRI) activity in early visual cortex (V1) was enhanced for target objects in scenes with high, but not low or medium, complexity. Second, event-related potentials (ERPs) evoked by target objects were selectively enhanced at feedback stages of visual processing (from ~220 ms onwards) for high complexity scenes only. Third, behavioral performance for high complexity scenes deteriorated when participants were pressed for time and thus less able to incorporate the feedback activity. Modeling of the reaction time distributions using drift diffusion revealed that object information accumulated more slowly for high complexity scenes, with evidence accumulation being coupled to trial-to-trial variation in the EEG feedback response. Together, these results suggest that while feed-forward activity may suffice to recognize isolated objects, the brain employs recurrent processing more adaptively in naturalistic settings, using minimal feedback for simple scenes and increasing feedback for complex scenes.
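The drift diffusion result mentioned in the abstract above (slower evidence accumulation for high complexity scenes) can be made concrete with a minimal simulation. This is a generic textbook drift diffusion sketch, not the authors' fitted model: the drift rates, bound, noise level, and time step are arbitrary illustrative values, non-decision time is omitted, and the function names are invented.

```python
import random


def simulate_ddm(drift, bound=1.0, noise=1.0, dt=0.001, max_time=5.0, rng=None):
    """Simulate one trial; return (decision_time, reached_upper_bound).

    Evidence starts at 0 and accumulates with the given drift plus
    Gaussian noise until it crosses +bound or -bound, or until max_time
    elapses (in which case reached_upper_bound is False).
    """
    rng = rng or random.Random()
    x, t = 0.0, 0.0
    step_sd = noise * dt ** 0.5  # Euler-Maruyama noise scaling
    while abs(x) < bound and t < max_time:
        x += drift * dt + rng.gauss(0.0, step_sd)
        t += dt
    return t, x >= bound


def mean_rt(drift, n_trials=200, seed=0):
    """Average decision time over n_trials with a fixed seed."""
    rng = random.Random(seed)
    total = sum(simulate_ddm(drift, rng=rng)[0] for _ in range(n_trials))
    return total / n_trials


# ASSUMPTION: a lower drift rate stands in for "high complexity" scenes.
slow = mean_rt(drift=1.0)  # slower accumulation
fast = mean_rt(drift=3.0)  # faster accumulation
```

Under these assumed parameters, the lower drift rate yields longer average decision times, which is the qualitative pattern the abstract attributes to high complexity scenes.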

Articles published on Object Detection In Natural Scenes

Spotting the Unseen: Reciprocal Consensus Network Guided by Visual Archetypes

Apple object detection based on improved YOLOX

Depth-Guided Progressive Network for Object Detection

Does Semantic Activation Affect Human Object Detection in Natural Scenes?

Real-Time Water Surface Object Detection Based on Improved Faster R-CNN

Scene complexity modulates degree of feedback activity during object detection in natural scenes

Object detection in natural scenes: Independent effects of spatial and category-based attention

The role of Weibull image statistics in rapid object detection in natural scenes

Hierarchical feed-forward network for object detection tasks
