Objects In Scene Research Articles

During the last few years, a heightened interest has been shown in classifying scene images depicting diverse robotic environments. The surge in interest can be attributed to significant improvements in visual sensor technology, which has enhanced image analysis capabilities. Advances in vision technology have a major impact on the areas of multiple object detection and scene understanding. These tasks are an integral part of a variety of technologies, including integrating scenes in augmented reality, facilitating robot navigation, enabling autonomous driving systems, and improving applications in tourist information. Despite significant strides in visual interpretation, numerous challenges persist, encompassing semantic understanding, occlusion, orientation, insufficient availability of labeled data, uneven illumination including shadows and lighting, variation in direction, and object size and changing background. To overcome these challenges, we proposed an innovative scene recognition framework, which proved to be highly effective and yielded remarkable results. First, we perform preprocessing using kernel convolution on scene data. Second, we perform semantic segmentation using UNet segmentation. Then, we extract features from these segmented data using discrete wavelet transform (DWT), Sobel and Laplacian, and textual (local binary pattern analysis). To recognize the object, we have used deep belief network and then find the object-to-object relation. Finally, AlexNet is used to assign the relevant labels to the scene based on recognized objects in the image. The performance of the proposed system was validated using three standard datasets: PASCALVOC-12, Cityscapes, and Caltech 101. The accuracy attained on the PASCALVOC-12 dataset exceeds 96% while achieving a rate of 95.90% on the Cityscapes dataset. Furthermore, the model demonstrates a commendable accuracy of 92.2% on the Caltech 101 dataset. This model showcases noteworthy advancements beyond the capabilities of current models.

Read full abstract

Accurately counting the number of dense objects in an image, such as pedestrians or vehicles, is a challenging and practical task. The existing density map regression methods based on CNN are mainly used to count a class of dense objects in a single scene. However, in complex traffic scenes, objects such as vehicles and pedestrians usually exist at the same time, and multiple classes of dense objects need to be counted simultaneously. To solve the above issues, we propose a new multiple types of dense object counting method based on feature enhancement, which can enhance the features of dense counting objects in complex traffic scenes to realize the classification and regression counting of dense vehicles and people. The counting model consists of the regression subnet and the classification subnet. The regression subnet is primarily used to generate two-channel predicted density maps, mainly including the initial feature layer and the feature enhancement layer, in which the feature enhancement layer can enhance the classification features and regression counting features of dense objects in complex traffic scenes. The classification subnet mainly supervises classifying dense vehicles and people into two feature channels to assist the regression counting task of the regression subnets. Our method is compared on VisDrone+ datasets, ApolloScape+ datasets, and UAVDT+ datasets. The experimental results show that the method counts two kinds of dense objects simultaneously and outputs a high-quality two-channel predicted density map. The counting performance is better than the state-of-the-art counting network in dense people and vehicle counting. In future work, we will further improve the feature extraction ability of the model in complex traffic scenes to classify and count a variety of dense objects such as cars, pedestrians, and non-motor vehicles.

Read full abstract

Objects In Scene Research Articles

Related Topics

Articles published on Objects In Scene

Remote intelligent perception system for multi-object detection.

Counting dense object of multiple types based on feature enhancement.

FocusDet: an efficient object detector for small object

A Multi-Feature Fusion Method for Urban Functional Regions Identification: A Case Study of Xi’an, China

Oculomotor routines for perceptual judgments.

Category-based depth incorporation for salient object ranking

Insights into Image Understanding: Segmentation Methods for Object Recognition and Scene Classification

Exploring the Semantic-Inconsistency Effect in Scenes Using a Continuous Measure of Linguistic-Semantic Similarity

Limited information-processing capacity in vision explains number psychophysics.

On investigating drivers’ attention allocation during partially-automated driving

A Study of YOLO-Based Object Detection for Visually Impaired Individuals

Saccades to partially occluded objects: Perceptual completion mediates oculomotor control.

IDet3D: Towards Efficient Interactive Object Detection for LiDAR Point Clouds

SeqRank: Sequential Ranking of Salient Objects

Exploiting Polarized Material Cues for Robust Car Detection

Learning Sliding Policy of Flat Multi-target Objects in Clutter Scenes

A number sense as an emergent property of the manipulating brain

Frequency and spatial based multi-layer context network (FSCNet) for remote sensing scene classification

A new framework for improving semantic segmentation in aerial imagery

Constructing a game engine: A proposed game engine architecture course for undergraduate students

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Objects In Scene Research Articles

Related Topics

Articles published on Objects In Scene

Remote intelligent perception system for multi-object detection.

Counting dense object of multiple types based on feature enhancement.

FocusDet: an efficient object detector for small object

A Multi-Feature Fusion Method for Urban Functional Regions Identification: A Case Study of Xi’an, China

Oculomotor routines for perceptual judgments.

Category-based depth incorporation for salient object ranking

Insights into Image Understanding: Segmentation Methods for Object Recognition and Scene Classification

Exploring the Semantic-Inconsistency Effect in Scenes Using a Continuous Measure of Linguistic-Semantic Similarity

Limited information-processing capacity in vision explains number psychophysics.

On investigating drivers’ attention allocation during partially-automated driving

A Study of YOLO-Based Object Detection for Visually Impaired Individuals

Saccades to partially occluded objects: Perceptual completion mediates oculomotor control.

IDet3D: Towards Efficient Interactive Object Detection for LiDAR Point Clouds

SeqRank: Sequential Ranking of Salient Objects

Exploiting Polarized Material Cues for Robust Car Detection

Learning Sliding Policy of Flat Multi-target Objects in Clutter Scenes

A number sense as an emergent property of the manipulating brain

Frequency and spatial based multi-layer context network (FSCNet) for remote sensing scene classification

A new framework for improving semantic segmentation in aerial imagery

Constructing a game engine: A proposed game engine architecture course for undergraduate students