Complex Road Scenes Research Articles

Road extraction from remote sensing images is difficult, particularly in complex scenes and for small targets. To address these challenges, we propose CDAU-Net, a deep learning network built on the U-Net architecture with two enhancements. First, we integrate CoordConv convolutions into the first layer of the U-Net encoder and the last layer of the decoder so that the network processes the spatial information in remote sensing images more effectively. Second, we design a Deep Dual Cross Attention (DDCA) mechanism to capture long-range dependencies within images, a critical factor in remote sensing image analysis. DDCA replaces the skip connections of U-Net: it takes the feature maps of the first four encoder scales and generates four corresponding outputs, which are then connected to the decoder stage to further capture long-range dependencies in the imagery. We validate CDAU-Net extensively on the Massachusetts Road Dataset and the DeepGlobe Road Dataset, both of which cover a diverse range of complex road scenes and are therefore well suited to evaluating road extraction algorithms. The experimental results show that CDAU-Net outperforms existing state-of-the-art methods in road extraction in terms of accuracy, recall, and Intersection over Union (IoU). These findings support the effectiveness of our approach in handling complex scenes and small targets and in capturing long-range dependencies in remote sensing imagery. In sum, the design of CDAU-Net not only improves the accuracy of road extraction but also offers new perspectives for deep learning analysis of remote sensing imagery.
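
The CoordConv idea mentioned in the abstract above can be illustrated with a minimal sketch in plain Python. It assumes the common CoordConv formulation, in which two extra channels holding normalized row and column coordinates are concatenated to the input before an ordinary convolution. The abstract does not say how CDAU-Net scales or arranges these channels, so the [-1, 1] normalization, the function names `add_coord_channels` and `pointwise_conv`, and the use of a 1x1 convolution are illustrative assumptions, not the authors' implementation.

```python
def add_coord_channels(image):
    """Append normalized row (y) and column (x) coordinate channels.

    `image` is a list of channels, each a list of rows of floats
    (shape C x H x W). Returns a new list with C + 2 channels.
    The [-1, 1] scaling is an assumption, not taken from the abstract.
    """
    height = len(image[0])
    width = len(image[0][0])

    def norm(index, size):
        # Map index 0..size-1 to [-1, 1]; a single-pixel axis maps to 0.
        return 0.0 if size == 1 else 2.0 * index / (size - 1) - 1.0

    y_channel = [[norm(r, height) for _ in range(width)] for r in range(height)]
    x_channel = [[norm(c, width) for c in range(width)] for _ in range(height)]
    copied = [[row[:] for row in channel] for channel in image]
    return copied + [y_channel, x_channel]


def pointwise_conv(channels, weights):
    """A 1x1 convolution with one output channel: a per-pixel weighted sum."""
    height = len(channels[0])
    width = len(channels[0][0])
    return [
        [sum(w * ch[r][c] for w, ch in zip(weights, channels)) for c in range(width)]
        for r in range(height)
    ]


# Hypothetical 1 x 2 x 3 input (one channel, two rows, three columns).
image = [[[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]]
with_coords = add_coord_channels(image)
# A filter that only looks at the x-coordinate channel can now tell columns apart.
response = pointwise_conv(with_coords, [0.0, 0.0, 1.0])
```

Because the coordinate channels are ordinary inputs, the following convolution can learn to use or ignore position, which is the property a plain translation-equivariant convolution lacks.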

Urban management and survey departments have begun investigating the feasibility of acquiring data from various laser scanning systems for urban infrastructure measurements and assessments. Roadside objects such as cars, trees, traffic poles, pedestrians, bicycles and e-bicycles describe the static and dynamic urban information available for acquisition. Because of the unstructured nature of 3D point clouds, the rich targets in complex road scenes, and the varying scales of roadside objects, finely classifying these roadside objects from various point clouds is a challenging task. In this paper, we integrate two representations of roadside objects, point clouds and multiview images, in a point-group-view network named PGVNet, which classifies roadside objects into cars, trees, traffic poles, and small objects (pedestrians, bicycles and e-bicycles) from generalized point clouds. To utilize the topological information of the point clouds, we propose a graph attention convolution operation called AtEdgeConv to mine the relationships among local points and to extract local geometric features. In addition, we employ a hierarchical view-group-object architecture to reduce the redundant information between similar views and to obtain salient viewwise global features. To fuse the local geometric features from the point clouds and the global features from the multiview images, we stack an attention-guided fusion network in PGVNet. In particular, we quantify and leverage the global features as an attention mask to capture the intrinsic correlation and discriminability of the local geometric features, which helps to distinguish different roadside objects with similar shapes.
To verify the effectiveness and generalization of our method, we conduct extensive experiments on six test datasets of different urban scenes, captured by different laser scanning systems, including mobile laser scanning (MLS) systems, unmanned aerial vehicle (UAV)-based laser scanning (ULS) systems and backpack laser scanning (BLS) systems. Experimental results and comparisons with state-of-the-art methods demonstrate that the PGVNet model effectively identifies cars, trees, traffic poles and small objects from generalized point clouds and achieves promising performance on roadside object classification, with an overall accuracy of 95.76%. Our code is released at https://github.com/flidarcode/PGVNet.
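
As a rough illustration of using a global feature as an attention mask over local features, the sketch below shows one plausible reading in plain Python. It is not the PGVNet fusion network: the scaled dot-product score, the softmax over points, the concatenation at the end, and the name `attention_fuse` are all assumptions made for this example, and the abstract does not specify any of them.

```python
import math


def attention_fuse(local_feats, global_feat):
    """Reweight per-point local features by their agreement with a global feature.

    local_feats: list of N feature vectors (each of length D) from the point cloud.
    global_feat: one feature vector of length D from the multiview images.
    Returns (mask, fused), where mask holds N weights that sum to 1 and
    fused is the mask-weighted local feature concatenated with global_feat.
    All design choices here are illustrative assumptions.
    """
    dim = len(global_feat)
    # Scaled dot-product score between each local feature and the global feature.
    scores = [
        sum(l * g for l, g in zip(feat, global_feat)) / math.sqrt(dim)
        for feat in local_feats
    ]
    # Softmax over points; subtracting the maximum keeps exp() numerically stable.
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    mask = [e / total for e in exps]
    # Mask-weighted sum of the local features, one value per feature dimension.
    pooled = [
        sum(m * feat[k] for m, feat in zip(mask, local_feats)) for k in range(dim)
    ]
    return mask, pooled + list(global_feat)


# Hypothetical toy input: two points with 2-D local features and one global feature.
mask, fused = attention_fuse([[1.0, 0.0], [0.0, 1.0]], [1.0, 0.0])
```

In this toy input the first point aligns with the global feature and therefore receives the larger weight, which is the intuition behind letting global context emphasize the most discriminative local geometry.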

Articles published on Complex Road Scenes

Improved Road Target Detection Algorithm for YOLOv7-Tiny

Real-Time Semantic Segmentation Algorithm for Street Scenes Based on Attention Mechanism and Feature Fusion

TTIS-YOLO: a traffic target instance segmentation paradigm for complex road scenarios

A novel real-time object detection method for complex road scenes based on YOLOv7-tiny

MRD-YOLO: A Multispectral Object Detection Algorithm for Complex Road Scenes

PODB: A learning-based polarimetric object detection benchmark for road scenes in adverse weather conditions

SNCE-YOLO: An Improved Target Detection Algorithm in Complex Road Scenes

Improvement of Road Instance Segmentation Algorithm Based on the Modified Mask R-CNN

PatchAugNet: Patch feature augmentation-based heterogeneous point cloud place recognition in large-scale street scenes

CDAU-Net: A Novel CoordConv-Integrated Deep Dual Cross Attention Mechanism for Enhanced Road Extraction in Remote Sensing Imagery

Enhanced YOLOv5: An Efficient Road Object Detection Method

ASA-BiSeNet: improved real-time approach for road lane semantic segmentation of low-light autonomous driving road scenes

Weakly supervised multi-class semantic video segmentation for road scenes

A Multi-Scale Traffic Object Detection Algorithm for Road Scenes Based on Improved YOLOv5

Research on Infrared and Visible Image Registration Algorithm for Complex Road Scenes

Hierarchical Fine Extraction Method of Street Tree Information from Mobile LiDAR Point Cloud Data

Curb Detection and Compensation Method for Autonomous Driving via a 3-D-LiDAR Sensor

Anchor-Free Object Detection with Scale-Aware Networks for Autonomous Driving

A joint deep learning network of point clouds and multiple views for roadside object classification from lidar point clouds

AN APPROACH FOR VEHICLE TARGET DETECTION USING YOLO V3
