Abstract

Semantic segmentation is one of the most commonly used techniques for road scene understanding. Recently developed deep learning-based semantic segmentation networks are typically based on the encoder-decoder structure and have made great progress in road scene understanding. However, these conventional networks still encounter difficulties in recovering spatial details. To overcome this problem, we introduce a lightweight prediction and boundary-aware refinement module that can hierarchically refine the segmentation results with spatial details. The proposed refinement module has two attention units called the upper-level prediction attention unit and the upper-level boundary attention unit. The upper-level prediction attention unit emphasizes the features in the regions that need to be refined by using predicted class probability from the upper-level, whereas the upper-level boundary attention unit focuses on the features near the semantic boundary of the upper-level segmentation result. By using the proposed prediction and boundary-aware refinement module in the decoder network, the segmentation result can gradually be improved in a top-down manner to a finer and more complete one. Experimental results on the Cityscapes and CamVid datasets demonstrate that the proposed prediction and boundary attention-based refinement module can achieve considerable performance improvement in segmentation accuracy with a marginal increase in computational complexity.

Highlights

  • In recent years, a large number of studies have addressed in the field of autonomous driving [1], [2]

  • We propose a lightweight feature refinement module, called prediction and boundary-aware refinement module (PBRM), that can effectively recover the spatial details of segmentation result in a top-down manner

  • In the proposed upper-level boundary attention (UBA) unit, we introduce a lightweight boundary attention mask generation (BAMG) block that consists of Conv layers

Read more

Summary

Introduction

A large number of studies have addressed in the field of autonomous driving [1], [2]. An essential component of autonomous driving, involves multiple tasks, such as detecting drivable areas [3]–[8], pedestrians, and vehicles [9]–[11]; it provides crucial information for the path planning of autonomous vehicles. One of the most commonly used techniques for road scene understanding is semantic segmentation, which aims to classify each pixel of an image into one of the predefined classes. We focus on the semantic segmentation of road scenes, which means segmenting on-road objects and background materials such as road, sidewalk, sky, and vegetation

Objectives
Methods
Findings
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.