Abstract
The road scene segmentation is an important problem which is helpful for a higher level of the scene understanding. This article presents a novel approach for image semantic segmentation of road scenes via a hierarchical graph-based inference. A deep encoder–decoder network is first applied for a fast pixel-wise classification. Then, hierarchical graph-based inference is performed to get an accurate segmentation result. In the inference process, all the object classes are grouped into fewer categories which contains at least one class. The category labels are assigned to image superpixels using Markov random field model. For each category, a pixel-level labeling based on fully connected conditional random fields is performed to divide image into different classes. After the inference for all categories, the results are integrated together to get the final segmentation. In additional to low-level affinity functions, the feature maps from network are integrated in pairwise potentials of the graphical models. This hierarchical inference scheme can alleviate the confusion of classes belonging to different categories. It performs well for small objects without adding more computational burden. Both qualitative and quantitative assessments are adopted to evaluate the proposed method. The results on benchmark data sets prove the effectiveness of the proposed hierarchical scheme, and the performance is competitive with the state-of-the-art methods.
Highlights
Road scene understanding is a key technique for the success of many vision-based applications such as autonomous driving, driver assistance, and personal navigation
A deep encoder–decoder network is applied for a fast pixel-wise classification
A pixel-level labeling operation based on dense conditional random field (CRF) is performed to divide image into different classes that belong to this category
Summary
Road scene understanding is a key technique for the success of many vision-based applications such as autonomous driving, driver assistance, and personal navigation. Semantic segmentation of all the scene objects is an important step toward complete understanding of the scene. This problem, called scene parsing and scene understanding for simplicity, has become an actively studied problem in recent years. The goal of semantic image segmentation is to segment all the objects in an image and identify their categories. It is a challenge task since it combines the traditional problems of object detection, segmentation, and recognition in a single process.[1] The problem is even harder for a road scene due to the wide variety of categories and complex environment on it. The road scene is well structured and can be handled by standard
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.