Abstract

Scene parsing aims to recognize the object category of every pixel in scene images, and it plays a central role in image content understanding and computer vision applications. However, accurate scene parsing from unconstrained real-world data is still a challenging task. In this paper, we present the non-parametric Spatially Constrained Local Prior (SCLP) for scene parsing on realistic data. For a given query image, the non-parametric SCLP is learnt by first retrieving a subset of most similar training images to the query image and then collecting prior information about object co-occurrence statistics between spatial image blocks and between adjacent superpixels from the retrieved subset. The SCLP is powerful in capturing both long- and short-range context about inter-object correlations in the query image and can be effectively integrated with traditional visual features to refine the classification results. Our experiments on the SIFT Flow and PASCAL-Context benchmark datasets show that the non-parametric SCLP used in conjunction with superpixel-level visual features achieves one of the top performance compared with state-of-the-art approaches.

Highlights

  • Scene parsing aims to assign every pixel of a query image to a correct semantic category, such as bus, road, sky and tree

  • To address the drawbacks above, this paper presents the non-parametric Spatially Constrained Local Prior (SCLP) for scene parsing from complicated real-world data

  • The power of non-parametric SCLP lies in its ability to represent both global and local context from the most similar training subset, separately for each query image, and further integrate contextual and visual features to improve the prediction results

Read more

Summary

Introduction

Scene parsing aims to assign every pixel of a query image to a correct semantic category, such as bus, road, sky and tree. To address the drawbacks above, this paper presents the non-parametric Spatially Constrained Local Prior (SCLP) for scene parsing from complicated real-world data It first retrieves a subset of similar training images to a query image and collecting queryspecific SCLP contextual features from the retrieved subset to represent more specific and useful prior context about inter-class correlations for the query image. The power of non-parametric SCLP lies in its ability to represent both global and local context from the most similar training subset, separately for each query image, and further integrate contextual and visual features to improve the prediction results. It shows state-of-the-art performance on the SIFT Flow and PASCAL-Context benchmark datasets.

Related Work
Proposed Non-parametric SCLP Approach
Retrieval of Similar Training Images for Every Test Image
Extraction of Non-parametric SCLP
Visual Feature-Based Class Probability Prediction
Fusion of Visual and Contextual Probabilities
Experimental Results and Analysis
Evaluation Datasets and System Parameters
Performance Comparisons with State-of-the-Art Approaches
Conclusions and Future Work
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call