SSNet: Learning Mid-Level Image Representation Using Salient Superpixel Network

Zhihang Ji,Lijuan Xu,Xiaopeng Hu,Fan Wang,Xiang Gao

doi:10.3390/app10010140

Abstract

In the standard bag-of-visual-words (BoVW) model, the burstiness problem of features and the ignorance of high-order information often weakens the discriminative power of image representation. To tackle them, we present a novel framework, named the Salient Superpixel Network, to learn the mid-level image representation. For reducing the impact of burstiness occurred in the background region, we use the salient regions instead of the whole image to extract local features, and a fast saliency detection algorithm based on the Gestalt grouping principle is proposed to generate image saliency maps. In order to introduce the high-order information, we propose a weighted second-order pooling (WSOP) method, which is capable of exploiting the high-order information and further alleviating the impact of burstiness in the foreground region. Then, we conduct experiments on six image classification benchmark datasets, and the results demonstrate the effectiveness of the proposed framework with either the handcrafted or the off-the-shelf CNN features.

Highlights

Image classification aims to categorize a set of unlabeled images into several predefined classes according to their visual content
We further evaluate the performance of the Salient Superpixel Network (SSNet) framework with the off-the-shelf convolutional neural networks (CNN) local features
In the second set of experiments, we evaluate the performance of the SSNet framework using the off-the-shelf CNN local features

Summary

Introduction

Image classification aims to categorize a set of unlabeled images into several predefined classes according to their visual content. Russakovsky et al [12] and Angelova et al [13] introduce location information to separate the foreground and background features and form the image representation. These methods have enhanced the discriminative ability of the representation; training an object detector is time-consuming. Theseintroducing methods have high-order information into the design of the feature descriptor contributes little to improve enhanced the discriminative ability of the representation; training an object detector is the performance of image classification tasks.

Observing

Research on Mid-Level

Related Work

Methods of Extracting the Off-the-Shelf CNN Feature

Research Work about Burstiness Issue

The Proposed Method for Image Representation

Saliency Region Detection

Measuring the Gestalt Grouping Connectedness

Saliency Map Generation

The Proposed Feature Weighting Method

Weighted Second-Order Pooling

Vectorization and Normalization

The Mid-Level Image Representation Based on SSNet

Figure

Experiments and Results

Experimental Setting

Benchmark Datasets

Effectiveness Evaluation of Weighted Second-Order Pooling

Performance Analysis of Mid-Level Representation Based on SSNet

Comparison with Related BoVW Baselines

SOA Methods

Methods

Some sample imagesfrom fromthe theFood-101

Limitations

Findings

Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SSNet: Learning Mid-Level Image Representation Using Salient Superpixel Network

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Journal: Applied Sciences	Publication Date: Dec 23, 2019
License type: CC BY 4.0

Similar Papers

Background Modeling Using Color, Disparity, and Motion Information
Jong Weon Lee ... Hyo Sung Jeon
-
Jong Weon Lee, et. al.Jong Weon Lee ... Hyo Sung Jeon
01 Jan 2004
01 Jan 2004

A variable region scalable fitting energy approach for human Metaspread chromosome image segmentation
Tanvi Arora ... Renu Dhir
Multimedia Tools and Applications | VOL. 78
Tanvi Arora, et. al.Tanvi Arora ... Renu Dhir
23 Aug 2018
Multimedia Tools and Applications | VOL. 78

Bilateral Attention Network for RGB-D Salient Object Detection.
Zhao Zhang ... Jun Xu
IEEE Transactions on Image Processing | VOL. 30
Zhao Zhang, et. al.Zhao Zhang ... Jun Xu
01 Jan 2020
IEEE Transactions on Image Processing | VOL. 30

Adaptive dynamic inference for few-shot left atrium segmentation
Jun Chen ... Guang Yang
Medical Image Analysis | VOL. 98
Jun Chen, et. al.Jun Chen ... Guang Yang
23 Aug 2024
Medical Image Analysis | VOL. 98

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SSNet: Learning Mid-Level Image Representation Using Salient Superpixel Network

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences