Image classification based on sparse coding multi-scale spatial latent semantic analysis

Tao He

doi:10.1186/s13640-019-0425-8

Abstract

In the face of huge amounts of image data, how to let the computer simulate human cognition of images and automatically classify images into different semantic categories have become a key issue in image semantic analysis. Image classification is based on some attribute of the image, and it is divided into pre-set categories. For human beings, image classification is not difficult but there is a series of problems in using computers to classify images: (1) images contain a large amount of information, which is complex, diverse, and indescribable; and (2) there is a huge difference between the physical expression of images and the conceptual information known by human beings. The traditional sparse coding method loses the spatial information when classifying images. In this paper, spatial pyramid multi-partition method is used to add spatial information restriction to the feature. The proposed multi-scale spatial latent semantic analysis method based on sparse coding has higher average classification accuracy than many existing methods, which verifies its effectiveness and robustness. Experiments also show that the classification accuracy of this paper is 2.1% higher than that of sparse coding for image classification (ScSPM) and the classification performance is 3.1% higher than that of ScSPM when the number of training images is 40. Compared with other methods, the classification performance of the proposed method is improved significantly.

Highlights

With the rapid development of high-speed Internet, the development of information storage and transmission technology, and the popularity of digital equipment, the acquisition and storage of digital color images become easier and the number of image data people come into contact with is increasing at an unprecedented rate
The methods of image representation and classification by vector quantization encoding have the following problems: (1) visual codebook generated by vector quantization encoding will lead to the loss of spatial information, (2) visual vocabulary histogram construction leads to serious quantization errors, and (3) this kind of image representation will show good classification effect only when using non-linear kernel Support vector machine (SVM) classifier
The results fully show that the latent semantic information obtained by learning the Probabilistic latent semantic analysis (PLSA) model in each local area can improve the classification accuracy of images and verify the importance of PLSA in the image classification model in this paper

Summary

Introduction

With the rapid development of high-speed Internet, the development of information storage and transmission technology, and the popularity of digital equipment, the acquisition and storage of digital color images become easier and the number of image data people come into contact with is increasing at an unprecedented rate. Faced with a huge amount of image data, how to simulate the cognitive mechanism of human understanding of images and automatically classify images into different semantic categories according to the way people understand become a key issue. According to the different ways of describing images in traditional image classification methods, image classification methods can be divided into two categories, one is based on global features and the other is based on middle-level semantic information. The methods of image representation and classification by vector quantization encoding have the following problems: (1) visual codebook generated by vector quantization encoding will lead to the loss of spatial information, (2) visual vocabulary histogram construction leads to serious quantization errors, and (3) this kind of image representation will show good classification effect only when using non-linear kernel SVM classifier.

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: EURASIP Journal on Image and Video Processing	Publication Date: Feb 8, 2019
Citations: 3	License type: open-access

R Discovery Prime

R Discovery Prime

Image classification based on sparse coding multi-scale spatial latent semantic analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Image and Video Processing

Lead the way for us

Similar Papers

Hierarchical BoW with segmental sparse coding for large scale image classification and retrieval
Jianshe Zhou ... Sheng Tang
Multimedia Tools and Applications | VOL. 77
Jianshe Zhou, et. al.Jianshe Zhou ... Sheng Tang
05 May 2018
Multimedia Tools and Applications | VOL. 77

Sparse Representation With Kernels
Shenghua Gao ... Liang-Tien Chia
IEEE Transactions on Image Processing | VOL. 22
Shenghua Gao, et. al. Shenghua Gao ... Liang-Tien Chia
21 Sep 2012
IEEE Transactions on Image Processing | VOL. 22

Robust weighted supervised sparse coding for image classification
Xiang Zhang ... Zhigang Luo
-
Xiang Zhang, et. al. Xiang Zhang ... Zhigang Luo
01 Oct 2015
01 Oct 2015

Graph-Guided Fusion Penalty Based Sparse Coding for Image Classification
Xiaoshan Yang ... Changsheng Xu
-
Xiaoshan Yang, et. al.Xiaoshan Yang ... Changsheng Xu
01 Jan 2013
01 Jan 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Image classification based on sparse coding multi-scale spatial latent semantic analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Image and Video Processing