Bag-of-visual-words model for artificial pornographic images recognition

Fangfang Li ,Siwei Luo ,Xiyao Liu ,Beiji Zou

doi:10.1007/s11771-016-3190-1

Abstract

It is illegal to spread and transmit pornographic images over internet, either in real or in artificial format. The traditional methods are designed to identify real pornographic images and they are less efficient in dealing with artificial images. Therefore, criminals turn to release artificial pornographic images in some specific scenes, e.g., in social networks. To efficiently identify artificial pornographic images, a novel bag-of-visual-words based approach is proposed in the work. In the bag-of-words (BoW) framework, speeded-up robust feature (SURF) is adopted for feature extraction at first, then a visual vocabulary is constructed through K-means clustering and images are represented by an improved BoW encoding method, and finally the visual words are fed into a learning machine for training and classification. Different from the traditional BoW method, the proposed method sets a weight on each visual word according to the number of features that each cluster contains. Moreover, a non-binary encoding method and cross-matching strategy are utilized to improve the discriminative power of the visual words. Experimental results indicate that the proposed method outperforms the traditional method.

Full Text