Abstract

Nowadays, source camera identification, which aims to identify the source camera of images, is quite important in the field of forensics. There is a problem that cannot be ignored that the existing methods are unreliable and even out of work in the case of the small training sample. To solve this problem, a virtual sample generation-based method is proposed in this paper, combined with the ensemble learning. In this paper, after constructing sub-sets of LBP features, the authors generate a virtual sample-based on the mega-trend-diffusion (MTD) method, which calculates the diffusion range of samples according to the trend diffusion theory, and then randomly generates virtual sample according to uniform distribution within this range. In the aspect of the classifier, an ensemble learning scheme is proposed to train multiple SVM-based classifiers to improve the accuracy of image source identification. The experimental results demonstrate that the proposed method achieves higher average accuracy than the state-of-the-art, which uses a small number of samples as the training sample set.

Highlights

  • Nowadays, the digital images generation is popular and easier, which makes it possible for some individuals to upload unsuitable images for their interests or to steal images of others for commercial purposes

  • Combing the 10 virtual samples generated by box plot based MTD, the accuracy of image source identification comes to 47.51%, 64.25%, 71.86%, and 77.03%

  • From the results without virtual sample, MTDBOX and MTDRELATION based virtual sample, an important observation is that the high quality virtual samples do help the classification model improve the

Read more

Summary

Introduction

The digital images generation is popular and easier, which makes it possible for some individuals to upload unsuitable images for their interests or to steal images of others for commercial purposes. The issue of image source identification is usually modeled as a classification problem, which means decent results are expectant with enough training samples. It is well known that obtaining a large number of sufficient training samples may be very difficult, and the classifiers perform very poorly in this scenario of small training samples. It is always a big challenge when there are only a small set of labeled images used as references in the practical forensic application. Many methods are proposed for the small training sample problem, which are mainly divided into three categories.

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call