A novel technique is introduced to predict the printer model used to produce a given document. Samples containing only a few letters printed under varying conditions (i.e., different printing modes, letter types, fonts) were collected to establish a dataset of 41 inkjet printer models from common manufacturers, such as HP, Canon, and Epson. Morphological features were analyzed by extraction of image features using several algorithms in a series of microscopic images and a Wilcoxon test was used to measure the significance of variations between printed samples. Significant differences between various printing conditions might post potential challenge to questioned document examination. Discriminant analysis and the k-nearest neighbor (KNN) algorithm were also employed for source printer prediction under varying printing condition on 30% images with the rest images as training dataset. The results of a validation experiment demonstrated that while quadratic discriminant analysis (QDA) achieved an accuracy of 96.3%, a combination of KNN and QDA reached 98.6%. As such, this technique could aid in the forensic examination of printed documents.
Read full abstract