Abstract

We compared preprocessing and machine learning methods for identifying writing inks on paper using mid-infrared hyperspectral imaging and machine learning. We collected training data by making a blackened circle on paper with a pen and measuring it using hyperspectral imaging. We attempted to identify-five types of black ink that are difficult to identify using visible and near-infrared spectroscopy and can only be identified using mid-infrared spectroscopy. We initially analyzed the spectra using principal component analysis, and used the scores as a substitute for the spectrum. As an overall trend, standardization of the data for each variable had little effect on improving the discrimination rate. By contrast, using the difference spectrum from the average spectrum of the paper was effective for improving the discrimination rate. The discrimination rate was higher for the second-order derivative than for the spectrum itself, and for the first-order derivative than for the second-order derivative. Furthermore, the combination of the three had the highest discrimination rate. We tested three supervised machine learning methods: decision trees, discriminant analysis, and k-nearest neighbors. The highest classification accuracy (97.6%) was obtained for second-order discriminant analysis. Considering the discrimination rate and learning time, second-order discriminant analysis was the best method. For the measurement data of a sample with blackened circles using all the writing inks, this method identified the different inks quite well. For the measurement data of crossed line samples, using the proposed method, we clearly identified each ink. At the intersection of two lines, the line written later was detected more strongly.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.