ObjectiveTo determine the living regions of individuals based on amino acids in fingermark residues and to establish a rapid and accurate regional classification method using machine learning. Methods: A total of 71 fingermark donors from six different provinces in various regions of China were selected. The content of 18 amino acids in their fingermarks was detected using UHPLC-QQQ-MS/MS. Classification models were established using various machine learning algorithms, and the cross-validation accuracy of 72 combinations, including feature engineering, classification algorithms, and optimization algorithms, was compared. Results: UHPLC-QQQ-MS/MS successfully quantified 16 amino acids. Significant differences in the relative content of amino acids were found between the fingermarks from the eastern and western regions of China, as well as among neighboring provinces. The combination of SFS+SVM+BO was identified as the optimal classification model, achieving an accuracy of 90.14%. Conclusion: The study found regional differences in the relative content of amino acids in fingermarks and established a regional classification method combining UHPLC-QQQ-MS/MS and machine learning. The method developed in this study can be applied to incomplete or distorted fingermarks, and the experimental results can be directly used in police investigations. This research uncovers the multidimensional information carried by fingerprint substances, demonstrating innovation and application value. It not only saves and shortens investigation time and provides investigative leads, but also enables previously unusable physical evidence to play a role again, enhancing the profiling of suspects.
Read full abstract