Amelogenesis imperfecta (AI) is a genetic disease characterized by poor formation of tooth enamel. AI occurs due to mutations, especially in AMEL, ENAM, KLK4, MMP20, and FAM83H, associated with changes in matrix proteins, matrix proteases, cell-matrix adhesion proteins, and transport proteins of enamel. Due to the wide variety of phenotypes, the diagnosis of AI is complex, requiring a genetic test to characterize it better. Thus, there is a demand for developing low-cost, noninvasive, and accurate platforms for AI diagnostics. This case-control pilot study aimed to test salivary vibrational modes obtained in attenuated total reflection fourier-transformed infrared (ATR-FTIR) together with machine learning algorithms: linear discriminant analysis (LDA), random forest, and support vector machine (SVM) could be used to discriminate AI from control subjects due to changes in salivary components. The best-performing SVM algorithm discriminates AI better than matched-control subjects with a sensitivity of 100%, specificity of 79%, and accuracy of 88%. The five main vibrational modes with higher feature importance in the Shapley Additive Explanations (SHAP) were 1010 cm-1, 1013 cm-1, 1002 cm-1, 1004 cm-1, and 1011 cm-1 in these best-performing SVM algorithms, suggesting these vibrational modes as a pre-validated salivary infrared spectral area as a potential biomarker for AI screening. In summary, ATR-FTIR spectroscopy and machine learning algorithms can be used on saliva samples to discriminate AI and are further explored as a screening tool.
Read full abstract