Abstract

Periodontal disease is a chronic but treatable condition which often does not cause pain during the initial stages of the illness. Lack of awareness of symptoms can delay initiation of treatment and worsen health. The aim of this study was to develop and compare different risk prediction models for periodontal disease using machine learning algorithms. We obtained information on risk factors for periodontal disease from the Korea National Health and Nutrition Examination Survey (KNHANES) dataset. Principal component analysis and an auto-encoder were used to extract data on risk factors for periodontal disease. A synthetic minority oversampling technique algorithm was used to solve the problem of data imbalance. We used a combination of logistic regression analysis, support vector machine (SVM) learning, random forest, and AdaBoost to classify and compare risk prediction models for periodontal disease. In cases where we used principal component analysis (PCA) to extract risk factors, the recall was higher than the feature selection method in the logistic regression and support-vector machine learning models. AdaBoost’s recall was 0.98, showing the highest performance of both feature selection and PCA. The F1 score showed relatively high performance in AdaBoost, logistic regression, and SVM learning models. By using the risk factors extracted from the research results and the predictive model based on machine learning, it will be able to help in the prevention and diagnosis of periodontal disease, and it will be used to study the relationship with various diseases related to periodontal disease.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.