Development of glaucoma predictive model and risk factors assessment based on supervised models

Mahyar Sharifi,Hassan Hashemi,Mohammad Hassan Emamian,Akbar Fotouhi,Toktam Khatibi,Somayeh Sadat

doi:10.1186/s13040-021-00281-8

Abstract

ObjectivesTo develop and to propose a machine learning model for predicting glaucoma and identifying its risk factors.MethodData analysis pipeline is designed for this study based on Cross-Industry Standard Process for Data Mining (CRISP-DM) methodology. The main steps of the pipeline include data sampling, preprocessing, classification and evaluation and validation. Data sampling for providing the training dataset was performed with balanced sampling based on over-sampling and under-sampling methods. Data preprocessing steps were missing value imputation and normalization. For classification step, several machine learning models were designed for predicting glaucoma including Decision Trees (DTs), K-Nearest Neighbors (K-NN), Support Vector Machines (SVM), Random Forests (RFs), Extra Trees (ETs) and Bagging Ensemble methods. Moreover, in the classification step, a novel stacking ensemble model is designed and proposed using the superior classifiers.ResultsThe data were from Shahroud Eye Cohort Study including demographic and ophthalmology data for 5190 participants aged 40-64 living in Shahroud, northeast Iran. The main variables considered in this dataset were 67 demographics, ophthalmologic, optometric, perimetry, and biometry features for 4561 people, including 4474 non-glaucoma participants and 87 glaucoma patients. Experimental results show that DTs and RFs trained based on under-sampling of the training dataset have superior performance for predicting glaucoma than the compared single classifiers and bagging ensemble methods with the average accuracy of 87.61 and 88.87, the sensitivity of 73.80 and 72.35, specificity of 87.88 and 89.10 and area under the curve (AUC) of 91.04 and 94.53, respectively. The proposed stacking ensemble has an average accuracy of 83.56, a sensitivity of 82.21, a specificity of 81.32, and an AUC of 88.54.ConclusionsIn this study, a machine learning model is proposed and developed to predict glaucoma disease among persons aged 40-64. Top predictors in this study considered features for discriminating and predicting non-glaucoma persons from glaucoma patients include the number of the visual field detect on perimetry, vertical cup to disk ratio, white to white diameter, systolic blood pressure, pupil barycenter on Y coordinate, age, and axial length.

Highlights

Glaucoma is the second cause of irreversible blindness and the fourth cause of Moderate and Severe Vision Impairment (MSVI) in the world.[1]
The data were from Shahroud Eye Cohort Study including demographic and ophthalmology data for 5190 participants aged 40-64 living in Shahroud, northeast Iran
The main variables considered in this dataset were 67 demographics, ophthalmologic, optometric, perimetry, and biometry features for 4561 people, including 4474 non-glaucoma participants and 87 glaucoma patients

Summary

Introduction

Glaucoma is the second cause of irreversible blindness and the fourth cause of Moderate and Severe Vision Impairment (MSVI) in the world.[1]. The average number of persons who go blind from glaucoma has been increased from 2.5 million persons in 1990 to 3 million persons in 2015. The average number of persons and the average number of persons who suffer from MSVI due to glaucoma has been raised from 3 million persons to 4 million persons from 1990 till 2015.[2]. It has been predicted that the number of persons suffering from glaucoma aged between 40 and 80 years old would be increased from 64.3 million persons in 2013 to 76 million persons in 2020 and 111.8 million persons in 2040.[3]. Previous studies have identified different risk factors such as age, gender, race, Interocular Pressure (IOP), diabetes, and family history for glaucoma.[4,5,6,7]

Objectives

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BioData Mining	Publication Date: Nov 24, 2021
Citations: 1	License type: open-access

R Discovery Prime

R Discovery Prime

Development of glaucoma predictive model and risk factors assessment based on supervised models

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BioData Mining

Lead the way for us

Similar Papers

Performance evaluation of ML models for preoperative prediction of HER2-low BC based on CE-CBBCT radiomic features: A prospective study.
Xianfei Chen ... Xueli Liang
Medicine | VOL. 103
Xianfei Chen, et. al.Xianfei Chen ... Xueli Liang
14 Jun 2024
Medicine | VOL. 103

An Ensemble Machine Learning and Data Mining Approach to Enhance Stroke Prediction.
Richard Wijaya ... Sultan Noman Qasem
Bioengineering (Basel, Switzerland) | VOL. 11
Richard Wijaya, et. al.Richard Wijaya ... Sultan Noman Qasem
02 Jul 2024
Bioengineering (Basel, Switzerland) | VOL. 11

Editor's evaluation: Derivation and external validation of clinical prediction rules identifying children at risk of linear growth faltering
Eduardo Franco
-
Eduardo FrancoEduardo Franco
05 Sep 2022
05 Sep 2022

Decision letter: Derivation and external validation of clinical prediction rules identifying children at risk of linear growth faltering
Andrew N Mertens ... Eduardo Franco
-
Andrew N Mertens, et. al.Andrew N Mertens ... Eduardo Franco
05 Sep 2022
05 Sep 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Development of glaucoma predictive model and risk factors assessment based on supervised models

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BioData Mining