Abstract
To compare machine learning classifiers trained on three clustering schemes to determine whether distinguishing healthy eyes from those with glaucomatous optic neuropathy (GON) can be optimized by training with clustered data. Two machine learning classifiers-quadratic discriminant analysis (QDA) and support vector machines with Gaussian kernel (SVMg)-were trained separately using standard perimetry data from the Diagnostic Innovations in Glaucoma Study (DIGS), clustered using three clustering schemes on a training data set (123 eyes/123 glaucoma patients with GON; 135 eyes/135 normal control subjects). Trained classifiers were then applied to an independent data set containing 69 eyes of 69 glaucoma patients with early visual field loss and 83 eyes of 83 normal control subjects. Two control conditions were included: unclustered data and a random assignment of locations to clusters. Areas under the receiver operating characteristic (ROC) curve ranged from 0.85 (SVMg, thresholds clustered by Glaucoma Hemifield Test sectors) to 0.92 (QDA, thresholds clustered by Garway-Heath mapping) for the training data set. Use of clustered data showed no significant optimization of sensitivity over use of unclustered data, and no single clustering method resulted in significantly higher performance in the independent data set. Sensitivities tended to be higher with QDA than with SVMg, regardless of specificity cutoff and clustering QDA performed better with the early glaucoma data set than did the SVMg. Clustering may be advantageous when data-dimension reduction is needed-for example, when combining field results with other high-dimensional data (e.g., structural imaging data)-but it is not necessary for visual field data alone.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.