Classification of optical coherence tomography images using deep machine learning methods

Alexander Arzamastsev,Elena Valerievna Kulagina,Oleg Lvovitch Fabrikantov,Nataly Alexandrovna Zenkova

doi:10.17816/dd623801

Abstract

Backgraund. Optical coherence tomography (OCT) is a modern high-tech and informative method for detecting pathology of the retina and preretinal layers of the vitreous body. However, the description and interpretation of the research results require high qualifications and special training of an ophthalmologist, and significant time expenditure for the doctor and the patient. At the same time, the use of mathematical models based on artificial neural networks (ANN- models) currently makes it possible to automate many processes associated with image processing. Therefore, solving problems associated with automating the process of classifying OCT images based on ANN models is actual. Aims. To develop architectures of mathematical (computer) models based on deep learning of convolutional neural networks (CNN) for classification of OCT images of the retina. To compare the results of computational experiments conducted using Python tools in the Google Colaboratory with single-model and multi-model approaches and evaluate classification accuracy. To make conclusions about the optimal architecture of ANN models and the values of the hyperparameters used. Materials and methods. The original dataset, which was anonymized OCT images of real patients, included more than 2000 images obtained directly from the device in a resolution of 1920 × 969 × 24 BPP. The number of image classes is 12. To create the training and validation data sets, a subject area of 1100 × 550 × 24 BPP was “cut out.” Various approaches were studied: the possibility of using pretrained CNNs with transfer learning, techniques for resizing and augmenting images, as well as various combinations of hyperparameters of ANN-models. When compiling the model, the following parameters were used: Adam optimizer, categorical_crossentropy loss function, accuracy metric. All technological processes with images and ANN-models were carried out using Python language tools in Google Colaboratory. Results. Single-model and multi-model principles for classifying OCT images of the retina are proposed. Computational experiments on automated classification of such images obtained from a DRI OCT Triton 3D tomograph using various ANN model architectures showed an accuracy of 98-100% during training and validation and 85% during an additional test, which is a satisfactory result. The optimal architecture of the ANN model - a six-layer convolutional network - was selected and the values of its hyperparameters were determined. Conclusions. The results of deep training of convolutional neural network models with various architectures, their validation and testing showed satisfactory classification accuracy of retinal OCT images. These developments can be used in decision support systems in the field of ophthalmology.

Full Text