Objective: To establish a prediction model for the identifying of cataplexy facial features based on clinical shooting videos by using a deep learning image recognition network ResNet-18. Methods: A cross-sectional study. Twenty-five narcolepsy type 1 patients who were first diagnosed and never received treatment and 25 healthy controls recruited by advertisement in the Second Affiliated Hospital of Nanchang University from 2020 to 2023.After image preprocessing, a total of 1 180 images were obtained, including 583 cataplexy faces and 597 normal faces.90% were selected as the training set and validation set, and then expanded the data by 5 times.80% of the expanded data set was extracted as the training set and 20% as the validation set, that is, the number of the training set was (583+597)×0.9×0.8×5=4 248, the number of the validation set was (583+597)×0.9×0.2×5=1 062. The data sets for training and validation were used train parameters to establish the model and were trained through the five-fold cross-validation method, to establish the ResNet-18 cataplexy face recognition model via transfer learning.10% (118 images) of the original non-amplified images were extracted as the test set. The test set data did not participate in data enhancement and model training, and was only used to evaluate the final performance of the model. Finally, ResNet-18 was compared with VGG-16, ResNet-34 and Inception V3 deep learning models, and the receiver operating characteristic curve was used to evaluate the value of ResNet-18 image recognition network in cataplexy face recognition. Results: Among 25 patients with narcolepsy type 1, 15 were males and 10 were females, aged [M (Q1, Q3)] of 14.0(11.0, 20.5) years.Among 25 healthy controls, 14 were males and 11 were females, with a median age of 16.0(14.4, 23.0) years.The overall accuracy of ResNet-18 image recognition network in the test set was 90.9%, the sensitivity was 96.4% and the specificity was 85.2%. The area under the ROC curve was 0.99(95%CI:0.96-1.00). The ResNet-18 model parameter amount was 11.69 M, the floating point operation amount was 1 824.03 M, and the single image recognition time was 5.9 ms. Conclusions: The cataplexy face prediction model built based on the deep learning image recognition network ResNet-18 has a high accuracy in identifying cataplexy faces.
Read full abstract