Size, an important characteristic of a tympanic membrane perforation (TMP), is commonly assessed with gross estimation via visual inspection, a practice which is prone to inaccuracy. Herein, we demonstrate feasibility of a proof-of-concept computer vision model for estimating TMP size in a small set of perforations. An open-source deep learning architecture was used to train a model to segment and calculate the area of a perforation and the visualized tympanic membrane (TM) in a set of endoscopic images of mostly anterior and relatively small TMPs. The model then computed relative TMP size by calculating the ratio of perforation area to TM area. Model performance on the test dataset was compared to ground-truth manual annotations. In a validation survey, otolaryngologists were tasked with estimating the size of TMPs from the test dataset. The primary outcome was the average absolute error of model size predictions and clinician estimates compared to sizes determined by ground-truth manual annotations. The model's average absolute error for size predictions was a 0.8% overestimation for all test perforations. Conversely, among the 38 survey respondents, the average clinician error was a 11.0% overestimation (95% CI, 5.2-16.7%, p = 0.003). In a small sample of TMPs, we demonstrated a computer vision approach for estimating TMP size is feasible. Further validation studies must be done with significantly larger and more heterogenous datasets. N/A Laryngoscope, 134:2906-2911, 2024.
Read full abstract