Automated video summarization and label assignment for otoscopy videos using deep learning and natural language processing

Hamidullah Binol,Aaron C Moberly,M Khalid Khan Niazi,Charles Elmaraghy,Metin N Gurcan

doi:10.1117/12.2582009

Abstract

Tympanic membrane (TM) diseases are among the most frequent pathologies, affecting the majority of the pediatric population. Video otoscopy is an effective tool for diagnosing TM diseases. However, access to Ear, Nose, and Throat (ENT) physicians is limited in many sparsely-populated regions worldwide. Moreover, high inter- and intra-reader variability impair accurate diagnosis. This study proposes a digital otoscopy video summarization and automated diagnostic label assignment model that benefits from the synergy of deep learning and natural language processing (NLP). Our main motivation is to obtain the key visual features of TM diseases from their short descriptive reports. Our video database consisted of 173 otoscopy records from three different TM diseases. To generate composite images, we utilized our previously developed semantic segmentation-based stitching framework, SelectStitch. An ENT expert reviewed these composite images and wrote short reports describing the TM's visual landmarks and the disease for each ear. Based on NLP and a bag-of-words (BoW) model, we determined the five most frequent words characterizing each TM diagnostic category. A neighborhood components analysis was used to predict the diagnostic label of the test instance. The proposed model provided an overall F1-score of 90.2%. This is the first study to utilize textual information in computerized ear diagnostics to the best of our knowledge. Our model has the potential to become a telemedicine application that can automatically make a diagnosis of the TM by analyzing its visual descriptions provided by a healthcare provider from a mobile device.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Automated video summarization and label assignment for otoscopy videos using deep learning and natural language processing

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Developing and Analyzing Deep Learning and Natural Language Processing Systems in the Context of Medical Information Processing
Emmanuel, Victoria Nkemjika ... Ogbonna Tochukwu Loveday
International Journal of Research and Innovation in Applied Science | VOL. 9
Emmanuel, Victoria Nkemjika, et. al.Emmanuel, Victoria Nkemjika ... Ogbonna Tochukwu Loveday
01 Jan 2024
International Journal of Research and Innovation in Applied Science | VOL. 9

Deep Learning and Natural Language Processing Technology Based Display and Analysis of Modern Artwork
Xiongfei Li, Yongjun Li
Journal of Electrical Systems | VOL. 20
Xiongfei Li, Yongjun LiXiongfei Li, Yongjun Li
04 Apr 2024
Journal of Electrical Systems | VOL. 20

Deep Natural Language Processing in unstructured big data analysis and insights extraction - A quantitative study
Bibhu Dash ... Azad Ali
-
Bibhu Dash, et. al.Bibhu Dash ... Azad Ali
15 Dec 2022
15 Dec 2022

Applying Deep Learning and Natural Language Processing in Cancer: A Survey
Aiman Ahmad Abusamra ... Areej M R Al-Madhoun
-
Aiman Ahmad Abusamra, et. al.Aiman Ahmad Abusamra ... Areej M R Al-Madhoun
01 Sep 2021
01 Sep 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automated video summarization and label assignment for otoscopy videos using deep learning and natural language processing

Abstract

Talk to us

Similar Papers