Detection of Vocal Fold Image Obstructions in High-Speed Videoendoscopy During Connected Speech in Adductor Spasmodic Dysphonia: A Convolutional Neural Networks Approach

Ahmed M Yousef,Dimitar D Deliyski,Stephanie R.C Zacharias,Maryam Naghibolhosseini

doi:10.1016/j.jvoice.2022.01.028

Ahmed M Yousef, Dimitar D Deliyski + Show 2 more

Open Access

https://doi.org/10.1016/j.jvoice.2022.01.028

Copy DOI

Abstract

Adductor spasmodic dysphonia (AdSD) is a neurogenic voice disorder, affecting the intrinsic laryngeal muscle control. AdSD leads to involuntary laryngeal spasms and only reveals during connected speech. Laryngeal high-speed videoendoscopy (HSV) coupled with a flexible fiberoptic endoscope provides a unique opportunity to study voice production and visualize the vocal fold vibrations in AdSD during speech. The goal of this study is to automatically detect instances during which the image of the vocal folds is optically obstructed in HSV recordings obtained during connected speech. HSV data were recorded from vocally normal adults and patients with AdSD during reading of the "Rainbow Passage", six CAPE-V sentences, and production of the vowel /i/. A convolutional neural network was developed and trained as a classifier to detect obstructed/unobstructed vocal folds in HSV frames. Manually labelled data were used for training, validating, and testing of the network. Moreover, a comprehensive robustness evaluation was conducted to compare the performance of the developed classifier and visual analysis of HSV data. The developed convolutional neural network was able to automatically detect the vocal fold obstructions in HSV data in vocally normal participants and AdSD patients. The trained network was tested successfully and showed an overall classification accuracy of 94.18% on the testing dataset. The robustness evaluation showed an average overall accuracy of 94.81% on a massive number of HSV frames demonstrating the high robustness of the introduced technique while keeping a high level of accuracy. The proposed approach can be used for efficient analysis of HSV data to study laryngeal maneuvers in patients with AdSD during connected speech. Additionally, this method will facilitate development of vocal fold vibratory measures for HSV frames with an unobstructed view of the vocal folds. Indicating parts of connected speech that provide an unobstructed view of the vocal folds can be used for developing optimal passages for precise HSV examination during connected speech and subject-specific clinical voice assessment protocols.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Detection of Vocal Fold Image Obstructions in High-Speed Videoendoscopy During Connected Speech in Adductor Spasmodic Dysphonia: A Convolutional Neural Networks Approach

Abstract

Talk to us

Similar Papers

More From: Journal of Voice

Lead the way for us

Journal: Journal of Voice	Publication Date: Mar 16, 2022
Citations: 10

Similar Papers

Deep-Learning-Based Representation of Vocal Fold Dynamics in Adductor Spasmodic Dysphonia during Connected Speech in High-Speed Videoendoscopy
Ahmed M Yousef ... Maryam Naghibolhosseini
Journal of Voice | VOL. -
Ahmed M Yousef, et. al.Ahmed M Yousef ... Maryam Naghibolhosseini
01 Sep 2022
Journal of Voice | VOL. -

Vibratory Onset of Adductor Spasmodic Dysphonia and Muscle Tension Dysphonia: A High-Speed Video Study✰
Wenli Chen ... Thomas Murry
Journal of Voice | VOL. 34
Wenli Chen, et. al.Wenli Chen ... Thomas Murry
28 Dec 2018
Journal of Voice | VOL. 34

Spatial Segmentation for Laryngeal High-Speed Videoendoscopy in Connected Speech
Ahmed M Yousef ... Maryam Naghibolhosseini
Journal of Voice | VOL. 37
Ahmed M Yousef, et. al.Ahmed M Yousef ... Maryam Naghibolhosseini
27 Nov 2020
Journal of Voice | VOL. 37

A Deep Learning Approach for Quantifying Vocal Fold Dynamics During Connected Speech Using Laryngeal High-Speed Videoendoscopy.
Ahmed M Yousef ... Maryam Naghibolhosseini
Journal of speech, language, and hearing research : JSLHR | VOL. 65
Ahmed M Yousef, et. al.Ahmed M Yousef ... Maryam Naghibolhosseini
23 May 2022
Journal of speech, language, and hearing research : JSLHR | VOL. 65

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Detection of Vocal Fold Image Obstructions in High-Speed Videoendoscopy During Connected Speech in Adductor Spasmodic Dysphonia: A Convolutional Neural Networks Approach

Abstract

Talk to us

Similar Papers

More From: Journal of Voice