HypernasalityNet: Deep recurrent neural network for automatic hypernasality detection

Xiyue Wang, Sen Yang, Ming Tang, Heng Yin, Hua Huang, Ling He

doi:10.1016/j.ijmedinf.2019.05.023

Abstract

Background: Patients with cleft palate cannot achieve adequate velopharyngeal closure, which results in hypernasal speech. In clinical practice, hypernasal speech is assessed subjectively by speech-language pathologists. Automatic hypernasal speech detection can provide diagnostic support for speech-language pathologists and clinicians.

Objectives: This study aims to develop a Long Short-Term Memory (LSTM) based Deep Recurrent Neural Network (DRNN) system to detect hypernasal speech in cleft palate patients, and thereby to support diagnosis for clinical operations and speech therapy. The feature mining and classification abilities of the LSTM-DRNN system are also explored.

Methods: The speech recordings comprise 14,544 Mandarin vowels collected from 144 children aged 5–12 years (72 children with hypernasality and 72 controls). This work proposes an LSTM-based DRNN system for automatic hypernasal speech detection, since the LSTM-DRNN can learn short-time dependencies of hypernasal speech. Vocal tract based features are fed into the LSTM-DRNN for deep feature mining. To verify the feature mining ability of the LSTM-DRNN, the features it projects are fed into shallow classifiers in place of the two fully connected layers and the softmax layer that otherwise follow it. For comparison, features that have not been projected by the LSTM-DRNN are fed directly into shallow classifiers. Hypernasality-sensitive vowels (/a/, /i/, and /u/) are analyzed for the first time.

Results: The LSTM-DRNN based hypernasal speech detection method reaches higher detection accuracy than shallow classifiers, since the LSTM-DRNN mines features along the time axis and through network depth simultaneously. The proposed system reaches a highest accuracy of 93.35%. The analysis of hypernasality-sensitive vowels indicates that /i/ and /u/ are the vowels most sensitive to hypernasal speech.

Conclusions: The results show that the LSTM-DRNN has robust feature mining and classification abilities. This is the first work to apply the LSTM-DRNN technique to automatic hypernasality detection in cleft palate speech. The experimental results demonstrate the potential of deep learning for pathological speech detection.
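The pipeline described in the Methods (stacked LSTM layers, then two fully connected layers and a softmax) can be sketched roughly as follows. This is only an illustrative forward pass in plain Python, not the authors' implementation: the abstract does not give layer sizes, feature dimensions, or training details, so the 13-dimensional frames, the hidden sizes, the random untrained weights, the use of the last hidden state as the "projected" feature vector, and all class and function names here are assumptions.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, v):
    # Multiply a matrix (list of rows) by a vector.
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def rand_matrix(rows, cols, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def dense(W, b, v, activation=None):
    out = [s + bi for s, bi in zip(matvec(W, v), b)]
    return [activation(u) for u in out] if activation else out


def softmax(v):
    m = max(v)  # subtract the max for numerical stability
    e = [math.exp(u - m) for u in v]
    total = sum(e)
    return [u / total for u in e]


class LSTMLayer:
    """Standard LSTM cell unrolled over a sequence (untrained, random weights)."""

    def __init__(self, n_in, n_hidden, rng):
        self.n_hidden = n_hidden
        # One weight matrix per gate, applied to the concatenation [x_t ; h_{t-1}].
        self.W = {k: rand_matrix(n_hidden, n_in + n_hidden, rng) for k in "ifoc"}
        self.b = {k: [0.0] * n_hidden for k in "ifoc"}

    def forward(self, seq):
        h = [0.0] * self.n_hidden
        c = [0.0] * self.n_hidden
        outputs = []
        for x in seq:
            z = list(x) + h
            pre = {k: dense(self.W[k], self.b[k], z) for k in "ifoc"}
            i = [sigmoid(u) for u in pre["i"]]      # input gate
            f = [sigmoid(u) for u in pre["f"]]      # forget gate
            o = [sigmoid(u) for u in pre["o"]]      # output gate
            cand = [math.tanh(u) for u in pre["c"]]  # candidate cell state
            c = [fv * cv + iv * gv for fv, cv, iv, gv in zip(f, c, i, cand)]
            h = [ov * math.tanh(cv) for ov, cv in zip(o, c)]
            outputs.append(h)
        return outputs


class HypernasalitySketch:
    """Hypothetical stand-in: stacked LSTM -> FC -> FC -> softmax (2 classes)."""

    def __init__(self, n_features=13, n_hidden=8, n_lstm_layers=2, n_fc=8, seed=0):
        rng = random.Random(seed)
        sizes = [n_features] + [n_hidden] * n_lstm_layers
        self.lstm = [LSTMLayer(a, b, rng) for a, b in zip(sizes, sizes[1:])]
        self.W1, self.b1 = rand_matrix(n_fc, n_hidden, rng), [0.0] * n_fc
        self.W2, self.b2 = rand_matrix(2, n_fc, rng), [0.0] * 2

    def project(self, seq):
        # Features "projected" by the recurrent stack: here, the last hidden state.
        for layer in self.lstm:
            seq = layer.forward(seq)
        return seq[-1]

    def predict_proba(self, seq):
        h = dense(self.W1, self.b1, self.project(seq), math.tanh)
        return softmax(dense(self.W2, self.b2, h))


# Toy input: 20 frames of 13 made-up feature values for one vowel segment.
rng = random.Random(1)
frames = [[rng.gauss(0.0, 1.0) for _ in range(13)] for _ in range(20)]
model = HypernasalitySketch()
projected = model.project(frames)    # would go to a shallow classifier
probs = model.predict_proba(frames)  # [P(control), P(hypernasal)], untrained
```

In this sketch, `project` corresponds to the comparison experiment in which LSTM-DRNN outputs replace the fully connected layers and softmax as input to a shallow classifier, while `predict_proba` corresponds to the end-to-end path. With untrained weights the probabilities carry no meaning; the code only illustrates the data flow.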


Journal: International Journal of Medical Informatics
Publication Date: May 23, 2019
Citations: 20

Similar Papers

Automatic Hypernasality Detection in Cleft Palate Speech Using CNN
Xiyue Wang ... Heng Yin
Circuits, Systems, and Signal Processing | VOL. 38
20 May 2019

Automatic detection of consonant omission in cleft palate speech
Ling He ... Xiyue Wang
International Journal of Speech Technology | VOL. 22
03 Dec 2018

How Early Can We Predict the Need for VPI Surgery?
Veera V Pitkänen ... Suvi A Alaluusua
Plastic and Reconstructive Surgery - Global Open | VOL. 10
21 Nov 2022

Automatic evaluation of hypernasality and speech intelligibility for children with cleft palate
Ling He ... Qi Liu
01 Jun 2013
