Recognizing Whispered Speech Produced by an Individual with Surgically Reconstructed Larynx Using Articulatory Movement Data.

Beiming Cao,Myungjong Kim,Jun Wang,Ted Mau

doi:10.21437/slpat.2016-14

Abstract

Individuals with larynx (vocal folds) impaired have problems in controlling their glottal vibration, producing whispered speech with extreme hoarseness. Standard automatic speech recognition using only acoustic cues is typically ineffective for whispered speech because the corresponding spectral characteristics are distorted. Articulatory cues such as the tongue and lip motion may help in recognizing whispered speech since articulatory motion patterns are generally not affected. In this paper, we investigated whispered speech recognition for patients with reconstructed larynx using articulatory movement data. A data set with both acoustic and articulatory motion data was collected from a patient with surgically reconstructed larynx using an electromagnetic articulograph. Two speech recognition systems, Gaussian mixture model-hidden Markov model (GMM-HMM) and deep neural network-HMM (DNN-HMM), were used in the experiments. Experimental results showed adding either tongue or lip motion data to acoustic features such as mel-frequency cepstral coefficient (MFCC) significantly reduced the phone error rates on both speech recognition systems. Adding both tongue and lip data achieved the best performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Recognizing Whispered Speech Produced by an Individual with Surgically Reconstructed Larynx Using Articulatory Movement Data.

Abstract

Talk to us

Similar Papers

More From: Workshop on Speech and Language Processing for Assistive Technologies

Lead the way for us

Journal: Workshop on Speech and Language Processing for Assistive Technologies	Publication Date: Sep 13, 2016
Citations: 12

Similar Papers

Determining an Optimal Set of Flesh Points on Tongue, Lips, and Jaw for Continuous Silent Speech Recognition
Jun Wang ... Seongjun Hahm
-
Jun Wang, et. al.Jun Wang ... Seongjun Hahm
01 Jan 2015
01 Jan 2015

Bilingual Speech Recognition based on Deep Neural Networks and Directed Acyclic Word Graphs
Rohith Gowtham Kodali ... Durga Prasad Manukonda
-
Rohith Gowtham Kodali, et. al.Rohith Gowtham Kodali ... Durga Prasad Manukonda
01 Nov 2019
01 Nov 2019

Turkish Speech Recognition Based On Deep Neural Networks
Ussen Abre Kimanuka ... Osman Buyuk
Süleyman Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi | VOL. 22
Ussen Abre Kimanuka, et. al.Ussen Abre Kimanuka ... Osman Buyuk
05 Sep 2018
Süleyman Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi | VOL. 22

Using Auxiliary Sources of Knowledge for Automatic Speech Recognition

-

01 Jan 2004
01 Jan 2004

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Recognizing Whispered Speech Produced by an Individual with Surgically Reconstructed Larynx Using Articulatory Movement Data.

Abstract

Talk to us

Similar Papers

More From: Workshop on Speech and Language Processing for Assistive Technologies