Regularized Speaker Adaptation of KL-HMM for Dysarthric Speech Recognition.

Myungjong Kim,Hoirin Kim,Joohong Yoo,Younggwan Kim,Jun Wang

doi:10.1109/tnsre.2017.2681691

Abstract

This paper addresses the problem of recognizing the speech uttered by patients with dysarthria, which is a motor speech disorder impeding the physical production of speech. Patients with dysarthria have articulatory limitation, and therefore, they often have trouble in pronouncing certain sounds, resulting in undesirable phonetic variation. Modern automatic speech recognition systems designed for regular speakers are ineffective for dysarthric sufferers due to the phonetic variation. To capture the phonetic variation, Kullback-Leibler divergence-based hidden Markov model (KL-HMM) is adopted, where the emission probability of state is parameterized by a categorical distribution using phoneme posterior probabilities obtained from a deep neural network-based acoustic model. To further reflect speaker-specific phonetic variation patterns, a speaker adaptation method based on a combination of L2 regularization and confusion-reducing regularization, which can enhance discriminability between categorical distributions of the KL-HMM states while preserving speaker-specific information is proposed. Evaluation of the proposed speaker adaptation method on a database of several hundred words for 30 speakers consisting of 12 mildly dysarthric, 8 moderately dysarthric, and 10 non-dysarthric control speakers showed that the proposed approach significantly outperformed the conventional deep neural network-based speaker adapted system on dysarthric as well as non-dysarthric speech.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Regularized Speaker Adaptation of KL-HMM for Dysarthric Speech Recognition.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Systems and Rehabilitation Engineering

Lead the way for us

Journal: IEEE Transactions on Neural Systems and Rehabilitation Engineering	Publication Date: Mar 13, 2017
Citations: 86

Similar Papers

Neural network-based clustering model of ischemic stroke patients with a maximally distinct distribution of 1-year vascular outcomes.
Joon-Tae Kim ... Nu Ri Kim
Scientific Reports | VOL. 12
Joon-Tae Kim, et. al.Joon-Tae Kim ... Nu Ri Kim
08 Jun 2022
Scientific Reports | VOL. 12

Detecting autism from picture book narratives using deep neural utterance embeddings.
Aleksander Wawer ... Izabela Chojnicka
International Journal of Language & Communication Disorders | VOL. 57
Aleksander Wawer, et. al.Aleksander Wawer ... Izabela Chojnicka
12 May 2022
International Journal of Language & Communication Disorders | VOL. 57

Effective speaker adaptations for speaker verification
Sungjoo Ahn ... Sunmee Kang
-
Sungjoo Ahn, et. al. Sungjoo Ahn ... Sunmee Kang
05 Jun 2000
05 Jun 2000

Speaker-adaptive speech recognition using speaker diarization for improved transcription of large spoken archives
Petr Cerva ... Ladislav Seps
Speech Communication | VOL. 55
Petr Cerva, et. al.Petr Cerva ... Ladislav Seps
08 Jul 2013
Speech Communication | VOL. 55

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Regularized Speaker Adaptation of KL-HMM for Dysarthric Speech Recognition.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Systems and Rehabilitation Engineering