A posteriori and a priori transformations for speaker adaptation in large vocabulary speech recognition systems

Driss Matrouf,Pascal Nocera,Olivier Bellot,Jean-Francois Bonastre,Georges Linares

doi:10.21437/eurospeech.2001-323

Abstract

The speaker-dependent HMM-based recognizers gives lower word error rates in comparison with the corresponding speaker-independent recognizers. The aim of speaker adaptation techniques is to enhance the speakerindependent acoustic models to bring their recognition accuracy as close as possible to the one obtained with speaker-dependent models. In this paper, we propose a method using test and training data for acoustic model adaptation. This method operates in two steps. The first one performs an a priori adaptation using the transcribed training data of the closest training speakers to the test speaker. This adaptation is done with MAP procedure allowing reduced variances in the acoustic models. The second one performs an a posteriori adaptation using the MLLR procedure on the test data, allowing mapping of Gaussians means to match the test speaker’s acoustic space. This adaptation strategy was evaluated in a large vocabulary speech recognition task. Our method leads to a relative gain of 15% with respect to the baseline system and 10% with respect to the conventional MLLR adaptation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A posteriori and a priori transformations for speaker adaptation in large vocabulary speech recognition systems

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Building Acoustic and Language Model for Continuous Speech Recognition in Bahasa Indonesia
Andreas Widjaja ... Vincent Elbert Budiman
Jurnal Teknik Informatika dan Sistem Informasi | VOL. 6
Andreas Widjaja, et. al.Andreas Widjaja ... Vincent Elbert Budiman
10 Aug 2020
Jurnal Teknik Informatika dan Sistem Informasi | VOL. 6

Comparison of ML, MAP, and VB based acoustic models in large vocabulary speech recognition
Panu Juhani Somervuo
-
Panu Juhani SomervuoPanu Juhani Somervuo
04 Oct 2004
04 Oct 2004

An experimental study on structural-MAP approaches to implementing very large vocabulary speech recognition systems for real-world tasks
I-Fan Chen ... Seokyong Moon
-
I-Fan Chen, et. al.I-Fan Chen ... Seokyong Moon
01 Oct 2013
01 Oct 2013

Evolution-Strategy-Based Automation of System Development for High-Performance Speech Recognition
Takafumi Moriya ... Shinji Watanabe
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 27
Takafumi Moriya, et. al.Takafumi Moriya ... Shinji Watanabe
01 Jan 2019
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 27

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A posteriori and a priori transformations for speaker adaptation in large vocabulary speech recognition systems

Abstract

Talk to us

Similar Papers