Combined speech and speaker recognition with speaker-adapted connectionist models

Dominique Genoud ,Daniel P W Ellis ,Nelson Morgan

doi:10.7916/d85b0bvc

Abstract

One approach to speaker adaptation for the neural-network acoustic models of a hybrid connectionist-HMM speech recognizer is to adapt a speaker-independent network by performing a small amount of additional training using data from the target speaker, giving an acoustic model specifically tuned to that speaker. This adapted model might be useful for speaker recognition too, especially since state-of-the-art speaker recognition typically performs a speech-recognition labelling of the input speech as a first stage. However, in order to exploit the discriminant nature of the neural nets, it is better to train a single model to discriminate both between the different phone classes (as in conventional speech recognition) and between the target speaker and the ‘rest of the world’ (a common approach to speaker recognition). We present the results of using such an approach for a set of 12 speakers selected from the DARPA/NIST Broadcast News corpus. The speaker-adapted nets showed a 17% relative improvement in worderror rate on their target speakers, and were able to identify among the 12 speakers with an average equal-error rate of 6.6%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Combined speech and speaker recognition with speaker-adapted connectionist models

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Speaker adaptation of neural network acoustic models using i-vectors
George Saon ... Michael Picheny
-
George Saon, et. al.George Saon ... Michael Picheny
01 Dec 2013
01 Dec 2013

Asynchronous factorisation of speaker and background with feature transforms in speech recognition
Oscar Saz ... Thomas Hain
-
Oscar Saz, et. al.Oscar Saz ... Thomas Hain
25 Aug 2013
25 Aug 2013

Towards utterance-based neural network adaptation in acoustic modeling
Ivan Himawan ... Petr Motlicek
-
Ivan Himawan, et. al.Ivan Himawan ... Petr Motlicek
01 Dec 2015
01 Dec 2015

Multi-Turn RNN-T for Streaming Recognition of Multi-Party Speech
Ilya Sklyar ... Xianrui Zheng
-
Ilya Sklyar, et. al.Ilya Sklyar ... Xianrui Zheng
23 May 2022
23 May 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Combined speech and speaker recognition with speaker-adapted connectionist models

Abstract

Talk to us

Similar Papers