Speaker verification based processing for robust ASR in co-channel speech scenarios

Seyed Omid Sadjadi, Larry P Heck

doi:10.1109/icassp.2014.6853903

Abstract

Co-channel speech, which occurs in monaural audio recordings of two or more overlapping talkers, poses a great challenge for automatic speech applications. Automatic speech recognition (ASR) performance, in particular, has been shown to degrade significantly in the presence of a competing talker. In this paper, assuming a known target talker scenario, we present two masking strategies based on speaker verification to alleviate the impact of competing talker (a.k.a. masker) interference on ASR performance. In the first approach, frame-level speaker verification likelihoods are used as reliability measures that control the degree to which each frame contributes to the Viterbi search. In the second approach, time-frequency (T-F) level speaker verification scores form soft masks for speech separation. The effectiveness of the two strategies, both individually and in combination, is evaluated in ASR tasks on speech mixtures at signal-to-interference ratios (SIRs) ranging from 6 dB to -9 dB. Experimental results indicate that the proposed speaker verification based solutions mitigate the impact of competing talker interference on ASR performance. Combining the two masking techniques yields reductions in word error rate as large as 43%.
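The two masking ideas can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how scores are mapped to masks or how frame reliabilities enter the Viterbi search, so the sigmoid mapping, the `slope` parameter, the multiplicative weighting of log-likelihoods, and all function names below are assumptions made for illustration only.

```python
import math


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def soft_mask(tf_scores, slope=1.0):
    """Map per-cell T-F verification scores to weights in (0, 1).

    Assumption: scores behave like log-likelihood ratios, so a score of 0
    means "undecided" and maps to a weight of 0.5.
    """
    return [[sigmoid(slope * s) for s in frame] for frame in tf_scores]


def apply_mask(spectrogram, mask):
    """Attenuate each T-F cell of a magnitude spectrogram by its mask weight."""
    return [[m * x for m, x in zip(mask_row, spec_row)]
            for mask_row, spec_row in zip(mask, spectrogram)]


def weight_frame_loglikes(loglikes, reliabilities):
    """Scale each frame's per-state acoustic log-likelihoods by a reliability in [0, 1].

    Assumption: reliability acts as an exponent on the likelihood, i.e. a
    multiplier on the log-likelihood, before the Viterbi search uses it.
    """
    return [[r * ll for ll in frame] for frame, r in zip(loglikes, reliabilities)]


# Toy example: one frame with two frequency bins, then two frames with two states.
mask = soft_mask([[0.0, 10.0]])
separated = apply_mask([[2.0, 4.0]], [[0.5, 1.0]])
weighted = weight_frame_loglikes([[-2.0, -4.0], [-1.0, -3.0]], [1.0, 0.5])
```

Under this weighting, a frame with reliability close to 0 contributes almost nothing to any path score, so the search in that frame would be driven mainly by transition probabilities, while a frame with reliability 1 is left unchanged.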

Similar Papers

Estimation of speech recognition performance in noisy and reverberant environments using PESQ score and acoustic parameters
Takahiro Fukumori ... Takanobu Nishiura
01 Oct 2013

Novel speech processing techniques for robust automatic speech recognition
01 Jan 2006

Analytic assessment of telephone transmission impact on ASR performance using a simulation model
Sebastian Möller ... Hervé Bourlard
Speech Communication | VOL. 38
08 Mar 2002

Investigation of Automatic Speech Recognition Performance and Mean Opinion Scores for Different Standard Speech and Audio Codecs
A V Ramana ... Mythili Sharan Pala
IETE Journal of Research | VOL. 58
01 Mar 2012
