Enabling improved speaker recognition by voice quality estimation

Anthony L Bartos,Douglas J Nelson

doi:10.1109/acssc.2011.6190071

Abstract

Presented is a method to mitigate noise and interference in automated speaker identification (SID). This process uses the MIT/LL SID module without modifications. In this process, speaker models are built for a lattice of signal to noise ratio (SNR) levels. The SNR of the received signal is estimated by first applying speech activity detection to identify portions of the signal that actually contain speech. A voice quality estimation process is then applied to estimate the SNR of the received signal. The speaker models representing the SNR of the received signal are dynamically loaded, and conventional SID is applied. In training, the SNR of each training signal is estimated, and the signal is modified by adding noise to create a signal at the desired SNR. Using this process, each signal may be used to train models at any SNR level less than or equal to the SNR of the original signal. The process has been fully implemented and is completely automated.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Enabling improved speaker recognition by voice quality estimation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Feature classification criterion for missing features mask estimation in robust speaker recognition
Dayana Ribas González ... José Ramón Calvo De Lara
Signal, Image and Video Processing | VOL. 8
Dayana Ribas González, et. al.Dayana Ribas González ... José Ramón Calvo De Lara
20 Mar 2012
Signal, Image and Video Processing | VOL. 8

Noise-robust speech triage.
Anthony L Bartos ... Petr Schwarz
The Journal of the Acoustical Society of America | VOL. 143
Anthony L Bartos, et. al.Anthony L Bartos ... Petr Schwarz
01 Apr 2018
The Journal of the Acoustical Society of America | VOL. 143

Speaker identification using convolutional-long short-term memory neural networks
Serkan Tokgoz ... Issa M Panahi
The Journal of the Acoustical Society of America | VOL. 146
Serkan Tokgoz, et. al.Serkan Tokgoz ... Issa M Panahi
01 Oct 2019
The Journal of the Acoustical Society of America | VOL. 146

IDENTIFIKASI SUARA MENGGUNAKAN METODE MEL FREQUENCY CEPSTRUM COEFFICIENTS (MFCC) DAN JARINGAN SYARAF TIRUAN BACKPROPAGATION
Abdullah Zainuddin ... Sudi Mariyanto Sasongko
DIELEKTRIKA | VOL. 7
Abdullah Zainuddin, et. al.Abdullah Zainuddin ... Sudi Mariyanto Sasongko
29 Feb 2020
DIELEKTRIKA | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Enabling improved speaker recognition by voice quality estimation

Abstract

Talk to us

Similar Papers