Speaker Recognition Based on Multilevel Speech Signal Analysis on Polish Corpus

Szymon Drgas,Adam Dabrowski

doi:10.1007/978-3-642-30721-8_9

Abstract

AbstractThis article deals with a new approach to the text-independent speaker verification task. It is namely proposed to combine spectral and the so-called high-level features (prosodic, articulatory, and lexical) in order increase accuracy of speaker verification. The presented experiments were performed using a Polish language corpus called PUEPS. It contains semi-spontaneous telephone conversations (acted emergency telephone notifications) recorded in laboratory conditions. As the Polish language is under resourced and the PUEPS corpus is relatively small, another approach is needed than these known from the well known NIST evaluations. The authors proposed to use the fast scoring instead of more complex classifiers and the AdaBoost algorithm for features combination. Combination of features resulted in equal error rate (EER) reduction for various SNR conditions.KeywordsSpeaker recognitionhigh-level featureskernel combinationboosting

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speaker Recognition Based on Multilevel Speech Signal Analysis on Polish Corpus

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Speaker recognition based on multilevel speech signal analysis on Polish corpus
Szymon Drgas ... Adam Dabrowski
Multimedia Tools and Applications | VOL. 74
Szymon Drgas, et. al.Szymon Drgas ... Adam Dabrowski
25 May 2013
Multimedia Tools and Applications | VOL. 74

Speaker verification based on fusion of acoustic and articulatory information
Ming Li ... Vikram Ramanarayanan
-
Ming Li, et. al.Ming Li ... Vikram Ramanarayanan
25 Aug 2013
25 Aug 2013

Efficient text-independent speaker verification with structural gaussian mixture models and neural network
Bing Xiang ... T Berger
IEEE Transactions on Speech and Audio Processing | VOL. 11
Bing Xiang, et. al. Bing Xiang ... T Berger
01 Sep 2003
IEEE Transactions on Speech and Audio Processing | VOL. 11

Emotion attribute projection for speaker recognition on emotional speech
Huanjun Bao ... Thomas Fang Zheng
-
Huanjun Bao, et. al.Huanjun Bao ... Thomas Fang Zheng
27 Aug 2007
27 Aug 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speaker Recognition Based on Multilevel Speech Signal Analysis on Polish Corpus

Abstract

Talk to us

Similar Papers