Linguistically-constrained formant-based i-vectors for automatic speaker recognition

Javier Franco-Pedroso, Joaquin Gonzalez-Rodriguez

doi:10.1016/j.specom.2015.11.002

Open Access

https://doi.org/10.1016/j.specom.2015.11.002

Journal: Speech Communication
Publication Date: Dec 1, 2015
Citations: 21
License type: publisher-specific-oa

Affiliation: Autonomous University of Madrid

Abstract

This paper presents a large-scale study of the discriminative abilities of formant frequencies for automatic speaker recognition. Exploiting both the static and dynamic information in formant frequencies, we present linguistically-constrained formant-based i-vector systems that provide well-calibrated likelihood ratios for each comparison of occurrences of the same isolated linguistic unit in two given utterances. First, the reported analysis of the discriminative and calibration properties of the different linguistic units provides useful insights, for instance, to forensic phonetic practitioners. It is also shown that the set of most discriminative units varies from speaker to speaker. Second, the linguistically-constrained systems are combined at score level through speaker-independent average and logistic regression fusion rules, exploiting the speaker-distinguishing information spread among the different linguistic units. Testing on the English-only trials of the core condition of the NIST 2006 SRE (24,000 voice comparisons of 5-minute telephone conversations from 517 speakers: 219 male and 298 female), we report equal error rates (EER) of 9.57% and 12.89% for male and female speakers, respectively, using only formant frequencies as speaker-discriminative information. Additionally, when the formant-based system is fused with a cepstral i-vector system, we obtain relative improvements of ∼6% in EER (from 6.54% to 6.13%) and ∼15% in minDCF (from 0.0327 to 0.0279) compared to the cepstral system alone.
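To make the abstract's figures concrete, here is a minimal sketch, not the authors' implementation, of three ideas it mentions: the relative-improvement arithmetic, average score-level fusion over linguistic units, and an approximate equal error rate. The function names and the toy scores are hypothetical and only illustrate the mechanics; the logistic regression fusion rule is not shown.

```python
def relative_improvement(baseline, fused):
    """Relative reduction (in %) of an error metric where lower is better."""
    return 100.0 * (baseline - fused) / baseline


def average_fusion(per_unit_scores):
    """Average the scores of the linguistic units available in one comparison."""
    return sum(per_unit_scores) / len(per_unit_scores)


def equal_error_rate(target_scores, nontarget_scores):
    """Approximate EER: sweep thresholds over the observed scores and return the
    mean of miss and false-alarm rates where the two are closest."""
    best_gap, eer = None, None
    for thr in sorted(set(target_scores) | set(nontarget_scores)):
        miss = sum(s < thr for s in target_scores) / len(target_scores)
        fa = sum(s >= thr for s in nontarget_scores) / len(nontarget_scores)
        gap = abs(miss - fa)
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, (miss + fa) / 2
    return eer


# Figures quoted in the abstract (cepstral system alone vs. fused system).
eer_gain = relative_improvement(6.54, 6.13)      # about 6.3 %
dcf_gain = relative_improvement(0.0327, 0.0279)  # about 14.7 %

# Hypothetical per-unit scores for three target and three non-target trials;
# each inner list holds one score per linguistic unit found in both utterances.
target_trials = [[2.0, 1.0], [0.5, -0.1], [1.5]]
nontarget_trials = [[-1.0, -2.0], [0.4, 0.6], [-0.5]]

fused_targets = [average_fusion(t) for t in target_trials]
fused_nontargets = [average_fusion(t) for t in nontarget_trials]
toy_eer = equal_error_rate(fused_targets, fused_nontargets)
```

The two relative gains agree with the rounded ∼6% and ∼15% in the abstract. Averaging copes naturally with trials in which different sets of units are available, which is one plausible reason to use it as a simple fusion rule alongside a trained one.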
