Language Normalization for Bilingual Speaker Recognition Systems

Murat Akbacak,John H.L Hansen

doi:10.1109/icassp.2007.366898

Abstract

In this study, we focus on the problem of removing/normalizing the impact of spoken language variation in bilingual speaker recognition (BSR) systems. In addition to environment, recording, and channel mismatches, spoken language mismatch is an additional factor resulting in performance degradation in speaker recognition systems. In today's world, the number of bilingual speakers is increasing with English becoming the universal second language. Data sparseness is becoming an important research issue to deploy speaker recognition systems with limited resources (e.g., short train/test durations). Therefore, leveraging existing resources from different languages becomes a practical concern in limited-resource BSR applications, and effective language normalization schemes are required to achieve more robust speaker recognition systems. Here, we propose two novel algorithms to address the spoken language mismatch problem: normalization at the utterance-level via language identification (LID), and normalization at the segment-level via multilingual phone recognition (PR). We evaluated our algorithms using a bilingual (Spanish-English) speaker set of 80 speakers. Experimental results show improvements over a baseline system which employs fusion of language-dependent speaker models with fixed weights.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Language Normalization for Bilingual Speaker Recognition Systems

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Exploring the Impact of Mismatch Conditions, Noisy Backgrounds, and Speaker Health on Convolutional Autoencoder-Based Speaker Recognition System with Limited Dataset
Arundhati Niwatkar ... Yuvraj Kanse
ICST Transactions on Scalable Information Systems | VOL. -
Arundhati Niwatkar, et. al.Arundhati Niwatkar ... Yuvraj Kanse
09 Apr 2024
ICST Transactions on Scalable Information Systems | VOL. -

Automatic speaker recognition with enhanced swallow swarm optimization and ensemble classification model from speech signals
Kharibam Jilenkumari Devi ... Khelchandra Thongam
Journal of Ambient Intelligence and Humanized Computing | VOL. -
Kharibam Jilenkumari Devi, et. al.Kharibam Jilenkumari Devi ... Khelchandra Thongam
31 Jul 2019
Journal of Ambient Intelligence and Humanized Computing | VOL. -

Speaker Recognition with VAD
Jian Ling ... Jianwei Zhu
-
Jian Ling, et. al.Jian Ling ... Jianwei Zhu
01 Jun 2009
01 Jun 2009

Your Voice is Not Yours? Black-Box Adversarial Attacks Against Speaker Recognition Systems
Jianbin Ye ... Xiaoyuan Liu
-
Jianbin Ye, et. al.Jianbin Ye ... Xiaoyuan Liu
01 Dec 2022
01 Dec 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Language Normalization for Bilingual Speaker Recognition Systems

Abstract

Talk to us

Similar Papers