Analysis of i-vector framework for speaker identification in TV-shows

Corinne Fredouille,Delphine Charlet

doi:10.21437/interspeech.2014-15

Corinne Fredouille, Delphine Charlet

Open Access

https://doi.org/10.21437/interspeech.2014-15

Copy DOI

Publication Date: Sep 14, 2014
Citations: 3	License type: other-oa

Affiliation: Laboratoire Informatique d'Avignon, Orange (France)

Abstract

Inspired from the Joint Factor Analysis, the I-vector-based analysis has become the most popular and state-of-the-art framework for the speaker verification task. Mainly applied within the NIST/SRE evaluation campaigns, many studies have been proposed to improve more and more performance of speaker verification systems. Nevertheless, while the i-vector framework has been used in other speech processing fields like language recognition, a very few studies have been reported for the speaker identification task on TV shows. This work was done in the REPERE challenge context, focused on the people recognition task in multimodal conditions (audio, video, text) from TV show corpora. Moreover, the challenge participants are invited for providing systems for monomodal tasks, like speaker identification. The application of the i-vector framework is investi-gatedthrough different points of views: (1) some of the i-vector based approaches are compared, (2) a specific i-vector extraction protocol is proposed in order to deal with widely varying amounts of training data among speaker population, (3) the joint use of both speaker diarization and identification is finally analyzed. Based on a 533 speaker dictionary, this joint system wins the monomodal speaker identification task of the 2014 REPERE challenge.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Analysis of i-vector framework for speaker identification in TV-shows

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Bottleneck and Embedding Representation of Speech for DNN-based Language and Speaker Recognition
Alicia Lozano-Diez ... Joaquin Gonzalez-Rodriguez
-
Alicia Lozano-Diez, et. al.Alicia Lozano-Diez ... Joaquin Gonzalez-Rodriguez
21 Nov 2018
21 Nov 2018

Speaker verification: minimizing the channel effects using autoassociative neural network models
S.P Kishore ... B Yegnanarayana
-
S.P Kishore, et. al.S.P Kishore ... B Yegnanarayana
05 Jun 2000
05 Jun 2000

Variational Bayesian Joint Factor Analysis Models for Speaker Verification
Xianyu Zhao ... Yuan Dong
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 20
Xianyu Zhao, et. al.Xianyu Zhao ... Yuan Dong
01 Mar 2012
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 20

Robust speaker verification using GFCC and joint factor analysis
Pranab Das ... Utpal Bhattacharjee
-
Pranab Das, et. al.Pranab Das ... Utpal Bhattacharjee
01 Jul 2014
01 Jul 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Analysis of i-vector framework for speaker identification in TV-shows

Abstract

Talk to us

Similar Papers