On Behaviour of PLDA Models in the Task of Speaker Recognition

Lukáš Machlica,Vlasta Radová

doi:10.1007/978-3-642-40585-3_45

Abstract

Nowadays, Factor analysis based techniques become part of state-of-the-art Speaker Recognition (SR) systems. These are the Joint Factor Analysis, its modified version called the concept of i-vectors, and the Probabilistic Linear Discriminant Analysis (PLDA). PLDA, as a generative statistical model, is usually used as the back end of a SR system, e.g. once i-vectors have been extracted, a PLDA model is used in the i-vector space to provide a verification score of two given i-vectors. In order to train the system huge amount of development data are utilized. In this paper the behaviour of the PLDA model is investigated. It is shown how does the amount of development data influence the system’s performance. PLDA has several parameters to be tuned, i.e. dimensions of latent variables/subspaces, which represent the speaker and the channel variabilities. These will be examined too.

Full Text