Abstract
In this paper, Individual Dimension Gaussian Mixture Model (IDGMM) is proposed for speaker identification. As to the training-purpose feature vector series of a certain register, its joint probability distribution function (PDF) of is modeled by the product of the PDF of each dimension (marginal PDF), the scalar-based Gaussian Mixture Model (GMM) serving as the marginal PDF. For a good discriminative capability, the decorrelation by Schmidt orthogonalization and the Mixture Component Number (MCN) decision are adopted during the train. A close-set text-independent speaker identification experiment is also given. The simulation result shows that the IDGMM accelerates the training process remarkably and maintains the discriminative capability in testing process.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have