Abstract

This paper compares a set of large-scale support vector machine (SVM) training algorithms for language and speaker recognition tasks. We analyze five approaches for training phonetic and acoustic SVM models for language recognition. We compare the performance of these approaches as a function of the training time each requires to reach convergence, and we discuss their scalability to large corpora. Two of these algorithms can be used in speaker recognition to train an SVM that classifies pairs of utterances as belonging either to the same speaker or to two different speakers. Our results show that the accuracy of these algorithms is asymptotically equivalent, but their behavior with respect to the time required to converge differs. Some of these algorithms not only scale linearly with the training set size, but are also able to give their best results after just a few iterations. State-of-the-art performance has been obtained on the female subset of the NIST 2010 Speaker Recognition Evaluation extended core test using a single SVM system.
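To make the pair-classification setup concrete, the following is a minimal illustrative sketch, not the paper's actual system: it assumes utterances have already been mapped to fixed-length feature vectors (e.g., supervectors), combines each pair of vectors into a symmetric pair representation, and trains a linear SVM to label pairs as same-speaker or different-speaker. The feature combination, the synthetic data, and the use of scikit-learn's LinearSVC are all assumptions made for illustration only.

# Illustrative sketch: linear SVM over utterance pairs (same vs. different speaker).
import numpy as np
from sklearn.svm import LinearSVC

rng = np.random.default_rng(0)

def pair_features(x, y):
    # Symmetric pair representation (element-wise product and squared
    # difference), so the order of the two utterances does not matter.
    return np.concatenate([x * y, (x - y) ** 2])

# Synthetic stand-in data: 200 "utterance" vectors from 20 "speakers".
dim, n_speakers, per_speaker = 50, 20, 10
centers = rng.normal(size=(n_speakers, dim))
utts = np.vstack([c + 0.3 * rng.normal(size=(per_speaker, dim)) for c in centers])
labels = np.repeat(np.arange(n_speakers), per_speaker)

# Build training pairs: label 1 if both utterances share a speaker, else 0.
X, y = [], []
for _ in range(5000):
    i, j = rng.integers(len(utts), size=2)
    X.append(pair_features(utts[i], utts[j]))
    y.append(int(labels[i] == labels[j]))
X, y = np.array(X), np.array(y)

# class_weight="balanced" compensates for same-speaker pairs being rare.
clf = LinearSVC(C=1.0, class_weight="balanced", max_iter=10000).fit(X, y)
print("training accuracy:", clf.score(X, y))

In practice, the training algorithms compared in the paper would replace the off-the-shelf solver used here; the sketch only shows how the same/different-speaker decision can be cast as a binary SVM problem over pairs.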

