Dialect/Accent Classification Using Unrestricted Audio

Rongqing Huang,John H L Hansen,Pongtep Angkititrakul

doi:10.1109/tasl.2006.881695

Abstract

This study addresses novel advances in English dialect/accent classification. A word-based modeling technique is proposed that is shown to outperform a large vocabulary continuous speech recognition (LVCSR)-based system with significantly less computational costs. The new algorithm, which is named Word-based Dialect Classification (WDC), converts the text-independent decision problem into a text-dependent decision problem and produces multiple combination decisions at the word level rather than making a single decision at the utterance level. The basic WDC algorithm also provides options for further modeling and decision strategy improvement. Two sets of classifiers are employed for WDC: a word classifier DW(k) and an utterance classifier D u. DW(k) is boosted via the AdaBoost algorithm directly in the probability space instead of the traditional feature space. Du is boosted via the dialect dependency information of the words. For a small training corpus, it is difficult to obtain a robust statistical model for each word and each dialect. Therefore, a context adapted training (CAT) algorithm is formulated, which adapts the universal phoneme Gaussian mixture models (GMMs) to dialect-dependent word hidden Markov models (HMMs) via linear regression. Three separate dialect corpora are used in the evaluations that include the Wall Street Journal (American and British English), NATO N4 (British, Canadian, Dutch, and German accent English), and IViE (eight British dialects). Significant improvement in dialect classification is achieved for all corpora tested

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Dialect/Accent Classification Using Unrestricted Audio

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech and Language Processing

Lead the way for us

Journal: IEEE Transactions on Audio, Speech and Language Processing	Publication Date: Feb 1, 2007
Citations: 78

Similar Papers

Integrate template matching and statistical modeling for continuous speech recognition
Xie Sun
-
Xie SunXie Sun
01 Jan 2010
01 Jan 2010

Dialect/Accent Classification via Boosted Word Modeling
Rongqing Huang ... J.H.L Hansen
-
Rongqing Huang, et. al. Rongqing Huang ... J.H.L Hansen
18 Mar 2005
18 Mar 2005

WSJCAMO: a British English speech corpus for large vocabulary continuous speech recognition
T Robinson ... J Fransen
-
T Robinson, et. al.T Robinson ... J Fransen
09 May 1995
09 May 1995

Integrated exemplar-based template matching and statistical modeling for continuous speech recognition
Xie Sun ... Yunxin Zhao
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2014
Xie Sun, et. al.Xie Sun ... Yunxin Zhao
01 Feb 2014
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dialect/Accent Classification Using Unrestricted Audio

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech and Language Processing