Abstract
Most large vocabulary continuous speech recognition (LVCSR) systems in the past decade have used a context-dependent (CD) phone as the fundamental acoustic unit. We present one of the first robust LVCSR systems that uses a syllable-level acoustic unit for LVCSR on telephone-bandwidth speech. This effort is motivated by the inherent limitations in phone-based approaches, namely the lack of an easy and efficient way to model long-term temporal dependencies. A syllable unit spans a longer time frame, typically three phones, thereby offering a more parsimonious framework for modeling pronunciation variation in spontaneous speech. We present encouraging results which show that a syllable-based system exceeds the performance of a comparable triphone system both in terms of word error rate (WER) and complexity. The WER of the best syllabic system reported here is 49.1% on a standard Switchboard evaluation, a small improvement over the triphone system. We also report results on a much smaller recognition task, OGI Alphadigits, which was used to validate some of the benefits syllables offer over triphones. The syllable-based system exceeds the performance of the triphone system by nearly 20%, an impressive accomplishment since the alphadigits application consists mostly of phone-level minimal pair distinctions.