A split lexicon approach for improved recognition of spoken names

Abhinav Sethy,Shrikanth Narayanan,S Parthasarthy

doi:10.1016/j.specom.2006.03.005

Abstract

Recognition of spoken names is a challenging task for automatic speech recognition systems because the list of names for applications such as directory assistance tends to be in the order of several hundred thousands. This makes spoken name recognition a very high perplexity task. In this paper we propose the use of syllables as the acoustic unit for spoken name recognition based on reverse lookup schemes and show how syllables can be used to improve recognition performance and reducing the system perplexity. We present system design methodologies to address the problem of acoustic-training data sparsity encountered when using longer length units such as syllables. We illustrate our ideas first on a TIMIT based continuous speech recognition problem and then focus on the application of these ideas to spoken name recognition. Our results on the OGI spoken name corpus indicate that using syllables in place of phoneme models can help boost system accuracy significantly while helping to reduce the system complexity.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A split lexicon approach for improved recognition of spoken names

Abstract

Talk to us

Similar Papers

More From: Speech Communication

Lead the way for us

Journal: Speech Communication	Publication Date: May 5, 2006
Citations: 152

Similar Papers

End-to-End Named Entity Recognition from English Speech
Hemant Yadav ... Yi Yu
-
Hemant Yadav, et. al.Hemant Yadav ... Yi Yu
25 Oct 2020
25 Oct 2020

Leveraging word confusion networks for named entity modeling and detection from conversational telephone speech
Gakuto Kurata ... Bhuvana Ramabhadran
Speech Communication | VOL. 54
Gakuto Kurata, et. al.Gakuto Kurata ... Bhuvana Ramabhadran
11 Nov 2011
Speech Communication | VOL. 54

Named entity recognition from Conversational Telephone Speech leveraging Word Confusion Networks for training and recognition
Gakuto Kurata ... Abhinav Sethy
-
Gakuto Kurata, et. al.Gakuto Kurata ... Abhinav Sethy
01 May 2011
01 May 2011

AISHELL-NER: Named Entity Recognition from Chinese Speech
Boli Chen ... Fei Huang
-
Boli Chen, et. al.Boli Chen ... Fei Huang
23 May 2022
23 May 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A split lexicon approach for improved recognition of spoken names

Abstract

Talk to us

Similar Papers

More From: Speech Communication