Integrated recognition of words and prosodic phrase boundaries

F Gallwitz,H Niemann,E Nöth,V Warnke

doi:10.1016/s0167-6393(01)00027-9

Abstract

In this paper, we present an integrated approach for recognizing both the word sequence and the syntactic–prosodic structure of a spontaneous utterance. The approach aims at improving the performance of the understanding component of speech understanding systems by exploiting not only acoustic–phonetic and syntactic information, but also prosodic information directly within the speech recognition process. Whereas spoken utterances are typically modelled as unstructured word sequences in the speech recognizer, our approach includes phrase boundary information in the language model and provides HMMs to model the acoustic and prosodic characteristics of phrase boundaries. This methodology has two major advantages compared to purely word-based speech recognizers. First, additional syntactic–prosodic boundaries are determined by the speech recognizer which facilitates parsing and resolve syntactic and semantic ambiguities. Second – after having removed the boundary information from the result of the recognizer – the integrated model yields a 4% relative word error rate (WER) reduction compared to a traditional word recognizer. The boundary classification performance is equal to that of a separate prosodic classifier operating on the word recognizer output, thus making a separate classifier unnecessary for this task and saving the computation time involved. Compared to the baseline word recognizer, the integrated word-and-boundary recognizer does not involve any computational overhead.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Integrated recognition of words and prosodic phrase boundaries

Abstract

Talk to us

Similar Papers

More From: Speech Communication

Lead the way for us

Journal: Speech Communication	Publication Date: Oct 30, 2001
Citations: 34

Similar Papers

Subband Temporal Envelope Features and Data Augmentation for End-to-end Recognition of Distant Conversational Speech
Cong-Thanh Do
-
Cong-Thanh DoCong-Thanh Do
01 May 2019
01 May 2019

Improved recognition by combining different features and different systems

-

01 Jan 1999
01 Jan 1999

Improving RNN Transducer Modeling for End-to-End Speech Recognition
Jinyu Li ... Rui Zhao
-
Jinyu Li, et. al.Jinyu Li ... Rui Zhao
13 Oct 2019
13 Oct 2019

Inversion-based nonlinear adaptation of noisy acoustic parameters for a neural/HMM speech recognizer
Edmondo Trentin ... Marco Gori
Neurocomputing | VOL. 70
Edmondo Trentin, et. al.Edmondo Trentin ... Marco Gori
27 Jun 2006
Neurocomputing | VOL. 70

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Integrated recognition of words and prosodic phrase boundaries

Abstract

Talk to us

Similar Papers

More From: Speech Communication