Abstract

The relationships between phoneme recognition accuracy and sentence recognition accuracy for continuous speech recognition using a simulated phoneme recognizer are described. A left‐to‐right and top‐down parser based on Earley's algorithm was used as a syntactic parser. The beam search technique was embedded in this parser to reduce the search space. The syntactic constraints were represented by a context‐free grammar. The word lattice for an utterance was generated by a word spotting algorithm from an ambiguous phoneme sequence. The input to the parser was the word lattice of spotted words that consisted of a group of four terms: word name, beginning point, ending point. and matched score. Three tasks, “connected word (digits or city names),” “computer network (about 250 words),” and “inquiries on UNIX (about 500 words)” were used for the evaluation. From experiments, the conclusions are as follows. (1) It is difficult to recognize continuous speech without syntactic knowledge, (2) The search space ought to be large for managing an ambiguous input. (3) The sentence recognition rate is not so sensitive to vocabulary size.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.