Abstract

This article investigates a novel multi-stage approach to spoken language understanding (SLU), with an application to a pioneering Thai spoken dialogue system in a hotel reservation domain. Given an input word string, the system determines a goal and its concept-values in three stages: concept extraction, goal identification, and concept-value recognition. Concept extraction uses weighted finite-state transducers (WFSTs) to extract concepts from the word string. Given the extracted concepts, the goal of the utterance is identified using a pattern classifier. For the identified goal, the necessary concept-values are then recognized from the WFST outputs produced in the concept extraction stage. A new logical N-gram model, which combines the conventional N-gram parser with a regular grammar, is evaluated for concept extraction and concept-value recognition. Several classifiers are optimized and compared for goal identification. An advantage of the proposed SLU model is that it can be trained on a partially annotated corpus, in which only the relevant keywords and the goal of each training utterance need to be labeled. Although the proposed model is evaluated only on the Thai hotel reservation system, the SLU approach itself is general and is expected to be applicable to other languages once training data are available.
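The three-stage flow described above can be illustrated with a minimal sketch. It is not the authors' implementation: regular expressions stand in for the WFSTs and the logical N-gram model, a simple overlap count stands in for the trained goal classifier, and the concept names, goal names, and English example utterances are all hypothetical.

```python
import re
from typing import Dict, List, Tuple

# Hypothetical concept patterns; plain regexes stand in for the WFSTs.
CONCEPT_PATTERNS: Dict[str, str] = {
    "num_rooms": r"(\d+) rooms?",
    "num_nights": r"(\d+) nights?",
    "room_type": r"(single|double) room",
}

# Hypothetical goals, each listing the concepts it needs.
GOAL_CONCEPTS: Dict[str, List[str]] = {
    "reserve_room": ["num_rooms", "num_nights"],
    "ask_price": ["room_type"],
}


def extract_concepts(utterance: str) -> Dict[str, str]:
    """Stage 1: find concepts in the word string and keep each matched value."""
    found: Dict[str, str] = {}
    for concept, pattern in CONCEPT_PATTERNS.items():
        match = re.search(pattern, utterance)
        if match:
            found[concept] = match.group(1)
    return found


def identify_goal(concepts: Dict[str, str]) -> str:
    """Stage 2: choose the goal whose required concepts overlap most with
    the extracted ones (a stand-in for a trained pattern classifier)."""
    return max(
        GOAL_CONCEPTS,
        key=lambda goal: len(set(GOAL_CONCEPTS[goal]) & set(concepts)),
    )


def recognize_values(goal: str, concepts: Dict[str, str]) -> Dict[str, str]:
    """Stage 3: keep only the values of concepts the chosen goal needs."""
    return {c: concepts[c] for c in GOAL_CONCEPTS[goal] if c in concepts}


def understand(utterance: str) -> Tuple[str, Dict[str, str]]:
    """Run the three stages in order and return (goal, concept-values)."""
    concepts = extract_concepts(utterance)
    goal = identify_goal(concepts)
    return goal, recognize_values(goal, concepts)
```

Calling `understand("i want 2 rooms for 3 nights")` in this sketch yields the goal `reserve_room` with the values `num_rooms = "2"` and `num_nights = "3"`. Note that stage 3 reuses the stage 1 outputs rather than re-parsing the utterance, mirroring the abstract's statement that concept-values are recognized from the outputs of the concept extraction stage.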
