Abstract

Task classification is an important subproblem of Spoken Language Understanding (SLU) in automated systems that provide a natural language user interface; its goal is to identify the topic of a user's query. This paper presents a combination of multiple statistical classifiers to improve the accuracy of task classification in the domain of city public transportation information inquiries. Three typical types of statistical classifiers are trained on the same data to serve as the base classifiers of the combination system: a naïve Bayes classifier, an n-gram model, and support vector machines. A two-stage classification method is employed to combine them and yield better overall performance. Our experiments showed that support vector machines substantially outperform the other base classifiers for task classification in our domain. Comparative experiments between two-stage classification and a voting strategy indicated that, when the best base classifier clearly outperforms the other base classifiers, two-stage classification was more effective and could produce better results than the best component classifier.

Keywords: Support Vector Machine, Task Classification, Chinese Character, Base Classifier, Weighted Vote
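
The two combination strategies compared in the abstract can be illustrated with a minimal sketch. The abstract does not specify how the second stage is built, so the version below assumes one common reading: the second stage learns a mapping from the tuple of base-classifier outputs to the correct task label. The function names, the task labels ("route", "fare", "time"), the weights, and the training tuples are all hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict


def weighted_vote(predictions, weights):
    """Voting strategy: sum each base classifier's weight on the label it chose."""
    scores = defaultdict(float)
    for name, label in predictions.items():
        scores[label] += weights[name]
    return max(scores, key=scores.get)


def train_second_stage(base_outputs, gold_labels):
    """Assumed second stage: for every tuple of base predictions seen in
    training, remember the gold label that most often accompanied it."""
    table = defaultdict(Counter)
    for outputs, gold in zip(base_outputs, gold_labels):
        table[outputs][gold] += 1
    return {outputs: counts.most_common(1)[0][0] for outputs, counts in table.items()}


def two_stage_predict(table, outputs, fallback_index):
    """Use the learned mapping; for unseen tuples fall back to one base classifier."""
    return table.get(outputs, outputs[fallback_index])


# Hypothetical outputs, ordered as (naive Bayes, n-gram, SVM).
train_outputs = [
    ("route", "route", "fare"),
    ("route", "route", "fare"),
    ("time", "time", "time"),
]
train_gold = ["fare", "fare", "time"]
stage_two = train_second_stage(train_outputs, train_gold)

query = {"nb": "route", "ngram": "route", "svm": "fare"}
equal_vote = weighted_vote(query, {"nb": 1.0, "ngram": 1.0, "svm": 1.0})
skewed_vote = weighted_vote(query, {"nb": 0.3, "ngram": 0.3, "svm": 0.9})
two_stage = two_stage_predict(stage_two, ("route", "route", "fare"), fallback_index=2)
```

In this toy setup, an equal-weight vote lets two weaker classifiers outvote the strongest one, whereas the second stage can learn that this particular disagreement pattern usually means the strongest classifier is right. That is one plausible intuition for the abstract's finding, not a reproduction of the paper's method.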
