Using conditional random fields to predict focus word pair in spontaneous spoken English

Xiao Zang,Zhiyong Wu,Jia Jia,Lianhong Cai,Helen Meng

doi:10.21437/interspeech.2014-175

Abstract

This paper addresses the problem of automatically labeling focus word pairs in spontaneous spoken English, where a focus word pair refers to salient part of text or speech and the word motivating it. The prediction of focus word pairs is important for speech applications such as expressive text-tospeech (TTS) synthesis and speech recognition. It can also help in better textual and intention understanding for spoken dialog systems. Traditional approaches such as support vector machines (SVMs) prediction neglect the dependency between words and meet the obstacle of the imbalanced distribution of positive and negative samples of dataset. This paper introduces conditional random fields (CRFs) to the task of automatically predicting focus word pair from lexical, syntactic and semantic features. Furthermore, several new features related to syntactic and semantic information are proposed to achieve better performance. Experiments on the publicly available Switchboard corpus demonstrate that CRF model outperforms the baseline and SVM model for focus word pair prediction, and newly proposed features can further improve performance for CRF based predictor. Specifically, compared to the low recall rate of 11.31% achieved by the SVM model, the proposed CRF based predictor can yield a high recall rate of 70.88% with little impact on precision. Index Terms: focus word pair, focus prediction, conditional random fields (CRFs), support vector machines (SVMs)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Using conditional random fields to predict focus word pair in spontaneous spoken English

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A study of the effectiveness of machine learning methods for classification of clinical interview fragments into a large number of categories
Mehedi Hasan ... Kathryn Brogan Hartlieb
Journal of Biomedical Informatics | VOL. 62
Mehedi Hasan, et. al.Mehedi Hasan ... Kathryn Brogan Hartlieb
13 May 2016
Journal of Biomedical Informatics | VOL. 62

مدل سازی پایداری خاکدانهها با استفاده از ماشینهای بردار پشتیبان و رگرسیون خطی چند متغیره
...
-
, et. al. ...
25 Apr 2015
25 Apr 2015

Sentiment Classification based on Linguistic Patterns in Citation Context
Mingyang Wang ... Yiming Zeng
Current Science | VOL. 117
Mingyang Wang, et. al.Mingyang Wang ... Yiming Zeng
25 Aug 2019
Current Science | VOL. 117

Traffic Volume Forecasting Model of Freeway Toll Stations During Holidays – An SVM Model
Xiaowei Hu ... Tianlin Wang
Promet - Traffic&Transportation | VOL. 34
Xiaowei Hu, et. al.Xiaowei Hu ... Tianlin Wang
15 Jun 2022
Promet - Traffic&Transportation | VOL. 34

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Using conditional random fields to predict focus word pair in spontaneous spoken English

Abstract

Talk to us

Similar Papers