Abstract

Dialog act tagging is an important step toward speech understanding, yet training such taggers usually requires large amounts of data labeled by linguistic experts. Here we investigate the use of unlabeled data for training HMM-based dialog act taggers. Three techniques are shown to be effective for bootstrapping a tagger from very small amounts of labeled data: iterative relabeling and retraining on unlabeled data; a dialog grammar to model dialog act context; and a model of the prosodic correlates of dialog acts. On the SPINE dialog corpus, the combined use of prosodic information and unlabeled data reduces the tagging error by 12% to 16%, compared to baseline systems that use only word information and various amounts of labeled data.
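
The core bootstrapping recipe the abstract describes, iteratively relabeling unlabeled dialogs with an HMM tagger and retraining on the union of labeled and auto-labeled data, can be sketched roughly as below. This is a minimal illustration under assumed simplifications, not the paper's implementation: the HMM transitions are a bigram dialog grammar over act labels, emissions are unigram word models per act, the prosodic likelihood term is omitted, and the names HMMDialogActTagger and bootstrap are hypothetical.

```python
# Sketch of relabel-and-retrain bootstrapping for an HMM dialog act tagger.
# All modeling choices here (bigram dialog grammar, unigram word emissions,
# add-k smoothing) are illustrative assumptions, not the paper's setup.
from collections import Counter, defaultdict
import math


class HMMDialogActTagger:
    """HMM over dialog act labels: transitions = bigram dialog grammar,
    emissions = smoothed unigram word model per act."""

    def __init__(self, smoothing=0.1):
        self.smoothing = smoothing

    def train(self, dialogs):
        # dialogs: list of conversations, each a list of (act_label, [words]) pairs
        self.trans = defaultdict(Counter)   # counts for P(act_t | act_{t-1})
        self.emit = defaultdict(Counter)    # counts for P(word | act)
        self.acts, self.vocab = set(), set()
        for dialog in dialogs:
            prev = "<s>"
            for act, words in dialog:
                self.trans[prev][act] += 1
                for w in words:
                    self.emit[act][w] += 1
                    self.vocab.add(w)
                self.acts.add(act)
                prev = act

    def _log_trans(self, prev, act):
        c = self.trans[prev]
        return math.log((c[act] + self.smoothing) /
                        (sum(c.values()) + self.smoothing * len(self.acts)))

    def _log_emit(self, act, words):
        c = self.emit[act]
        total = sum(c.values()) + self.smoothing * (len(self.vocab) + 1)
        return sum(math.log((c[w] + self.smoothing) / total) for w in words)

    def tag(self, utterances):
        # Viterbi decode of the act sequence for one conversation
        # (utterances: list of word lists).
        states = sorted(self.acts)
        v = {a: self._log_trans("<s>", a) + self._log_emit(a, utterances[0])
             for a in states}
        backpointers = []
        for words in utterances[1:]:
            nv, bp = {}, {}
            for a in states:
                best = max(states, key=lambda p: v[p] + self._log_trans(p, a))
                nv[a] = v[best] + self._log_trans(best, a) + self._log_emit(a, words)
                bp[a] = best
            v, backpointers = nv, backpointers + [bp]
        last = max(states, key=lambda a: v[a])
        seq = [last]
        for bp in reversed(backpointers):
            seq.append(bp[seq[-1]])
        return list(reversed(seq))


def bootstrap(labeled, unlabeled, iterations=3):
    """Iteratively relabel the unlabeled dialogs and retrain on the union."""
    tagger = HMMDialogActTagger()
    tagger.train(labeled)
    for _ in range(iterations):
        relabeled = []
        for utterances in unlabeled:            # each: list of word lists
            acts = tagger.tag(utterances)
            relabeled.append(list(zip(acts, utterances)))
        tagger = HMMDialogActTagger()
        tagger.train(labeled + relabeled)       # retrain on labeled + auto-labeled
    return tagger
```

The sketch only shows the structure of the relabel-and-retrain loop; per the abstract, the reported 12% to 16% error reductions on SPINE come from combining this use of unlabeled data with a separate model of the prosodic correlates of dialog acts, which is not represented here.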
