Integrating Prosodic and Lexical Cues for Automatic Topic Segmentation

Gökhan Tür,Dilek Hakkani-Tür,Andreas Stolcke,Elizabeth Shriberg

doi:10.1162/089120101300346796

Integrating Prosodic and Lexical Cues for Automatic Topic Segmentation

Gökhan Tür, Dilek Hakkani-Tür + Show 2 more

Open Access

https://doi.org/10.1162/089120101300346796

Copy DOI

Journal: Computational Linguistics	Publication Date: Mar 1, 2001
Citations: 131

Affiliation: Bilkent University, SRI International

#Lexical Information #Broadcast News Corpus + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We present a probabilistic model that uses both prosodic and lexical cues for the automatic segmentation of speech into topically coherent units. We propose two methods for combining lexical and prosodic information using hidden Markov models and decision trees. Lexical information is obtained from a speech recognizer, and prosodic features are extracted automatically from speech waveforms. We evaluate our approach on the Broadcast News corpus, using the DARPA-TDT evaluation metrics. Results show that the prosodic model alone is competitive with word-based segmentation methods. Furthermore, we achieve a significant reduction in error by combining the prosodic and word-based knowledge sources.

Full Text