Recognizing disfluencies in conversational speech

M Lease,M Johnson,E Charniak

doi:10.1109/tasl.2006.878269

Abstract

We present a system for modeling disfluency in conversational speech: repairs, fillers, and self-interruption points (IPs). For each sentence, candidate repair analyses are generated by a stochastic tree adjoining grammar (TAG) noisy-channel model. A probabilistic syntactic language model scores the fluency of each analysis, and a maximum-entropy model selects the most likely analysis given the language model score and other features. Fillers are detected independently via a small set of deterministic rules, and IPs are detected by combining the output of repair and filler detection modules. In the recent Rich Transcription Fall 2004 (RT-04F) blind evaluation, systems competed to detect these three forms of disfluency under two input conditions: a best-case scenario of manually transcribed words and a fully automatic case of automatic speech recognition (ASR) output. For all three tasks and on both types of input, our system was the top performer in the evaluation

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Recognizing disfluencies in conversational speech

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech and Language Processing

Lead the way for us

Journal: IEEE Transactions on Audio, Speech and Language Processing	Publication Date: Sep 1, 2006
Citations: 60

Similar Papers

Automatic Speech Recognition of Conversational Speech in Individuals With Disordered Speech.
Jimmy Tobin ... Jordan R Green
Journal of speech, language, and hearing research : JSLHR | VOL. 67
Jimmy Tobin, et. al.Jimmy Tobin ... Jordan R Green
07 Nov 2024
Journal of speech, language, and hearing research : JSLHR | VOL. 67

Filled pause refinement based on the pronunciation probability for lecture speech.
Yan-Hua Long ... Ian Mcloughlin
PloS one | VOL. 10
Yan-Hua Long, et. al.Yan-Hua Long ... Ian Mcloughlin
10 Apr 2015
PloS one | VOL. 10

Comparing HMM, maximum entropy, and conditional random fields for disfluency detection
Yang Liu ... Mary Harper
-
Yang Liu, et. al.Yang Liu ... Mary Harper
04 Sep 2005
04 Sep 2005

Conversational Speech Recognition by Learning Conversation-Level Characteristics
Kun Wei ... Long Ma
-
Kun Wei, et. al.Kun Wei ... Long Ma
23 May 2022
23 May 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Recognizing disfluencies in conversational speech

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech and Language Processing