Automatic Detection of Disfluency Boundaries in Spontaneous Speech of Children Using Audio&amp;#x2013;Visual Information

Serdar Yildirim,Shrikanth Narayanan

doi:10.1109/tasl.2008.2006728

Abstract

The presence of disfluencies in spontaneous speech, while poses a challenge for robust automatic recognition, also offers means for gaining additional insights into understanding a speaker's communicative and cognitive state. This paper analyzes disfluencies in children's spontaneous speech, in the context of spoken dialog based computer game play, and addresses the automatic detection of disfluency boundaries. Although several approaches have been proposed to detect disfluencies in speech, relatively little work has been done to utilize visual information to improve the performance and robustness of the disfluency detection system. This paper describes the use of visual information along with prosodic and language information to detect the presence of disfluencies in a child's computer-directed speech and shows how these information sources can be integrated to increase the overall information available for disfluency detection. The experimental results on our children's multimodal dialog corpus indicate that disfluency detection accuracy of over 80% can be obtained by utilizing audio-visual information. Specifically, results showed that the addition of visual information to prosody and language features yield relative improvements in disfluency detection error rates of 3.6% and 6.3%, respectively, for information fusion at the feature level and decision level.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Automatic Detection of Disfluency Boundaries in Spontaneous Speech of Children Using Audio&#x2013;Visual Information

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech, and Language Processing

Lead the way for us

Journal: IEEE Transactions on Audio, Speech, and Language Processing	Publication Date: Jan 1, 2009
Citations: 43

Similar Papers

Current Aspects of Speech Disfluency Research: Scientific Review of the International Workshop DiSS 19 (Budapest) Issues
Tatiana Sineokova
Nizhny Novgorod Linguistics University Bulletin | VOL. -
Tatiana SineokovaTatiana Sineokova
30 Jun 2020
Nizhny Novgorod Linguistics University Bulletin | VOL. -

How Listeners Compensate for Disfluencies in Spontaneous Speech
Susan E Brennan ... Michael F Schober
Journal of Memory and Language | VOL. 44
Susan E Brennan, et. al.Susan E Brennan ... Michael F Schober
01 Feb 2001
Journal of Memory and Language | VOL. 44

Auxiliary Sequence Labeling Tasks for Disfluency Detection
Dongyub Lee ... Jaechoon Jo
-
Dongyub Lee, et. al.Dongyub Lee ... Jaechoon Jo
30 Aug 2021
30 Aug 2021

Disfluencies in non-stuttering adults across sample lengths and topics
Patricia M Roberts ... Joanne Wilding
Journal of Communication Disorders | VOL. 42
Patricia M Roberts, et. al.Patricia M Roberts ... Joanne Wilding
21 Jun 2009
Journal of Communication Disorders | VOL. 42

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automatic Detection of Disfluency Boundaries in Spontaneous Speech of Children Using Audio&amp;#x2013;Visual Information

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech, and Language Processing

Automatic Detection of Disfluency Boundaries in Spontaneous Speech of Children Using Audio–Visual Information