Automatic ToBI prediction and alignment to speed manual labeling of prosody

Ann K Syrdal,Julia Hirschberg,Julie Mcgory,Mary Beckman

doi:10.1016/s0167-6393(00)00073-x

Abstract

Tagging of corpora for useful linguistic categories can be a time-consuming process, especially with linguistic categories for which annotation standards are relatively new, such as discourse segment boundaries or the intonational events marked in the Tones and Break Indices (ToBI) system for American English. A ToBI prosodic labeling of speech typically takes even experienced labelers from 100 to 200 times real time. An experiment was conducted to determine (1) whether manual correction of automatically assigned ToBI labels would speed labeling, and (2) whether default labels introduced any bias in label assignment. A large speech corpus of one female speaker reading several types of texts was automatically assigned default labels. Default accent placement and phrase boundary location were predicted from text using machine learning techniques. The most common ToBI labels were assigned to these locations for default tones and break type. Predicted pitch accents were automatically aligned to the mid-point of the word, while breaks and edge tones were aligned to the end of the phrase-final word. The corpus was then labeled by a group of five trained transcribers working over a period of nine months. Half of each set of recordings was labeled in the standard fashion without default labels, and the other half was presented with preassigned default labels for labelers to correct. Results indicate that labeling from defaults was generally faster than standard labeling, and that defaults had relatively little impact on label assignment.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Automatic ToBI prediction and alignment to speed manual labeling of prosody

Abstract

Talk to us

Similar Papers

More From: Speech Communication

Lead the way for us

Journal: Speech Communication	Publication Date: Nov 28, 2000
Citations: 109

Similar Papers

Automatic ToBI prediction and alignment to speed manual labeling of prosody
Ann K Syrdal ... Julie T Mcgory
The Journal of the Acoustical Society of America | VOL. 106
Ann K Syrdal, et. al.Ann K Syrdal ... Julie T Mcgory
01 Oct 1999
The Journal of the Acoustical Society of America | VOL. 106

The Adaptation of the ToBI System for Lithuanian Pitch Change
Neringa Krapikaitė
Žmogus ir žodis | VOL. 17
Neringa KrapikaitėNeringa Krapikaitė
20 May 2015
Žmogus ir žodis | VOL. 17

Intonation in Hebrew-Speaking Children with High Functioning Autism
Hila Green
Asia Pacific Journal of Speech, Language and Hearing | VOL. 12
Hila GreenHila Green
01 Jun 2009
Asia Pacific Journal of Speech, Language and Hearing | VOL. 12

Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence
Sankaranarayanan Ananthakrishnan ... Shrikanth S Narayanan
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 16
Sankaranarayanan Ananthakrishnan, et. al.Sankaranarayanan Ananthakrishnan ... Shrikanth S Narayanan
01 Jan 2008
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automatic ToBI prediction and alignment to speed manual labeling of prosody

Abstract

Talk to us

Similar Papers

More From: Speech Communication