Sound Pattern Matching for Automatic Prosodic Event Detection

Milos Cernak, Philip N Garner, Afsaneh Asaei, Pierre-Edouard Honnet, Hervé Bourlard

doi:10.21437/interspeech.2016-875

Abstract

Prosody in speech is manifested by variations of loudness, exaggeration of pitch, and specific phonetic variations of prosodic segments. For example, stressed and unstressed syllables differ in place or manner of articulation: vowels in unstressed syllables may have a more central articulation, and vowel reduction may occur when a vowel changes from a stressed to an unstressed position. In this paper, we characterize the sound patterns using phonological posteriors to capture the phonetic variations in a concise manner. The phonological posteriors quantify the posterior probabilities of the phonological features given the input speech acoustics, and they are obtained using a deep neural network (DNN). Built on the assumption that different prosodic segments have unique sound patterns, we devise a sound pattern matching (SPM) method based on a 1-nearest neighbour classifier. In this work, we focus on automatic detection of prosodic stress placed on words, also called emphasized words. We evaluate the SPM method on English and French data with emphasized words. Word emphasis detection also works very well in cross-lingual tests, that is, using a French classifier on English data, and vice versa.
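The idea of matching sound patterns with a 1-nearest neighbour classifier can be illustrated with a minimal sketch. The abstract does not specify the feature dimensionality, the distance measure, or whether the posteriors are binarized, so the 0.5 threshold, the Hamming distance, the four-dimensional toy vectors, and all function names below are illustrative assumptions rather than the method used in the paper.

```python
from typing import List, Sequence, Tuple

Pattern = Tuple[int, ...]


def binarize(posteriors: Sequence[float], threshold: float = 0.5) -> Pattern:
    """Turn phonological posteriors into a binary sound pattern.

    Assumption: thresholding at 0.5 is a hypothetical choice for this sketch.
    """
    return tuple(1 if p >= threshold else 0 for p in posteriors)


def hamming(a: Pattern, b: Pattern) -> int:
    """Count the positions where two equal-length patterns differ."""
    if len(a) != len(b):
        raise ValueError("patterns must have the same length")
    return sum(x != y for x, y in zip(a, b))


def classify_1nn(query: Pattern, codebook: List[Tuple[Pattern, str]]) -> str:
    """Return the label of the stored pattern closest to the query (1-NN)."""
    nearest_pattern, nearest_label = min(
        codebook, key=lambda entry: hamming(query, entry[0])
    )
    return nearest_label


# Toy codebook: made-up posteriors for two prosodic classes (not real data).
codebook = [
    (binarize([0.9, 0.8, 0.1, 0.2]), "emphasized"),
    (binarize([0.2, 0.1, 0.7, 0.9]), "neutral"),
]

# A made-up query frame of phonological posteriors.
frame = [0.8, 0.6, 0.3, 0.1]
label = classify_1nn(binarize(frame), codebook)
```

In this sketch a cross-lingual test would amount to building the codebook from one language and classifying query patterns from the other, provided both use the same set of phonological features.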
