Abstract

Multipulse modeling of articulatory movements can provide a flexible and intuitive representation of the dynamic behavior of speech articulators. Such a representation can prove especially useful for the exploitation of the significant amounts of rich articulatory data that are collected via vocal tract imaging techniques such as real-time MRI or x-ray microbeam. Original multipulse LPC articulatory modeling studies only focused on a limited set of articulations by a single speaker as these were imaged using x-rays. In this work, application of the multipulse modeling framework on the Wisconsin x-ray microbeam speech production database is investigated. This database contains articulatory data, i.e., trajectories of points on the tongue, jaw, and lips during multiple articulations as well as hard palate tracings, from 57 speakers. The original measurements are converted to constriction degree measurements and it is assumed that the corresponding trajectories can be modeled as the output of a low-order linear system when it is excited by a sequence of pulses of variable amplitudes. Pulse locations are determined using optimal matching pursuit and all pulse amplitudes are re-optimized every time a new pulse is introduced. Model similarities and differences are presented both across multiple utterances by the same speaker and across speakers.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call