Vocal tract inversion by cepstral analysis-by-synthesis using chain matrices

Sankaran Panchapagesan,Abeer Alwan

doi:10.21437/interspeech.2008-697

Abstract

Acoustic-to-articulatory inversion for vowels is performed by cepstral analysis-by-synthesis, using chain-matrix calculation of vocal tract (VT) acoustics and the Maeda articulatory model. The derivative of the VT chain matrix with respect to the area function was calculated in a novel efficient manner, and used in the BFGS quasi-Newton method for optimizing a distance measure between input and synthesized cepstral features over the entire articulatory trajectory. The optimization is initialized by a fast search of an articulatory codebook with a bin structure in formant space and the cost function also includes regularization and continuity terms to obtain realistic inverted VT shapes and smooth articulatory trajectories. Inversion is evaluated on the three diphthongs /ai/, /oi/ and /au/ of two speakers, one male and one female, from the University of Wisconsin X-ray microbeam (XRMB) database, and good agreement was achieved between inverted midsagittal vocal tract outlines and measured XRMB tongue and lip pellet positions, with an average relative error of less than 3% in the first three formants.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Vocal tract inversion by cepstral analysis-by-synthesis using chain matrices

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A study of acoustic-to-articulatory inversion of speech by analysis-by-synthesis using chain matrices and the Maeda articulatory model
Sankaran Panchapagesan ... Abeer Alwan
The Journal of the Acoustical Society of America | VOL. 129
Sankaran Panchapagesan, et. al.Sankaran Panchapagesan ... Abeer Alwan
01 Apr 2011
The Journal of the Acoustical Society of America | VOL. 129

A new method of synthesis of reactance networks
A Talbot
Proceedings of the IEE - Part IV: Institution Monographs | VOL. 101
A TalbotA Talbot
01 Feb 1954
Proceedings of the IEE - Part IV: Institution Monographs | VOL. 101

Relevance of the Implementation of Teeth in Three-Dimensional Vocal Tract Models.
Louisa Traser ... Robert Kamberger
Journal of Speech, Language, and Hearing Research | VOL. 60
Louisa Traser, et. al.Louisa Traser ... Robert Kamberger
18 Sep 2017
Journal of Speech, Language, and Hearing Research | VOL. 60

Articulation and vocal tract acoustics at soprano subject's high fundamental frequencies.
Matthias Echternach ... Michael Burdumy
The Journal of the Acoustical Society of America | VOL. 137
Matthias Echternach, et. al.Matthias Echternach ... Michael Burdumy
01 May 2015
The Journal of the Acoustical Society of America | VOL. 137

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Vocal tract inversion by cepstral analysis-by-synthesis using chain matrices

Abstract

Talk to us

Similar Papers