Estimation of glottal source waveforms and vocal tract shapes from speech signals based on ARX-LF model

Yongwei Li,Masato Akagi,Ken-Ichi Sakakibara

doi:10.1109/iscslp.2018.8706694

Abstract

The widely used method to estimate glottal source waveform and vocal tract shape is to process speech signal using inverse filter and then to fit residual signal using glottal source model. However, since source-tract interactions, estimation accuracy is reduced. In this paper, we propose a method to estimate glottal source waveform and vocal tract shape simultaneously based on analysis-by-synthesis approach with a source-filter model constructed with an auto-regressive eXogenous (ARX) model combined with the Lilijencrant-Fant (LF) model. Since the optimization of multiple parameters makes simultaneous estimation difficult, there are two steps: the glottal source parameters are initialized using the inverse filter method, then the accurate parameters of the glottal source and the vocal tract shape are estimated simultaneously using an analysis-by-synthesis approach. Experimental results with synthetic and real speech signals showed the higher estimation accuracy of the proposed method than inverse filter.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Estimation of glottal source waveforms and vocal tract shapes from speech signals based on ARX-LF model

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Simultaneous Estimation of Glottal Source Waveforms and Vocal Tract Shapes from Speech Signals Based on ARX-LF Model
Yongwei Li ... Ken-Ichi Sakakibara
Journal of Signal Processing Systems | VOL. 92
Yongwei Li, et. al.Yongwei Li ... Ken-Ichi Sakakibara
23 Dec 2019
Journal of Signal Processing Systems | VOL. 92

$F_0$-Noise-Robust Glottal Source and Vocal Tract Analysis Based on ARX-LF Model
Yongwei Li ... Donna Erickson
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 29
Yongwei Li, et. al.Yongwei Li ... Donna Erickson
01 Jan 2020
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 29

Estimation of glottal source waveforms and vocal tract shape for singing voices with wide frequency range
Kyoko Takahashi ... Masato Akagi
-
Kyoko Takahashi, et. al.Kyoko Takahashi ... Masato Akagi
01 Nov 2018
01 Nov 2018

Investigation of self-supervised pre-trained models for classification of voice quality from speech and neck surface accelerometer signals
Sudarsana Reddy Kadiri ... Paavo Alku
Computer Speech & Language | VOL. 83
Sudarsana Reddy Kadiri, et. al.Sudarsana Reddy Kadiri ... Paavo Alku
28 Jul 2023
Computer Speech & Language | VOL. 83

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Estimation of glottal source waveforms and vocal tract shapes from speech signals based on ARX-LF model

Abstract

Talk to us

Similar Papers