Vowel Enhancement in Early Stage Spanish Esophageal Speech Using Natural Glottal Flow Pulse and Vocal Tract Frequency Warping

Rizwan Ishaq, Dhanananjaya Gowda, Begonya Garcia Zapirain, Paavo Alku

doi:10.18653/v1/w15-5110

Abstract

This paper presents an enhancement system for early-stage Spanish Esophageal Speech (ES) vowels. The system decomposes the input ES into neoglottal waveform and vocal tract filter components using Iterative Adaptive Inverse Filtering (IAIF). The neoglottal waveform is further decomposed into fundamental frequency (F0), Harmonic-to-Noise Ratio (HNR), and neoglottal source spectrum. The enhanced neoglottal source signal is constructed from a natural glottal flow pulse computed from real speech, and the F0 and HNR are replaced with those of natural speech. The vocal tract formant frequencies (spectral peaks) and bandwidths are smoothed, the formants are shifted downward using a second-order frequency warping polynomial, and the bandwidths are increased to bring them close to those of natural speech. The system is evaluated with subjective listening tests on the Spanish ES vowels /a/, /e/, /i/, /o/, /u/. The Mean Opinion Score (MOS) shows a significant improvement in the overall quality (naturalness and intelligibility) of the vowels.

Index Terms: speech enhancement, glottal flow, analysis synthesis vocal tract, spectral sharpening, warping
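The source-side steps in the abstract (one natural pulse repeated at a natural F0, with the noise level set by a target HNR) can be illustrated with a minimal sketch. Everything named here is an assumption for illustration: the helper `synth_excitation`, the Hann-shaped placeholder pulse (the paper uses a glottal flow pulse computed from real speech), and all parameter values are hypothetical and not taken from the paper.

```python
import numpy as np

def synth_excitation(pulse, f0_hz, hnr_db, fs, duration_s, seed=0):
    """Overlap-add one pulse at the F0 period, then add noise at a target HNR.

    Hypothetical helper for illustration; not the paper's implementation.
    Returns (excitation, harmonic_part, noise_part).
    """
    n = int(round(duration_s * fs))
    period = int(round(fs / f0_hz))        # samples between pulse onsets
    harmonic = np.zeros(n + len(pulse))    # extra room so every slice is full length
    for start in range(0, n, period):
        harmonic[start:start + len(pulse)] += pulse
    harmonic = harmonic[:n]

    rng = np.random.default_rng(seed)
    noise = rng.standard_normal(n)
    # Scale the noise so that 10*log10(P_harmonic / P_noise) equals hnr_db.
    p_h = np.mean(harmonic ** 2)
    p_n = np.mean(noise ** 2)
    noise *= np.sqrt(p_h / (p_n * 10.0 ** (hnr_db / 10.0)))
    return harmonic + noise, harmonic, noise

fs = 16000
pulse = np.hanning(200)  # placeholder shape, NOT a real glottal flow pulse
excitation, harmonic, noise = synth_excitation(pulse, 120.0, 20.0, fs, 0.5)
measured_hnr = 10.0 * np.log10(np.mean(harmonic ** 2) / np.mean(noise ** 2))
print(round(measured_hnr, 2))
```

In this sketch the measured HNR matches the requested value by construction, because the noise is rescaled against the power of the harmonic part. A real system would estimate F0 and HNR from natural speech rather than pass them in as constants.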

Highlights

The removal of the larynx in a Total Laryngectomy (TL) changes the speech production mechanism.
To deal with these deficiencies, this paper proposes an Esophageal Speech (ES) enhancement method based on the GlottHMM single pulse synthesis [15, 16, 17].
The vocal tract spectrum of ES has the following characteristics: (i) higher frequencies are emphasized more than lower frequencies, (ii) spectral resonances are shifted to higher frequencies, and (iii) resonance bandwidths are narrower than in normal speech vowels.
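Since ES resonances sit too high, the correction is a downward frequency warp. The sketch below shows one possible second-order warping polynomial that keeps 0 Hz and the Nyquist frequency fixed. The specific polynomial form, the coefficient `alpha`, and the function names are assumptions chosen for illustration; the coefficients used in the paper are not given in this text.

```python
import numpy as np

def warp_frequency(f_hz, f_nyq_hz, alpha):
    """Second-order warping f' = f + alpha * f * (f - f_nyq) / f_nyq.

    0 and f_nyq map to themselves; for 0 < alpha < 1 the map is monotonic
    and every interior frequency moves downward. alpha is a hypothetical knob.
    """
    f_hz = np.asarray(f_hz, dtype=float)
    return f_hz + alpha * f_hz * (f_hz - f_nyq_hz) / f_nyq_hz

def warp_envelope(envelope, f_nyq_hz, alpha):
    """Warp a spectral envelope sampled uniformly on [0, f_nyq]."""
    grid = np.linspace(0.0, f_nyq_hz, len(envelope))
    warped_positions = warp_frequency(grid, f_nyq_hz, alpha)
    # The value at grid[i] moves to warped_positions[i]; interpolate back onto the grid.
    return np.interp(grid, warped_positions, envelope)

f_nyq = 4000.0
grid = np.linspace(0.0, f_nyq, 4001)
# Toy envelope with a single resonance-like peak at 1000 Hz.
envelope = np.exp(-0.5 * ((grid - 1000.0) / 50.0) ** 2)
warped = warp_envelope(envelope, f_nyq, alpha=0.2)
peak_before = grid[np.argmax(envelope)]
peak_after = grid[np.argmax(warped)]
print(peak_before, peak_after)
```

With these toy values the peak at 1000 Hz maps to 1000 + 0.2 * 1000 * (1000 - 4000) / 4000 = 850 Hz. Peaks near the middle of the band move the most, while the band edges stay in place.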

Summary

Introduction

The removal of the larynx in a Total Laryngectomy (TL) changes the speech production mechanism. Compared with the production of normal speech under the source-filter model [1], the voicing source in ES is severely altered and has no fundamental frequency or harmonic components. ES can be enhanced by transforming its source and filter components to those of normal speech using signal processing algorithms. In [7], the source was replaced with the LF model and the bandwidths of the filter formants were increased to improve speech quality. The vocal tract formants are typically assumed to be the same as in normal speech signals. In the proposed system, the spectral peaks of the vocal tract filter are instead moved to lower frequencies to compensate for the raised formants in ES.
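Formant bandwidth widening of the kind mentioned above is often done on an all-pole (LPC) filter by scaling each coefficient a_k by gamma^k, which pulls every pole toward the origin. The sketch below uses that standard technique as an assumption; this text does not say how [7] or the proposed system widen the bandwidths, and the function names and numeric values are hypothetical.

```python
import numpy as np

def expand_bandwidth(a, delta_bw_hz, fs):
    """Widen every resonance of 1/A(z) by delta_bw_hz via a_k -> a_k * gamma**k."""
    gamma = np.exp(-np.pi * delta_bw_hz / fs)   # pole radius scaling factor
    a = np.asarray(a, dtype=float)
    return a * gamma ** np.arange(len(a))

def pole_freqs_and_bandwidths(a, fs):
    """Return centre frequencies and 3 dB bandwidths (Hz) of the upper-half-plane poles."""
    poles = np.roots(a)
    poles = poles[np.imag(poles) > 0]
    freqs = np.angle(poles) * fs / (2.0 * np.pi)
    bws = -np.log(np.abs(poles)) * fs / np.pi
    order = np.argsort(freqs)
    return freqs[order], bws[order]

fs = 8000.0
# Toy two-pole resonator: 700 Hz centre frequency, 50 Hz bandwidth.
r = np.exp(-np.pi * 50.0 / fs)
theta = 2.0 * np.pi * 700.0 / fs
a = np.array([1.0, -2.0 * r * np.cos(theta), r ** 2])

a_wide = expand_bandwidth(a, 100.0, fs)
freqs, bws = pole_freqs_and_bandwidths(a_wide, fs)
print(freqs, bws)
```

Because a pole of radius r has a bandwidth of about -(fs / pi) * ln(r), multiplying the radius by gamma adds a fixed number of hertz to every resonance and leaves the centre frequencies unchanged. Here the 50 Hz resonance becomes roughly 150 Hz wide and stays at 700 Hz.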

System Description

GlottHMM based analysis

Neoglottal source signal enhancement

Vocal tract modification by nonlinear frequency warping

Synthesis of enhanced speech

System Evaluation

Original

Conclusion

Publication Date: Jan 1, 2015
Citations: 12
License type: cc-by

