An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques

Hironori Doi, Kiyohiro Shikano, Hiroshi Saruwatari, Tomoki Toda, Keigo Nakamura

doi:10.1109/icassp.2011.5947513

Open Access

Publication Date: May 1, 2011
Citations: 28
License type: cc-by-nc-nd

Affiliation: Nara Institute of Science and Technology

Abstract

In this study, we evaluate our proposed methods for enhancing alaryngeal speech based on statistical voice conversion techniques. Voice conversion based on a Gaussian mixture model has been applied to the conversion of alaryngeal speech into normal speech (AL-to-Speech). Moreover, one-to-many eigenvoice conversion (EVC) has also been applied to AL-to-Speech to enable the recovery of the original voice quality of laryngectomees even if only one arbitrary utterance of the original voice is available. VC/EVC-based AL-to-Speech systems have been developed for several types of alaryngeal speech, such as esophageal speech (ES), electrolaryngeal speech (EL), and body-conducted silent electrolaryngeal speech (silent EL). These proposed systems are compared with each other from various perspectives. The experimental results demonstrate that our proposed systems yield significant enhancement effects on each type of alaryngeal speech.
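The abstract does not spell out the conversion function, so the following is only a minimal sketch of the classic frame-wise mapping commonly associated with GMM-based voice conversion: a joint-density GMM over source and target features, with the converted value taken as the posterior-weighted conditional mean. It assumes scalar features for readability (real systems use multi-dimensional spectral features), and the function names, dictionary keys, and `TOY_GMM` parameters are all hypothetical. The systems evaluated in the paper may use a different, more elaborate conversion procedure.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def convert(x, components):
    """Map a scalar source feature x to a target feature under a joint-density GMM.

    Each component is a dict with hypothetical keys:
      weight  mixture weight
      mu_x    source mean
      mu_y    target mean
      var_x   source variance
      cov_yx  covariance between target and source
    """
    # Posterior probability of each mixture component given the source feature.
    likelihoods = [
        c["weight"] * gaussian_pdf(x, c["mu_x"], c["var_x"]) for c in components
    ]
    total = sum(likelihoods)
    posteriors = [lik / total for lik in likelihoods]

    # Posterior-weighted sum of the per-component conditional means E[y | x, m].
    return sum(
        p * (c["mu_y"] + c["cov_yx"] / c["var_x"] * (x - c["mu_x"]))
        for p, c in zip(posteriors, components)
    )


# Toy parameters chosen by hand for illustration; not taken from the paper.
TOY_GMM = [
    {"weight": 0.5, "mu_x": -1.0, "mu_y": 0.0, "var_x": 1.0, "cov_yx": 0.5},
    {"weight": 0.5, "mu_x": 1.0, "mu_y": 2.0, "var_x": 1.0, "cov_yx": 0.5},
]

if __name__ == "__main__":
    print(convert(0.0, TOY_GMM))
```

In practice the GMM parameters would be trained on time-aligned pairs of alaryngeal and normal speech features rather than set by hand.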

Similar Papers

Acceptability ratings of normal, esophageal, and artificial larynx speech.
Suzanne Bennett ... Bernd Weinberg
Journal of Speech and Hearing Research | VOL. 16
01 Dec 1973

Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models
Hironori Doi ... Hiroshi Saruwatari
IEICE Transactions on Information and Systems | VOL. E93-D
01 Jan 2009

A hybrid approach to electrolaryngeal speech enhancement based on spectral subtraction and statistical voice conversion
Kou Tanaka ... Sakriani Sakti
25 Aug 2013

Statistical approach to enhancing esophageal speech based on Gaussian mixture models
Hironori Doi ... Tomoki Toda
01 Jan 2009
