Voice Conversion for Persons with Amyotrophic Lateral Sclerosis.

Yunxin Zhao,Minguang Song,Mili Kuruvilla-Dugdale

doi:10.1109/jbhi.2019.2961844

Yunxin Zhao, Minguang Song + Show 1 more

Open Access

https://doi.org/10.1109/jbhi.2019.2961844

Copy DOI

Abstract

Amyotrophic lateral sclerosis (ALS) results in progressive paralysis of voluntary muscles throughout the body. As speech deteriorates, individuals rely on pre-programmed messages available on commercial speech generating devices to communicate using one of the generic electronic voices on the device. To replace these generic voices and restore vocal identity, our aim is to develop personalized voices for people with ALS via the approach of voice conversion. The task is challenging because very few people have large quantities of their premorbid healthy speech recorded. Therefore, we have to rely on small quantities of dysarthric speech concomitant with an individual's disease stage. Further, progressive fatigue prohibits acquisition of large speech datasets and individuals display a range of dysarthria severities resulting from breathing, voice, articulation, resonance, and prosody disturbances. As the first step to address these problems, we use healthy source speakers and propose the approach of combining a structured sparse spectral transform with multiple linear regression-based frequency warping prediction for spectral conversion, and interpolating the transformed spectral frames for speech rate modification. Our experimental data included four healthy source speakers from the ARCTIC dataset, and four target ALS speakers with mild to severe dysarthria, forming 16 speaker pairs. Subjective listening evaluations showed that on average, (i) the proposed approach improved speech intelligibility by about 80% over the target speakers' speech, (ii) the converted voice was 3 times more similar to the target speakers' speech than to the source speakers' speech, and (iii) the converted speech quality was close to the MOS scale "good" relative to the source speakers' speech being "excellent."

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Voice Conversion for Persons with Amyotrophic Lateral Sclerosis.

Abstract

Talk to us

Similar Papers

More From: IEEE journal of biomedical and health informatics

Lead the way for us

Journal: IEEE journal of biomedical and health informatics	Publication Date: Dec 25, 2019
Citations: 36

Similar Papers

Personalizing TTS Voices for Progressive Dysarthria
Yunxin Zhao ... Yanghao Yue
-
Yunxin Zhao, et. al.Yunxin Zhao ... Yanghao Yue
27 Jul 2021
27 Jul 2021

Glucose metabolism in amyotrophic lateral sclerosis: it is bitter-sweet.
Johnd Lee ... Titaya Lerskiatiphanich
Neural Regeneration Research | VOL. 17
Johnd Lee, et. al.Johnd Lee ... Titaya Lerskiatiphanich
01 Jan 2021
Neural Regeneration Research | VOL. 17

Analysis of Features and Metrics for Alignment in Text-Dependent Voice Conversion
Nirmesh J Shah ... Hemant A Patil
-
Nirmesh J Shah, et. al.Nirmesh J Shah ... Hemant A Patil
01 Jan 2017
01 Jan 2017

Multiple Non-Negative Matrix Factorization for Many-to-Many Voice Conversion
Ryo Aihara ... Tetsuya Takiguchi
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 24
Ryo Aihara, et. al.Ryo Aihara ... Tetsuya Takiguchi
01 Jul 2016
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Voice Conversion for Persons with Amyotrophic Lateral Sclerosis.

Abstract

Talk to us

Similar Papers

More From: IEEE journal of biomedical and health informatics