Latent factor analysis for synthesized speech quality-of-experience assessment

Rishabh Gupta,Tiago H Falk

doi:10.1007/s41233-017-0005-6

Abstract

Text-to-speech (TTS) systems are evolving and making way into numerous commercial systems, such as smartphones and assistive technologies. Notwithstanding, their user perceived quality-of-experience (QoE) is still low compared to natural speech, with distortions arising across numerous perceptual dimensions, such as voice pleasantness, comprehension, and appropriateness of intonation, to name a few. Unfortunately, the effects of such perceptual dimensions on overall perceived QoE is still unknown, particularly across listeners of different genders, thus making it difficult for TTS developers to further improve system quality. To overcome this limitation, this study makes use of exploratory factor analysis (EFA), confirmatory factor analysis (CFA), and model invariance tests to shed light on factors responsible for QoE perception across natural and synthesized speech, as well as male and female listeners. Experimental EFA/CFA results on a publicly available database of commercial TTS systems showed the emergence of two key perceptual dimensions responsible for TTS QoE, namely ‘listening pleasure’ and ‘prosody’. Model invariance tests validated the reliability of the model across male and female listeners, as well as across natural and synthetic voices.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Latent factor analysis for synthesized speech quality-of-experience assessment

Abstract

Talk to us

Similar Papers

More From: Quality and User Experience

Lead the way for us

Journal: Quality and User Experience	Publication Date: Feb 6, 2017
Citations: 11

Similar Papers

Long-Term Stability of Food Patterns Identified by Use of Factor Analysis among Swedish Women
Pk Newby ... Alicja Wolk
The Journal of Nutrition | VOL. 136
Pk Newby, et. al.Pk Newby ... Alicja Wolk
01 Mar 2006
The Journal of Nutrition | VOL. 136

Structural validity of the Eating Disorder Examination-Questionnaire: A systematic review.
Paul E Jenkins ... Renee D Rienecke
International Journal of Eating Disorders | VOL. 55
Paul E Jenkins, et. al.Paul E Jenkins ... Renee D Rienecke
03 May 2022
International Journal of Eating Disorders | VOL. 55

An Integrated Model of Financial Literacy among B–School Graduates Using Fuzzy AHP and Factor Analysis
Shivani Inder ... Sahil Gupta
The Journal of Wealth Management | VOL. 23
Shivani Inder, et. al.Shivani Inder ... Sahil Gupta
27 Nov 2020
The Journal of Wealth Management | VOL. 23

Exploratory factor analysis in Rehabilitation Psychology: a content analysis.
Richard B Roberson ... Jessica E Chang
Rehabilitation Psychology | VOL. 59
Richard B Roberson, et. al.Richard B Roberson ... Jessica E Chang
01 Nov 2014
Rehabilitation Psychology | VOL. 59

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Latent factor analysis for synthesized speech quality-of-experience assessment

Abstract

Talk to us

Similar Papers

More From: Quality and User Experience