Non-intrusive Speech Quality Assessment based on Tucker Decomposition and Deep Neural Network

Yahui Shan,Jing Wang,Xiang Xie,Yiyu Luo,Min Liu

doi:10.1109/icsidp47821.2019.9173067

Abstract

In this paper, we present a new speech quality assessment method to estimate the quality of degraded speech without the reference speech. The traditional non-intrusive assessment methods cannot meet the requirement of high consistency with subjective results owing to the lack of original reference signals. To solve these issues, deep belief network is trained to produce pseudo-reference speech signal of degraded speech. Then the pseudo-reference speech and the degraded speech are modeled by tensor analysis to obtained features which is used to calculate feature differences. The feature differences are mapped to speech quality score using support vector regression. Experiments are conducted in a wideband dataset containing various degraded speech signals and subjective listening scores. When compared with the Gaussian Mixture Model method and deep belief network method, the proposed method brings about a higher correlation coefficient between predicted scores and subjective scores.

Full Text