Abstract

In this paper, we present a new speech quality assessment method to estimate the quality of degraded speech without the reference speech. The traditional non-intrusive assessment methods cannot meet the requirement of high consistency with subjective results owing to the lack of original reference signals. To solve these issues, deep belief network is trained to produce pseudo-reference speech signal of degraded speech. Then the pseudo-reference speech and the degraded speech are modeled by tensor analysis to obtained features which is used to calculate feature differences. The feature differences are mapped to speech quality score using support vector regression. Experiments are conducted in a wideband dataset containing various degraded speech signals and subjective listening scores. When compared with the Gaussian Mixture Model method and deep belief network method, the proposed method brings about a higher correlation coefficient between predicted scores and subjective scores.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.