Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model Based on BLSTM

Szu-Wei Fu, Yu Tsao, Hsin-Min Wang, Hsin-Te Hwang

doi:10.21437/interspeech.2018-1802

Abstract

Most objective speech quality assessment tools, such as perceptual evaluation of speech quality (PESQ), compare the degraded or processed speech with its clean counterpart. The need for a "golden" reference considerably restricts the practicality of such tools in real-world scenarios, since the clean reference is usually unavailable. Human listeners, on the other hand, can readily evaluate speech quality without any reference (e.g., in mean opinion score (MOS) tests), which implies that an objective, non-intrusive (no clean reference needed) quality assessment mechanism exists. In this study, we propose a novel end-to-end, non-intrusive speech quality evaluation model, termed Quality-Net, based on bidirectional long short-term memory (BLSTM). Quality-Net derives the utterance-level quality from frame-level assessments. Frame constraints and sensible initializations of the forget gate biases are applied so that meaningful frame-level quality assessments can be learned from the utterance-level quality label. Experimental results show that Quality-Net yields high correlation with PESQ (0.9 for noisy speech and 0.84 for speech processed by speech enhancement). We believe that Quality-Net has the potential to be used in a wide variety of speech signal processing applications.
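The relationship between frame-level and utterance-level quality described in the abstract can be illustrated with a small sketch. The abstract does not give the exact formulation, so everything here is an assumption made for illustration: the utterance score is taken to be the plain mean of the frame scores, the frame constraint is modeled as a squared penalty pulling each frame score toward the utterance label, and the function names and the `weight` parameter are hypothetical. The frame scores are hard-coded stand-ins for what a BLSTM might emit.

```python
from typing import Sequence


def utterance_score(frame_scores: Sequence[float]) -> float:
    """Aggregate frame-level scores into one utterance-level score.

    Assumption: a plain mean over frames (not confirmed by the abstract).
    """
    if not frame_scores:
        raise ValueError("need at least one frame score")
    return sum(frame_scores) / len(frame_scores)


def loss_with_frame_constraint(
    frame_scores: Sequence[float], target: float, weight: float = 1.0
) -> float:
    """Utterance-level squared error plus a weighted frame-level penalty.

    The penalty term (hypothetical form) discourages individual frame
    scores from drifting far from the utterance-level label.
    """
    utt_err = (utterance_score(frame_scores) - target) ** 2
    frame_err = sum((q - target) ** 2 for q in frame_scores) / len(frame_scores)
    return utt_err + weight * frame_err


# Hypothetical frame scores for a 5-frame utterance labeled with quality 3.0.
frames = [2.0, 3.0, 4.0, 3.0, 3.0]
print(utterance_score(frames))
print(loss_with_frame_constraint(frames, target=3.0))
```

In this toy case the mean already matches the label, so the utterance-level error is zero and the remaining loss comes entirely from the frame-level penalty. That is the situation a frame constraint is meant to address: without it, frame scores are free to vary arbitrarily as long as their average is correct.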

Similar Papers

MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment
Meng Yu ... Chunlei Zhang
30 Aug 2021

Deep Learning-Based Non-Intrusive Multi-Objective Speech Assessment Model With Cross-Domain Features
Ryandhimas E Zezario ... Szu-Wei Fu
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 31
01 Jan 2023

New research on monaural speech segregation based on quality assessment
Xiaoping Xie ... Fei Ding
Computer Speech & Language | VOL. 85
05 Dec 2023

Performance Study of Objective Voice Quality Measures in VoIP
Lijing Ding ... Ayman Radwan
01 Jul 2007
