Neural network-based non-intrusive speech quality assessment using attention pooling function

Miao Liu, Weiming Yi, Fang Liu, Jing Wang

doi:10.1186/s13636-021-00209-4

Abstract

Non-intrusive speech quality assessment has recently attracted considerable attention because it does not require the original reference signal. At the same time, neural networks have been applied to speech quality assessment and have achieved good performance. To improve the performance of non-intrusive speech quality assessment, this paper proposes a neural network-based assessment method that uses an attention pooling function. The proposed systems are based on convolutional neural network (CNN), bidirectional long short-term memory (BLSTM), and CNN-LSTM structures. Comparing four types of pooling functions both theoretically and experimentally, we find that the attention pooling function performs best among the four. Experiments are conducted on a dataset containing various degraded speech signals with corresponding subjective quality scores. The results show that the proposed CNN-LSTM model using the attention pooling function achieves a state-of-the-art correlation coefficient (R) and root-mean-square error (RMSE) of 0.967 and 0.269, respectively, outperforming the ITU-T P.563 standard and the autoencoder-support vector regression method.
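As a rough illustration of the ideas named in the abstract, the sketch below pools frame-level quality scores into a single utterance-level score using softmax-normalized attention weights, and computes the two reported evaluation metrics (R and RMSE). It is a minimal sketch under stated assumptions, not the authors' implementation: the abstract does not give the exact form of the attention pooling function, so the softmax weighting, the function names (`attention_pool`, `mean_pool`, `pearson_r`, `rmse`), and the idea of passing attention logits in as plain inputs are all hypothetical. In a trained model the logits would be produced by the network itself.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(frame_scores, attention_logits):
    """Weighted average of frame-level scores.

    Assumption: weights are a softmax over per-frame attention logits.
    """
    weights = softmax(attention_logits)
    return sum(w * s for w, s in zip(weights, frame_scores))


def mean_pool(frame_scores):
    """Plain average pooling, shown for comparison."""
    return sum(frame_scores) / len(frame_scores)


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def rmse(predicted, target):
    """Root-mean-square error between predictions and targets."""
    n = len(predicted)
    return math.sqrt(sum((p - t) ** 2 for p, t in zip(predicted, target)) / n)
```

With equal attention logits, `attention_pool` reduces to `mean_pool`; as one logit grows much larger than the others, the pooled score approaches that frame's score. This is one way to see why a learned weighting can emphasize the frames that matter most for perceived quality.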

Journal: EURASIP Journal on Audio, Speech, and Music Processing
Publication Date: May 17, 2021
Citations: 1
License type: open-access

Similar Papers

Non-intrusive objective speech quality assessment using a combination of MFCC, PLP and LSF features
Rajesh Kumar Dubey ... Arun Kumar
01 Dec 2013

NISQE: Non-Intrusive Speech Quality Evaluator Based on Natural Statistics of Mean Subtracted Contrast Normalized Coefficients of Spectrogram
Shakeel Zafar ... Mobeen Ur Rehman
Sensors | VOL. 23
16 Jun 2023

Non-intrusive speech quality assessment: A survey
Kailai Shen ... Zhe Ye
Neurocomputing | VOL. 580
06 Mar 2024

Gujarati Task Oriented Dialogue Slot Tagging Using Deep Neural Network Models
Rachana Parikh ... Hiren Joshi
01 Jan 2020
