Abstract

In IP audio systems, audio quality is degraded by environmental noise, poor network conditions, and encoding and decoding algorithms. There is therefore a need for continuous, automatic quality evaluation of the transmitted audio. Speech quality monitoring in VoIP systems enables autonomous system adaptation. Furthermore, IP audio transmitters and receivers are diverse, ranging from high-performance computers and mobile phones to embedded systems with limited memory and computing capacity. This paper proposes MiniatureVQNet, a single-ended speech quality evaluation method for VoIP audio applications based on a lightweight deep neural network (DNN) model. The proposed model can predict audio quality independently of the source of degradation, whether noise or network, and is light enough to run on embedded systems. Two variations of the proposed model were evaluated: MiniatureVQNet-Noise, trained on a dataset containing environmental noise only, and MiniatureVQNet-Noise-Network, trained on both noise and network distortions. The proposed MiniatureVQNet model outperforms the traditional ITU-T P.563 method in accuracy under all tested network conditions and environmental noise parameters. The mean squared error (MSE) of the predictions relative to the PESQ score was 2.19 for ITU-T P.563, 0.34 for MiniatureVQNet-Noise, and 0.21 for MiniatureVQNet-Noise-Network. The performance of both MiniatureVQNet-Noise-Network and MiniatureVQNet-Noise depends on the noise type for SNRs between 0 dB and 10 dB. In addition, training on a speech dataset distorted by both noise and network impairments improves the model's prediction accuracy under all VoIP environment distortions compared to training on a noise-only dataset.
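To make the evaluation metric concrete, the following is a minimal Python sketch of how the MSE between a single-ended model's quality predictions and full-reference PESQ scores could be computed. The score arrays and values shown are hypothetical placeholders, not data from the paper, and the paper's actual model and evaluation pipeline are not reproduced here.

```python
# Minimal sketch: comparing single-ended quality predictions against PESQ
# reference scores via mean squared error (MSE). The example scores below
# are hypothetical placeholders for illustration only.
import numpy as np

def mean_squared_error(predicted, reference):
    """MSE between predicted quality scores and PESQ reference scores."""
    predicted = np.asarray(predicted, dtype=float)
    reference = np.asarray(reference, dtype=float)
    return float(np.mean((predicted - reference) ** 2))

# Hypothetical per-utterance scores on a small test set.
pesq_scores = [3.8, 2.1, 4.2, 1.9]    # full-reference PESQ targets
model_scores = [3.6, 2.4, 4.0, 2.3]   # single-ended model predictions

print(f"MSE vs. PESQ: {mean_squared_error(model_scores, pesq_scores):.2f}")
```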
