Abstract

There is an increasing interest in upgrading the E-Model, a parametric tool for speech quality estimation, to the wideband and super-wideband contexts. The main motivation behind this has been to quantify the quality gain lent by various new codecs and communication situations. There have been numerous such contributions, and all of them have been more or less successful. This paper reports on an extension of the E-Model to the mixed narrowband/wideband (NB/WB) context. More specifically, we take a novel approach toward deriving effective equipment impairment factors ( $I_{e,WB,eff}$ ) by considering additional impairments related to the underlying communications network. These additional impairments are the pause and jump temporal discontinuities along with network-related loss and pure codec-related impairments. While the effect of loss is a thoroughly studied topic and has been integrated into the E-Model, pauses and jumps have been given little attention. Pauses and jumps manifest themselves as temporal dilation and contraction, respectively, in the resulting speech signal that is presented to the listener and are normally caused by jitter and jitter buffer interaction. In this paper, we initially present a four-state Markov model to characterize, and also emulate, loss, pause, and jump impairments. Then, we present alternative models for computing effective equipment impairment models. A large number of test stimuli were generated using different NB and WB codecs. WB-PESQ was used to evaluate the stimuli. Genetic programming was employed to derive equipment impairment factors. The proposed models have a high correlation with WB-PESQ. We claim that the models proposed by us outperform the existing E-Model by a factor of approximately 29% while using WB-PESQ as a reference model. The models also outperform the E-Model against results from auditory tests. It is also shown that the models outperform the results of multiple linear regressions.

Highlights

  • Telecommunications technologies are evolving at a rapid pace

  • voice over IP (VoIP) SIMULATION The aim of this study is to develop a model for Ie,WB,eff as a function of losses, pauses and jumps

  • In this paper, we have proposed a novel methodology for determining NB/WB equipment impairment factors Ie,WB,eff, for a mixed NB/WB context

Read more

Summary

Introduction

Telecommunications technologies are evolving at a rapid pace. The old Public Switched Telephone Network (PSTN), which normally operates in the narrowband (NB) range (300 – 3400 Hz), is being replaced with wireless and voice over IP (VoIP) systems. For instance, two individual programs are randomly chosen from the underlying population and a new offspring is formed by applying the aforementioned genetic operators. This mating process is repeated till the size of the offspring population becomes equal to that of the parent population. The fitter individuals are retained and the worse are littered This evolutionary process is repeated until a certain user-specified criterion is met. This could be to stop when an individual with desired fitness is achieved or when a certain number of generations have elapsed. Numerous solution representations for GP trees exist, albeit abstract syntax trees are by far the most common choices

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.