Abstract

ITU-T P.862 - “Perceptual Evaluation of Speech Quality (PESQ)” is well known as an intrusive objective speech quality assessment method. Some reports have found that the PESQ time alignment mechanism fails to estimate delay where signals with high packet loss rate and dynamic time processing are present. A new time-alignment algorithm to improve the PESQ accuracy for time-scale modified voice transmission is suggested here. In the propose model, the time alignment of reference and degraded speech is estimated using Dynamic Time- Warping (DTW) in contrast to correlation and splitting methods used in the standard PESQ. Comparative results versus subjective Mean Opinion Score (MOS) show improvement in cases where dynamic time processing of signals is present.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call