Abstract

The authors propose a robust artificial bandwidth extension (ABE) technique to improve narrowband (NB) speech signal quality using an enhanced spectrum envelope and excitation estimation. For envelope estimation, they propose an enhanced envelope estimation method using a deep neural network with multiple layers. For excitation estimation, they use a whitened NB excitation signal that is generated by passing the excitation signal through a whitening filter. An adaptive spectral double shifting method is introduced to obtain an enhanced wideband (WB) excitation signal. The proposed ABE system is applied to the decoded output of an adaptive multi-rate (AMR) codec at 12.2 kbps. They evaluate its performance using log spectral distortion, a WB perceptual evaluation of speech quality, and a formal listening test. The objective and subjective evaluations confirm that the proposed ABE system provides better speech quality than AMR at the same bit rate.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call