Abstract

Head-related transfer function (HRTF) plays an important role in three-dimensional spatial sound system. However, the direct application of a large amount of original HRTF data would involve a great deal of computational burden, especially for high-spatial-resolution individual HRTF. To address this problem, we propose a novel compression method (called TT-Tucker) combining Tucker model with tensor train decomposition based on a 5-order HRTF tensor model developed in subspaces of an ear, subject, azimuth, elevation, and frequency. Lots of HRTF data can be decomposed into several low-parametric factors representing the key spectrum information of HRTF by capturing the hidden interactions among different subspaces. To evaluate the reconstruction performance, the numerical experiments were conducted on the CIPIC HRTF database. Under the same compression ratio of nearly 98%, the results suggest that the proposed method has a better performance in spectral distortion and signal-to-distortion ratio than that of the usual tensor method and the standard method principal component analysis (PCA). Moreover, the subjective listening test shows that the TT-Tucker method performs better in that, the compressed and reconstructed HRTF is closer to the original HRTF in the sound localization similarity.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call