Abstract
Head-related transfer functions (HRTFs) play a crucial role in virtual acoustics and spatial audio applications. However, obtaining personalized, high-resolution HRTFs remains challenging due to the time-consuming and expensive measurements. Recently, deep learning-based methods have shown promise in predicting high-resolution HRTFs from sparse measurements. Nevertheless, most of these methods often treat HRTF upsampling as an image super-resolution task, which overlooks critical spatial information and acoustic principles, leading to overfitting on datasets. This paper proposes a physics-informed spherical convolutional neural network for HRTF upsampling. First, spherical convolutional layers are used to capture spatial features on the sphere, allowing efficient handling of spherical sampled HRTF data. Second, in the upsampling process, the proposed method incorporates the Helmholtz equation as a constraint, adhering tothe physics of the acoustic system. This method ensures the generation of physically feasible HRTF interpolations, thus promoting better generalization.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.