Abstract
In this paper, a new audio coding scheme combining the Hilbert transform and the empirical mode decomposition (EMD) is introduced. Based on the EMD, the coding is fully a data-driven approach. Audio signal is first decomposed adaptively, by EMD, into intrinsic oscillatory components called intrinsic mode functions (IMFs). The key idea of this work is to code both instantaneous amplitude (IA) and instantaneous frequency (IF), of the extracted IMFs, calculated using Hilbert transform. Since IA (resp. IF) is strongly correlated, it is encoded via a linear prediction technique. The decoder recovers the original signal by superposition of the demodulated IMFs. The proposed approach is applied to audio signals, and the results are compared to those obtained by advanced audio coding (AAC) and MP3 codecs, and wavelets-based compression. Coding performances are evaluated using the bit rate, objective difference grade (ODG) and noise to mask ratio (NMR) measures. Based on the analyzed audio signals, overall, our coding scheme performs better than wavelet compression, AAC and MP3 codecs. Results also show that this new scheme has good coding performances without significant perceptual distortion, resulting in an ODG in range $$[-1,0]$$ and large negative NMR values.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.