Abstract
In this paper, a new hybrid multi-stage musical instrument sound signal compression method, based on DWPT and MDCT using efficient psychoacoustic models is proposed. The primary objective is to perform lossy and perceptually transparent compression on the sounds of various Indian musical instruments. Firstly, the original sound signal is decomposed into wavelet packets using an optimal wavelet basis. Wavelet packets are then run through psychoacoustic model in wavelet domain to determine auditory masking level for thresholding, which in turn is used to perform thresholding of wavelet coefficients in respective sub-bands. Further, audio signal coefficients are partitioned into frames which overlap in such a way that each block gets 512 samples and windowed using a hanning window with 1/16 frame overlap. Secondly, MDCT is applied to each block to de-correlate the spectral information. Removal of spectral redundancy is achieved by compressing the subordinate components more than the dominant components. The resulting signal is quantized with variable number of bits, which are determined based on the results of the psychoacoustic model in FFT domain. This technique provides an efficient way to exploit key strengths of both DWPT and MDCT.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.