Abstract

This paper proposes a new hybrid, multi-stage compression method for musical instrument sound signals, based on the discrete wavelet packet transform (DWPT) and the modified discrete cosine transform (MDCT) and using efficient psychoacoustic models. The primary objective is lossy but perceptually transparent compression of the sounds of various Indian musical instruments. In the first stage, the original sound signal is decomposed into wavelet packets using an optimal wavelet basis. The wavelet packets are then passed through a psychoacoustic model in the wavelet domain to determine the auditory masking level, which is used to threshold the wavelet coefficients in their respective sub-bands. The resulting coefficients are partitioned into overlapping frames so that each block contains 512 samples, and each block is windowed with a Hann (Hanning) window with 1/16 frame overlap. In the second stage, the MDCT is applied to each block to de-correlate the spectral information. Spectral redundancy is removed by compressing the subordinate components more heavily than the dominant components. The resulting signal is quantized with a variable number of bits, determined from the results of a psychoacoustic model in the FFT domain. The technique thus exploits the key strengths of both DWPT and MDCT.
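
The framing and MDCT stage described above can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes NumPy, reads "1/16 frame overlap" as 32 shared samples between consecutive 512-sample blocks, and evaluates the textbook MDCT definition directly, so each 512-sample block yields 256 coefficients. The function names `frame_signal` and `mdct` and the test tone are hypothetical. The wavelet packet decomposition, psychoacoustic thresholding, and quantization stages are omitted.

```python
import numpy as np

def frame_signal(x, frame_len=512, overlap_frac=1 / 16):
    """Split x into overlapping frames and apply a Hann window to each.

    Assumption: consecutive frames share frame_len * overlap_frac samples.
    """
    if len(x) < frame_len:
        raise ValueError("signal is shorter than one frame")
    overlap = int(frame_len * overlap_frac)   # 32 samples for 512 and 1/16
    hop = frame_len - overlap                 # 480 samples between frame starts
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack([x[i * hop : i * hop + frame_len] for i in range(n_frames)])
    return frames * np.hanning(frame_len)     # window broadcast over all frames

def mdct(block):
    """Direct (O(N^2)) MDCT of one block: 2N input samples -> N coefficients."""
    two_n = len(block)
    n = two_n // 2
    ns = np.arange(two_n)
    ks = np.arange(n)
    # Textbook MDCT kernel: cos(pi/N * (n + 1/2 + N/2) * (k + 1/2))
    basis = np.cos(np.pi / n * (ns[None, :] + 0.5 + n / 2) * (ks[:, None] + 0.5))
    return basis @ block

# Usage on a synthetic tone (a stand-in for thresholded wavelet coefficients).
x = np.sin(2 * np.pi * 440 * np.arange(2000) / 44100)
frames = frame_signal(x)
coeffs = np.array([mdct(f) for f in frames])
```

Note that the MDCT's usual time-domain alias cancellation relies on 50% overlap between blocks. With the smaller overlap assumed here, the sketch shows only the forward analysis step and makes no claim about reconstruction.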

