Abstract

A sound source separation algorithm based on the spectral amplitudes of 2-channel signals has been developed for the up-mixing playback of 2-channel stereo. Short-term Fourier transforms (STFT) of the signals on the left and right channels are first calculated. The coefficients of the discrete Fourier transform (DFT) are used to calculate the ratio of the spectral amplitudes of the left and right channels, which is termed the channel level difference (CLD). The DFT coefficients are then divided into multiple groups on the basis of the CLD, with each group representing a separated sound source. The signal-to-distortion ratio (SDR)is used to evaluate the signal separation performance. It was found that a rough estimate of the CLD threshold yielding the best SDR could be obtained by cross-correlating the separated sounds. For playback on a headset, each separated signal is convoluted with head-related transfer functions (HRTF) that represent the direction of that particular sound source. Subjective listening tests showed that the sound synthesized by this method is more realistic than that synthesized with HRTFs that represent only left and right speakers.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.