Abstract
Traditional methods often only use monaural masking models to decorrelate input signals for stereo acoustic echo cancellation. Whereas, it seems more reasonable to use binaural masking models for the following two reasons. First, stereo signals are heard by two ears rather than just one. Second, psychoacoustic researchers have already shown that there are obvious masking level differences between binaural masking models and monaural masking models. By studying binaural masking level difference models, we first introduce a simplified binaural masking model for stereo acoustic echo cancellation. Considering that the interaural time difference is dominant at low frequencies (⩽1.5 kHz) and the interaural level difference is a major cue at higher frequencies, we propose to use different signal decorrelation schemes at these two frequency bands. In the low-frequency band, a pitch-driven sinusoidal injection scheme is proposed to maintain the interaural time difference, where the amount of injection is determined by the proposed binaural masking model. In the high-frequency band, a modified sinusoidal phase modulation scheme is applied to make a trade-off between preserving the interaural level difference and decorrelating the stereophonic input signals. Assessment results show that the proposed method can effectively improve the non-unique problem and retain good speech quality.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.