Abstract

A new blind microphone array method to enhance speech signals generated by multiple sources in a noisy environment is proposed. This approach is based on a two-stage scheme. A subband time-delay estimation algorithm is first used to localize the dominant speech sources. The speech enhancement is performed in a second stage, based on the acquired spatial information, by means of a spatially constrained subband beamformer. The robustness of this structure is ensured by the spatial constraint constructed to include the discrepancies in the acoustical environment model as well as errors in the time-delay estimation. Such scheme also allows for an efficient adaptation of the beamformer to speakers movement. The proposed subband approach for time-delay estimation exploits the sparseness of speech signals in the time-frequency domain to localize multiple speakers simultaneously. It also provides means to select the number of target sources. Evaluation in a real environment shows promising results.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.