Abstract

Beamforming with multiple microphones is essential for automatic speech recognition (ASR) in earbuds, cell phones, and smart speakers. Although fixed delay-and-sum (DAS) beamforming is simple to implement, it only suppresses noise from a fixed direction of arrival (DoA) [1]; hence, it is ineffective under real-world, time-varying noise conditions. Reference [2] implements ultra-low-power keyword spotting (KWS) with noise suppression, but its lack of an ADC and of beamforming limits practical application. Adaptive beamforming (ABF), on the other hand, actively adjusts its nulls to suppress varying noise sources. ABF with a trained DNN is promising [3], but it requires extensive training data and consumes high power, making it unsuitable for battery-operated systems. Conventional ABF [4-5] (Fig. 32.5.1) adaptively reduces noise and interference in the output of a fixed DAS beamformer. Although conventional ABF is effective and compact, it is hampered by: 1) high DSP power consumption due to a high ADC sampling rate and the need for complex calculations, especially in the blocking matrix (BM); 2) severe signal distortion caused by target-signal direction errors in DAS; and 3) a worst-case input-SNR design that incurs high ADC and DSP power regardless of actual signal conditions.
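
To make the fixed DAS beamformer discussed above concrete, the following is a minimal sketch, not the implementation from this work or from the cited references. It assumes two microphones, integer-sample steering delays, and synthetic sinusoids; the names `delay_and_sum`, `target`, `interferer`, and all numeric values are hypothetical and chosen only for illustration.

```python
import math


def delay_and_sum(channels, delays):
    """Advance each channel by its steering delay (in samples) and average.

    channels: list of equal-length sample lists, one per microphone.
    delays:   assumed arrival delay of the look direction at each microphone,
              in whole samples (a real array derives these from its geometry).
    """
    n = len(channels[0])
    out = [0.0] * n
    for samples, delay in zip(channels, delays):
        for t in range(n):
            src = t + delay  # undo the propagation delay for the look direction
            if 0 <= src < n:
                out[t] += samples[src]
    return [v / len(channels) for v in out]


N = 64            # number of samples (illustrative)
TARGET_DELAY = 2  # target reaches mic 1 two samples after mic 0 (assumed)


def target(t):
    # Hypothetical target signal: sinusoid with a 16-sample period.
    return math.sin(2 * math.pi * t / 16)


def interferer(t):
    # Hypothetical interferer: 4-sample period, arriving at both mics at once.
    return math.sin(2 * math.pi * t / 4)


mic0 = [target(t) + interferer(t) for t in range(N)]
mic1 = [target(t - TARGET_DELAY) + interferer(t) for t in range(N)]

# Steered at the target: target copies align; interferer copies end up
# half a period apart and cancel.
steered = delay_and_sum([mic0, mic1], [0, TARGET_DELAY])

# Steered at the interferer's direction instead: the interferer passes intact.
unsteered = delay_and_sum([mic0, mic1], [0, 0])

# The last TARGET_DELAY outputs lack mic 1's contribution, so skip them.
valid = range(N - TARGET_DELAY)
err_steered = max(abs(steered[t] - target(t)) for t in valid)
err_unsteered = max(abs(unsteered[t] - target(t)) for t in valid)
print(f"max error, steered at target:   {err_steered:.3e}")
print(f"max error, steered elsewhere:   {err_unsteered:.3e}")
```

In this sketch the interferer cancels only because its period and direction happen to fall on a null of the fixed steering. A different frequency or arrival direction would leak through, which is the limitation of fixed DAS that motivates adaptive beamforming.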

