Abstract

A microphone array system for speech data input must include a robust algorithm for determining the location of the desired talker. Here, a two-stage talker location algorithm based on filtered cross-correlation is introduced. At each stage, maximization of a sum-of-independent-cross-correlations functional is used to establish talker position. Suitable accuracy is obtained at low cost by using multirate interpolation. Experimental evidence has shown that a two-stage procedure improves performance; in the first stage, closely spaced microphone pairs are used to determine the x location of the talker ( x 0 ), and more broadly spaced pairs are used in the second stage to find y 0 for a restricted range of x. Substantive results, based on real data, are presented to indicate performance. An efficient, global, non-linear optimization technique, stochastic region contraction (SRC), is briefly introduced and is shown to make this algorithm feasible in real time.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.