Performance of speaker localization using microphone array

R Visalakshi,S Palanivel,P Dhanalakshmi

doi:10.1007/s10772-016-9341-9

Abstract

Speaker localization is a technique to locate and track an active speaker from multiple acoustic sources using microphone array. Microphone array is used to improve the speech quality of recorded speech signal in meeting room and other places. In this work, the time delay estimation between source and each microphone is calculated using a localization method called time differences of arrival (TDOA). TDOA localization consists of two steps namely (a) a time delay estimator and (b) a localization estimator. For time delay estimation, the generalized cross-correlation using phase transform, the generalized cross correlation using maximum likelihood, linear prediction (LP) residual and the Hilbert envelope of the LP residual are chosen for estimating the location of a person. A new speaker localization algorithm known as group search optimization (GSO) algorithm is proposed. The performance of this algorithm is analyzed and compared with Gauss–Newton nonlinear least square method and genetic algorithm. Experimental results show that the proposed GSO method outperforms the other methods in terms of mean square error, root mean square error, mean absolute error, mean absolute percentage error, euclidean distance and mean absolute relative error.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Performance of speaker localization using microphone array

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology

Lead the way for us

Journal: International Journal of Speech Technology	Publication Date: Apr 25, 2016
Citations: 4

Similar Papers

Neural network-based estimation of lighting condition in indoor environment with improved brain storm algorithm
Sneha Patil ... Ravindra Kharadkar
Journal of Engineering, Design and Technology | VOL. 20
Sneha Patil, et. al.Sneha Patil ... Ravindra Kharadkar
22 Jul 2021
Journal of Engineering, Design and Technology | VOL. 20

Support Vector Regression for Bus Travel Time Prediction Using Wavelet Transform
...
-
, et. al. ...
25 Jun 2019
25 Jun 2019

Artificial Intelligence based accurately load forecasting system to forecast short and medium-term load demands.
Faisal Mehmood Butt ... Kashif Javed Lone
Mathematical Biosciences and Engineering | VOL. 18
Faisal Mehmood Butt, et. al.Faisal Mehmood Butt ... Kashif Javed Lone
14 Dec 2020
Mathematical Biosciences and Engineering | VOL. 18

Comparison of autoregressive integrated moving average model and generalised regression neural network model for prediction of haemorrhagic fever with renal syndrome in China: a time-series study
Ya-Wen Wang ... Yu Jiang
BMJ open | VOL. 9
Ya-Wen Wang, et. al.Ya-Wen Wang ... Yu Jiang
01 Jun 2019
BMJ open | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performance of speaker localization using microphone array

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology