Forward-backward recursive expectation-maximization for concurrent speaker tracking

Yuval Dorfan,Sharon Gannot,Boaz Schwartz

doi:10.1186/s13636-020-00189-x

Yuval Dorfan, Sharon Gannot + Show 1 more

Open Access

https://doi.org/10.1186/s13636-020-00189-x

Copy DOI

Abstract

In this paper, a study addressing the task of tracking multiple concurrent speakers in reverberant conditions is presented. Since both past and future observations can contribute to the current location estimate, we propose a forward-backward approach, which improves tracking accuracy by introducing near-future data to the estimator, in the cost of an additional short latency. Unlike classical target tracking, we apply a non-Bayesian approach, which does not make assumptions with respect to the target trajectories, except for assuming a realistic change in the parameters due to natural behaviour. The proposed method is based on the recursive expectation-maximization (REM) approach. The new method is dubbed forward-backward recursive expectation-maximization (FB-REM). The performance is demonstrated using an experimental study, where the tested scenarios involve both simulated and recorded signals, with typical reverberation levels and multiple moving sources. It is shown that the proposed algorithm outperforms the regular common causal (REM).

Highlights

The task of multiple target tracking has significant importance in civil, military and surveillance applications such as improving beamforming accuracy in speech enhancement applications, e.g. speech separation, indoor robotic assistance, and automatic steering of cameras [1,2,3,4]
We propose a new tracking mechanism and use it to modify the recursive distributed expectationmaximization (RDEM) [51], resulting in the tracking forward-backward recursive expectation-maximization (TFB-REM), which is a non-Bayesian algorithm
4.2 recursive distributed expectation-maximization (RDEM) applied in the forward direction In [44] and [51], the tracking forward-recursive expectation-maximization (TF-REM) was derived for the general algorithm in (11) in detail, and only the resulting formulae are given

Summary

Introduction

The task of multiple target tracking (or dynamic localization) has significant importance in civil, military and surveillance applications such as improving beamforming accuracy in speech enhancement applications, e.g. speech separation, indoor robotic assistance, and automatic steering of cameras [1,2,3,4]. A further study of the recursive expectation-maximization (REM) approach appeared in [42] for the problem of DOA estimation, using TREM and another recursive algorithm suggested by the authors. We propose a new tracking mechanism and use it to modify the recursive distributed expectationmaximization (RDEM) [51], resulting in the tracking forward-backward recursive expectation-maximization (TFB-REM), which is a non-Bayesian algorithm.

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: EURASIP Journal on Audio, Speech, and Music Processing	Publication Date: Jan 9, 2021
Citations: 1	License type: open-access

R Discovery Prime

R Discovery Prime

Forward-backward recursive expectation-maximization for concurrent speaker tracking

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Audio, Speech, and Music Processing

Lead the way for us

Similar Papers

Recursive online EM estimation of mixture autoregressions
Abdelhakim Aknouche
Journal of Statistical Computation and Simulation | VOL. 83
Abdelhakim AknoucheAbdelhakim Aknouche
01 Feb 2013
Journal of Statistical Computation and Simulation | VOL. 83

Auditory inspired methods for localization of multiple concurrent speakers
Tania Habib ... Harald Romsdorfer
Computer Speech & Language | VOL. 27
Tania Habib, et. al.Tania Habib ... Harald Romsdorfer
25 Sep 2012
Computer Speech & Language | VOL. 27

A recursive expectation-maximization algorithm for speaker tracking and separation
Ofer Schwartz ... Sharon Gannot
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2021
Ofer Schwartz, et. al.Ofer Schwartz ... Sharon Gannot
01 Dec 2021
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2021

Recursive EM algorithm with adaptive step size
Pei-Jung Chung ... J.F Bohme
-
Pei-Jung Chung, et. al. Pei-Jung Chung ... J.F Bohme
01 Jan 2003
01 Jan 2003

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Forward-backward recursive expectation-maximization for concurrent speaker tracking

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Audio, Speech, and Music Processing