This paper proposes a novel distributed particle filter with sampling-based consensus density fusion for speaker tracking in distributed microphone networks (DMNs). Firstly, to combat the reverberation and ambient noise, multi-hypothesis likelihood model is applied to describe the likelihood distribution in the state-space model of speaker tracking problem. Then, the sequential importance resampling (SIR) is performed on each node to obtain the local posteriors of the state vector of the speaker. Since the non-Gaussian noise and the reverberation may make local posteriors presenting non-Gaussian and multi-modal characters, in this paper, the local posterior densities are approximated by Gaussian mixtures whose parameters are further transmitted to neighboring nodes to complete the distributed fusion based on the consensus method. We investigate a novel adaptive multiple importance sampling (AMIS) algorithm combining parallel tempering Monte Carlo (PTMC) samplers to achieve the sampling-based consensus density fusion of the non-Gaussian local posteriors and estimate the global posterior locally on each node. At last, the optimal global estimation of the speaker state vector could be calculated by using the estimated global posterior on each node in DMNs. Simulation and real-world recording experiment results show that the proposed speaker tracking method exhibits satisfactory tracking performance under reverberant and noisy environments. Moreover, it is easy and feasible to extend the proposed tracking algorithm to solve the multi-speaker tracking problem.
Read full abstract