Interface for Barge-in Free Spoken Dialogue System Based on Sound Field Reproduction and Microphone Array

Shigeki Miyabe, Kiyohiro Shikano, Yosuke Tatekura, Hiroshi Saruwatari, Yoichi Hinamoto

doi:10.1155/2007/57470

Abstract

A barge-in free spoken dialogue interface using sound field control and a microphone array is proposed. A conventional spoken dialogue system that uses an acoustic echo canceller must estimate the room transfer function, especially when the transfer function is changed by various interferences. However, the estimation is difficult when the user and the system speak simultaneously. To resolve this problem, we propose a sound field control technique that prevents the response sound from being observed. Combined with a microphone array, the proposed method achieves high elimination performance with no adaptive process. The efficacy of the proposed interface is ascertained in experiments on sound elimination and speech recognition.

Highlights

For hands-free realization of smooth communication with a spoken dialogue system, it should be guaranteed that a user’s command utterance reaches the system clearly.
In order to achieve robustness, we propose a new interface for a barge-in free spoken dialogue system that combines multichannel sound field control and a microphone array.
Increasing both the number of microphone elements and the number of loudspeakers improves the performance of the proposed method and can make the control robust against fluctuation of the room transfer functions.
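
The page does not give the filter design itself, so the following is only a minimal sketch of the general idea behind multichannel sound field control: choose loudspeaker weights so that the system's response sound cancels at the microphone positions while still being reproduced at the user's ears. It works at a single frequency bin and solves a least-squares problem with a pseudo-inverse. The transfer matrix `G` is random stand-in data, and the counts `n_speakers`, `n_mics`, and `n_ears` are assumptions for illustration, not values from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

n_speakers = 4  # loudspeakers driven by the system (assumed count)
n_mics = 2      # microphone elements where silence is wanted (assumed count)
n_ears = 2      # control points at the user's ears (assumed count)

# Hypothetical complex room transfer functions at one frequency bin.
# Rows are control points (microphones first, then ears), columns are loudspeakers.
shape = (n_mics + n_ears, n_speakers)
G = rng.standard_normal(shape) + 1j * rng.standard_normal(shape)

# Desired response: zero at the microphones, unit gain at the ears.
d = np.concatenate([np.zeros(n_mics), np.ones(n_ears)]).astype(complex)

# Least-squares loudspeaker weights for this frequency bin.
h = np.linalg.pinv(G) @ d

response = G @ h
mic_leak = np.linalg.norm(response[:n_mics])         # residual response sound at the microphones
ear_error = np.linalg.norm(response[n_mics:] - 1.0)  # reproduction error at the ears

# A small change in the room transfer functions breaks the exact cancellation,
# because the weights were designed for the unperturbed room.
G_perturbed = G + 0.05 * (rng.standard_normal(shape) + 1j * rng.standard_normal(shape))
leak_perturbed = np.linalg.norm((G_perturbed @ h)[:n_mics])

print(f"microphone leak (designed room):  {mic_leak:.2e}")
print(f"ear reproduction error:           {ear_error:.2e}")
print(f"microphone leak (perturbed room): {leak_perturbed:.2e}")
```

In this toy setting the cancellation is essentially exact for the room the weights were designed for and degrades once the transfer functions fluctuate, which is the sensitivity the highlights refer to. The sketch does not model the microphone-array processing that the proposed interface combines with sound field control.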

Summary

Introduction

For smooth hands-free communication with a spoken dialogue system, the user’s command utterance must reach the system clearly. A user might interrupt the system’s sound response to utter a command, or start speaking before the response has finished. In such a situation, the sound that the system plays to the user is observed as an acoustic echo at the microphone that captures the user’s speech, and it degrades recognition of the user’s command. A conventional acoustic echo canceller removes this echo by estimating the room transfer function. In the barge-in state (the “double-talk problem”), however, the user’s speech is mixed into the observed signal, acts as noise to the estimation, and causes the estimation to fail. In this case, the adaptation process should be stopped by some type of double-talk detection technique [8, 9]. With adaptation stopped, the canceller cannot track the room: when the room transfer function changes in the barge-in state, the elimination performance degrades.
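
The double-talk failure described above can be illustrated with a small simulation. This is a sketch under stated assumptions, not the method of the paper or of the cited works: it uses a normalized LMS (NLMS) adaptive filter as a stand-in for the echo canceller's estimator, white noise as a stand-in for both the system response and the user's speech, and a randomly generated echo path. The function name `nlms_misalignment` and all parameter values are hypothetical.

```python
import numpy as np

def nlms_misalignment(far_end, mic, true_path, mu=0.5, eps=1e-6):
    """Adapt an NLMS filter and return the final normalized misalignment
    ||w - true_path|| / ||true_path|| of the estimated echo path."""
    n_taps = len(true_path)
    w = np.zeros(n_taps)      # current estimate of the echo path
    x_buf = np.zeros(n_taps)  # most recent far-end samples, newest first
    for n in range(len(far_end)):
        x_buf = np.roll(x_buf, 1)
        x_buf[0] = far_end[n]
        e = mic[n] - w @ x_buf                        # residual after echo removal
        w += mu * e * x_buf / (x_buf @ x_buf + eps)   # normalized LMS update
    return np.linalg.norm(w - true_path) / np.linalg.norm(true_path)

rng = np.random.default_rng(1)
n_samples, n_taps = 4000, 32

# Hypothetical echo path: random taps with an exponentially decaying envelope.
true_path = rng.standard_normal(n_taps) * np.exp(-0.2 * np.arange(n_taps))

far_end = rng.standard_normal(n_samples)            # system response (white-noise stand-in)
echo = np.convolve(far_end, true_path)[:n_samples]  # echo observed at the microphone
user = rng.standard_normal(n_samples)               # user's speech (white-noise stand-in)

single_talk = nlms_misalignment(far_end, echo, true_path)         # only the system speaks
double_talk = nlms_misalignment(far_end, echo + user, true_path)  # user barges in

print(f"misalignment, system only: {single_talk:.2e}")
print(f"misalignment, double-talk: {double_talk:.2e}")
```

When only the system speaks, the estimate converges to the echo path. When the user's signal is present throughout, it acts as noise on the error term and the estimate stays far from the true path, which is why adaptation is normally frozen during double-talk.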

Journal: EURASIP Journal on Advances in Signal Processing
Publication Date: Jan 8, 2007
Citations: 17
License type: cc-by

Similar Papers

Interface for barge-in free spoken dialogue system based on sound field control and microphone array
H Yoichi ... K Shikano
06 Apr 2003

Interface for Barge-in Free Spoken Dialogue System Combining Adaptive Sound Field Control and Microphone Array
T Asai
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences | VOL. E88-A
01 Jun 2005

A Bayesian spherical harmonics source radiation model for sound field control.
Diego Caviedes-Nozal ... Franz M Heuchel
The Journal of the Acoustical Society of America | VOL. 146
01 Nov 2019

Down-mixing of multi-channel audio for sound field reproduction based on spatial covariance
Yoshinori Takahashi ... Akio Ando
Applied Acoustics | VOL. 71
30 Aug 2010
