Multi-party human-robot interaction with distant-talking speech recognition

Randy Gomez, Tatsuya Kawahara, Kazuhiro Nakadai, Keisuke Nakamura

doi:10.1145/2157689.2157835

Abstract

Speech is one of the most natural media for human communication, which makes it vital to human-robot interaction. In the real environments where robots are deployed, distant-talking speech recognition is difficult to achieve because of reverberation. Reverberation degrades speech recognition and understanding and hinders seamless human-robot interaction. To mitigate this problem, traditional speech enhancement techniques optimized for human perception are adopted to achieve robustness in human-robot interaction. However, humans and machines perceive speech differently: an improvement in speech recognition performance may not automatically translate to an improvement in the human-robot interaction experience (as perceived by the users). In this paper, we propose a method for optimizing speech enhancement techniques specifically to improve automatic speech recognition (ASR), with emphasis on the human-robot interaction experience. Experimental results using real reverberant data from a multi-party conversation show that the proposed method improved the human-robot interaction experience in severely reverberant conditions compared to the traditional techniques.

Similar Papers

The effects of changes in head angle on auditory and visual input for omnidirectional and directional microphone hearing aids.
Paula Henry ... Todd Ricketts
American journal of audiology | VOL. 12
01 Jun 2003

Combined speech enhancement and auditory modelling for robust distributed speech recognition
Ronan Flynn ... Edward Jones
Speech Communication | VOL. 50
20 May 2008

Integration of articulatory knowledge and voicing features based on DNN/HMM for Mandarin speech recognition
Ying-Wei Tan ... Wei Jiang
01 Jul 2015

Automated Speech Recognition in Complex Systems: Review and Analysis of Factors Affecting Performance
Robert W Root ... Michael E Mccauley
Proceedings of the Human Factors Society Annual Meeting | VOL. 27
01 Oct 1983
