Vision-guided robot hearing

Xavier Alameda-Pineda,Radu Horaud

doi:10.1177/0278364914548050

Abstract

Natural human–robot interaction (HRI) in complex and unpredictable environments is important with many potential applications. While vision-based HRI has been thoroughly investigated, robot hearing and audio-based HRI are emerging research topics in robotics. In typical real-world scenarios, humans are at some distance from the robot and, hence, the sensory (microphone) data are strongly impaired by background noise, reverberations and competing auditory sources. In this context, the detection and localization of speakers plays a key role that enables several tasks, such as improving the signal-to-noise ratio for speech recognition, speaker recognition, speaker tracking, etc. In this paper we address the problem of how to detect and localize people that are both seen and heard. We introduce a hybrid deterministic/probabilistic model. The deterministic component allows us to map 3D visual data onto a 1D auditory space. The probabilistic component of the model enables the visual features to guide the grouping of the auditory features in order to form audiovisual (AV) objects. The proposed model and the associated algorithms are implemented in real-time (17 FPS) using a stereoscopic camera pair and two microphones embedded into the head of the humanoid robot NAO. We perform experiments with (i) synthetic data, (ii) publicly available data gathered with an audiovisual robotic head, and (iii) data acquired using the NAO robot. The results validate the approach and are an encouragement to investigate how vision and hearing could be further combined for robust HRI.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Vision-guided robot hearing

Abstract

Talk to us

Similar Papers

More From: The International Journal of Robotics Research

Lead the way for us

Journal: The International Journal of Robotics Research	Publication Date: Oct 27, 2014
Citations: 31

Similar Papers

First Attempt of Gender-free Speech Style Transfer for Genderless Robot
Chuang Yu ... Adriana Tapus
-
Chuang Yu, et. al.Chuang Yu ... Adriana Tapus
07 Mar 2022
07 Mar 2022

Calibrating intuitive and natural human–robot interaction and performance for power-assisted heavy object manipulation using cognition-based intelligent admittance control schemes
S M Mizanoor Rahman ... Ryojun Ikeura
International Journal of Advanced Robotic Systems | VOL. 15
S M Mizanoor Rahman, et. al.S M Mizanoor Rahman ... Ryojun Ikeura
01 Jul 2018
International Journal of Advanced Robotic Systems | VOL. 15

Natural hand posture recognition based on Zernike moments and hierarchical classifier
Lizhong Gu ... Jianbo Su
-
Lizhong Gu, et. al. Lizhong Gu ... Jianbo Su
01 May 2008
01 May 2008

Empathetic Robot With Transformer-Based Dialogue Agent
Baijun Xie ... Chung Hyuk Park
-
Baijun Xie, et. al.Baijun Xie ... Chung Hyuk Park
12 Jul 2021
12 Jul 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Vision-guided robot hearing

Abstract

Talk to us

Similar Papers

More From: The International Journal of Robotics Research