Robot audition, the ability of a robot to listen to several things at once with its own “ears,” is crucial to improving interaction and symbiosis between humans and robots. Because robot audition was originally proposed and pioneered by Japanese research groups, this special issue of the Journal of Robotics and Mechatronics on robot audition technologies covers a wide collection of advanced topics studied mainly in Japan. Specifically, two consecutive JSPS Grants-in-Aid for Scientific Research (S) on robot audition (PI: Hiroshi G. Okuno) from 2007 to 2017, the JST Japan-France Research Cooperative Program on binaural listening for humanoids (PIs: Hiroshi G. Okuno and Patrick Danès) from 2009 to 2013, and the ImPACT Tough Robotics Challenge (PM: Prof. Satoshi Tadokoro) on extreme audition for search and rescue robots, running since 2015, have all contributed to the promotion of robot audition research, and most of the papers in this issue are outcomes of these projects.

Robot audition was surveyed in the special issue on robot audition of the Journal of the Robotics Society of Japan, Vol.28, No.1 (2011), and in our IEEE ICASSP-2015 paper. This issue covers the most recent topics in robot audition, except for human-robot interaction, which has been covered by many papers in Advanced Robotics as well as other journals and international conferences, including IEEE IROS.

This issue consists of twenty-three papers accepted through peer review. They are classified into four categories: signal processing, music and pet robots, search and rescue robots, and monitoring animal acoustics in natural habitats.

In signal processing for robot audition, Nakadai, Okuno, et al. report on the HARK open-source software for robot audition; Takeda, et al. develop noise-robust MUSIC-based sound source localization (SSL); and Yalta, et al. use deep learning for SSL. Odo, et al. develop active SSL by moving artificial pinnae, and Youssef, et al. propose binaural SSL for an immobile or mobile talker. Suzuki, Otsuka, et al. evaluate the influence of six impulse-response measurement signals on MUSIC-based SSL; Sekiguchi, et al. derive an optimal allocation of distributed microphone arrays for sound source separation; and Tanabe, et al. develop 3D SSL using a microphone array and LiDAR. Nakadai and Koiwa present audio-visual automatic speech recognition, and Nakadai, Tezuka, et al. suppress ego-noise, that is, noise generated by the robot itself.

In music and pet robots, Ohkita, et al. propose audio-visual beat tracking for a robot dancing with a human dancer, and Tomo, et al. develop a robot that operates a wayang puppet, an Indonesian world cultural heritage, by recognizing emotion in Gamelan music. Suzuki, Takahashi, et al. develop a pet robot that approaches a sound source.

In search and rescue robots, Hoshiba, et al. implement real-time SSL with a microphone array installed on a multicopter UAV, and Ishiki, et al. design a microphone array for multicopters. Ohata, et al. detect sound sources with a multicopter-mounted microphone array, and Sugiyama, et al. identify detected acoustic events by combining signal processing and deep learning. Bando, et al. enhance human voices, both online and offline, for a hose-shaped rescue robot equipped with a microphone array.

In monitoring animal acoustics in natural habitats, Suzuki, Matsubayashi, et al. design and implement HARKBird; Matsubayashi, et al. report on their experience monitoring birds with HARKBird; and Kojima, et al. use a spatial-cue-based probabilistic model to analyze the songs of birds singing in their natural habitat. Aihara, et al. analyze a chorus of frogs with dozens of Firefly sound-to-light conversion devices, whose design and analysis are reported by Mizumoto, et al.

The editors and authors hope that this special issue will promote the further evolution of robot audition technologies across a diversity of applications.