A multi-channel corpus for distant-speech interaction in presence of known interferences

Erich Zwyssig,Piergiorgio Svaizer,Maurizio Omologo,Mirco Ravanelli

doi:10.1109/icassp.2015.7178818

Abstract

This paper describes a new corpus of multi-channel audio data designed to study and develop distant-speech recognition systems able to cope with known interfering sounds propagating in an environment. The corpus consists of both real and simulated signals and of a corresponding detailed annotation. An extensive set of speech recognition experiments was conducted using three different Acoustic Echo Cancellation (AEC) techniques to establish baseline results for future reference. The AEC techniques were applied both to single distant microphone input signals and beamformed signals generated using two state-of-the-art beamforming techniques. We show that the speech recognition performance using the different techniques is comparable for both the simulated and real data, demonstrating the usefulness of this corpus for speech research. We also show that a significant improvement in speech recognition performance can be obtained by combining state-of-the-art AEC and beamforming techniques, compared to using a single distant microphone input.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A multi-channel corpus for distant-speech interaction in presence of known interferences

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

The effects of changes in head angle on auditory and visual input for omnidirectional and directional microphone hearing aids.
Paula Henry ... Todd Ricketts
American journal of audiology | VOL. 12
Paula Henry, et. al.Paula Henry ... Todd Ricketts
01 Jun 2003
American journal of audiology | VOL. 12

Combined speech enhancement and auditory modelling for robust distributed speech recognition
Ronan Flynn ... Edward Jones
Speech Communication | VOL. 50
Ronan Flynn, et. al.Ronan Flynn ... Edward Jones
20 May 2008
Speech Communication | VOL. 50

Integration of articulatory knowledge and voicing features based on DNN/HMM for Mandarin speech recognition
Ying-Wei Tan ... Wei Jiang
-
Ying-Wei Tan, et. al. Ying-Wei Tan ... Wei Jiang
01 Jul 2015
01 Jul 2015

Automated Speech Recognition in Complex Systems: Review and Analysis of Factors Affecting Performance
Robert W Root ... Michael E Mccauley
Proceedings of the Human Factors Society Annual Meeting | VOL. 27
Robert W Root, et. al.Robert W Root ... Michael E Mccauley
01 Oct 1983
Proceedings of the Human Factors Society Annual Meeting | VOL. 27

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A multi-channel corpus for distant-speech interaction in presence of known interferences

Abstract

Talk to us

Similar Papers