Speech recognition apparatus, speech recognition apparatus and program thereof

Osamu Ichikawa

doi:10.1121/1.3274301

Abstract

Provided are a method for canceling background noise from sound sources other than the target-direction sound source, in order to realize highly accurate speech recognition, and a system using the method. The directional characteristics of a microphone array make it possible to approximate the angle-by-angle power distribution for each of the various possible sound source directions as a sum of coefficient multiples of two base forms: the base form angle power distribution of a target sound source, measured beforehand for each base form angle by using a base form sound, and the base form power distribution of a non-directional background sound. Using this approximation, a noise suppression part extracts only the component of the target sound source direction. When the target sound source direction is unknown, a sound source localization part selects, from the base form angle power distributions of the various sound source directions, the distribution that minimizes the approximation residual, and takes its direction as the estimated target sound source direction. Maximum likelihood estimation is then executed using the voice data of the sound source direction component obtained through these processes, together with a voice model obtained by predetermined modeling of the voice data, and speech recognition is carried out based on the resulting estimated value.
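The decomposition and localization steps described in the abstract can be illustrated with a small sketch. It is not the author's implementation: it assumes each power distribution is a plain list of powers over a fixed set of scan angles, uses an ordinary least-squares fit with non-negative coefficients as the approximation criterion, and every name in it (`fit_base_forms`, `localize`, `extract_target`, the dictionary of base forms keyed by direction) is hypothetical.

```python
def _dot(u, v):
    return sum(x * y for x, y in zip(u, v))


def _residual(observed, target_base, background_base, a, b):
    # Squared error of the approximation observed ~ a*target + b*background.
    return sum((o - a * t - b * g) ** 2
               for o, t, g in zip(observed, target_base, background_base))


def fit_base_forms(observed, target_base, background_base):
    """Fit observed ~ a*target_base + b*background_base with a, b >= 0.

    Assumption: least squares is used as the fitting criterion; the
    abstract only says the residual of the approximation is minimized.
    """
    tt = _dot(target_base, target_base)
    gg = _dot(background_base, background_base)
    tg = _dot(target_base, background_base)
    to = _dot(target_base, observed)
    go = _dot(background_base, observed)
    det = tt * gg - tg * tg

    candidates = []
    if abs(det) > 1e-12:
        # Unconstrained solution of the 2x2 normal equations.
        a = (to * gg - go * tg) / det
        b = (go * tt - to * tg) / det
        if a >= 0 and b >= 0:
            candidates.append((a, b))
    if not candidates:
        # Otherwise the constrained optimum lies on a boundary (a=0 or b=0).
        candidates.append((0.0, max(go / gg, 0.0) if gg > 0 else 0.0))
        candidates.append((max(to / tt, 0.0) if tt > 0 else 0.0, 0.0))

    res, a, b = min((_residual(observed, target_base, background_base, a, b), a, b)
                    for a, b in candidates)
    return a, b, res


def localize(observed, target_bases, background_base):
    """Pick the direction whose base form gives the smallest residual."""
    best = None
    for direction, base in target_bases.items():
        a, b, res = fit_base_forms(observed, base, background_base)
        if best is None or res < best[3]:
            best = (direction, a, b, res)
    return best  # (direction, target coefficient, background coefficient, residual)


def extract_target(observed, target_bases, background_base, direction=None):
    """Return the target-direction component, localizing first if needed."""
    if direction is None:
        direction, a, _, _ = localize(observed, target_bases, background_base)
    else:
        a, _, _ = fit_base_forms(observed, target_bases[direction], background_base)
    return direction, [a * t for t in target_bases[direction]]
```

Calling `extract_target(observed, target_bases, background_base)` with no `direction` corresponds to the unknown-direction case, where localization runs first; passing a known direction skips the search and only separates the target component from the background term.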


More From: The Journal of the Acoustical Society of America

Similar Papers

Estimation of sound source number and directions under a multi-source environment
Jwu-Sheng Hu ... Cheng-Kang Wang
01 Oct 2009

Improved DOA Estimation Method for Sound Source Direction Based on Binaural Signals Using an Array of Two Pairs of Microphones
Belgacem Douaer
25 Nov 2021

Phase-locked onset detectors for monaural sound grouping and binaural direction finding
Leslie Smith
The Journal of the Acoustical Society of America | VOL. 111
01 May 2002

Influence of sound source characteristics in determining objective speech intelligibility metrics
Peisheng Zhu ... Jian Kang
Applied Acoustics | VOL. 89
23 Oct 2014
