Abstract
This paper describes a speaker-independent isolated word recognition system that accepts telephone line speech. The recognition method, named selective weighted matching (SWM), uses a weighted distance measure. The input speech signal is frequency-analyzed every 10 ms by a filter bank. Individual glottal characteristics are normalized frame by frame using a least-squares-fit line of the speech spectrum. Each reference pattern has a specific region in the time-frequency domain, and in the matching process for that region the weighted distance is computed under a predetermined condition. In a computer simulation with telephone line speech, recognition accuracy exceeded 96% for 12 words (digits and two command words in Japanese) spoken by 130 talkers. The same result was obtained in a recognition test of the prototype machine.
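The two processing steps named above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the distance metric, the weight values, or the predetermined matching condition, so the city-block distance, the weight array, and the function names `remove_spectral_tilt` and `weighted_distance` are all assumptions made for illustration.

```python
import numpy as np


def remove_spectral_tilt(frame_db):
    """Subtract the least-squares-fit line from one frame of filter-bank outputs.

    frame_db: 1-D sequence of channel levels (assumed to be in dB).
    """
    frame_db = np.asarray(frame_db, dtype=float)
    channels = np.arange(frame_db.size)
    # Degree-1 least-squares fit across the filter-bank channels.
    slope, intercept = np.polyfit(channels, frame_db, 1)
    return frame_db - (slope * channels + intercept)


def weighted_distance(input_pattern, reference, weights):
    """Weighted city-block distance between two time-frequency patterns.

    Cells with zero weight lie outside the reference pattern's region and
    do not contribute (an assumed way of modelling the "specific region").
    """
    diff = np.abs(np.asarray(input_pattern, dtype=float)
                  - np.asarray(reference, dtype=float))
    weights = np.asarray(weights, dtype=float)
    return float(np.sum(weights * diff) / np.sum(weights))
```

Under these assumptions, each 10 ms frame would be passed through `remove_spectral_tilt` before matching, and an input word would be assigned to the reference pattern with the smallest `weighted_distance`.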