Abstract

A novel speaker-independent speech recognition method, which registers speech uttered by a small number of speakers into a dictionary as model speech is presented. It is based on the hypothesis that movement of the vocal tract differs little among individuals when the same word is spoken. This idea leads to the conclusion that dynamic characteristics extracted from a small number of speaker's utterances are effective for speaker-independent speech recognition. A speech recognition method using model utterances in which similarity values of an input word are calculated by matching a small number of speakers' utterances with phoneme templates for speaker-independent recognition is described. When tested with 212 Japanese words, a word recognition rate of 95.8% was obtained. The evaluation of the noise robustness is also reported.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">&gt;</ETX>

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call