Abstract
Audio-Visual Speech Recognition (AVSR) using acoustic and visual signals of speech have received attention recently because of its robustness in noisy environments. An important issue in decision fusion AVSR systems is the determination of appropriate integration weight for better performance. A new Genetic Algorithm (GA) based scheme to obtain an appropriate integration weight is proposed here. The performance of the proposed scheme is demonstrated for commonly used mobile functions isolated word recognition via multi-speaker database experiment. The results show that the proposed scheme improves robust recognition accuracy over conventional unimodal systems and other related bimodal systems, namely, Reliability ratio and Neural Network based AVSR systems.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have