Design of Audio-Visual Interface for Aiding Driver’s Voice Commands in Automotive Environment

Kihyeon Kim,Hanseok Ko,Seokyeong Jeong,Junho Park,Changwon Jeon,David K Han

doi:10.1007/978-0-387-79582-9_17

Abstract

This chapter describes an information-modeling and integration of an embedded audio-visual speech recognition system, aimed at improving speech recognition under adverse automobile noisy environment. In particular, we employ lip-reading as an added feature for enhanced speech recognition. Lip motion feature is extracted by active shape models and the corresponding hidden Markov models are constructed for lip-reading . For realizing efficient hidden Markov models, tied-mixture technique is introduced for both visual and acoustical information. It makes the model structure simple and small while maintaining suitable recognition performance. In decoding process, the audio-visual information is integrated into the state output probabilities of hidden Markov model as multistream features . Each stream is weighted according to the signal-to-noise ratio so that the visual information becomes more dominant under adverse noisy environment of an automobile. Representative experimental results demonstrate that the audio-visual speech recognition system achieves promising performance in adverse noisy condition, making it suitable for embedded devices.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Design of Audio-Visual Interface for Aiding Driver’s Voice Commands in Automotive Environment

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

발화구간 검출을 위해 학습된 CNN 기반 입 모양 인식 방법
Yong-Ki Kim ... Jong Gwan Lim
Journal of Digital Convergence | VOL. 14
Yong-Ki Kim, et. al.Yong-Ki Kim ... Jong Gwan Lim
28 Aug 2016
Journal of Digital Convergence | VOL. 14

Analysis of Lip Geometric Features for Audio-Visual Speech Recognition
M.N Kaynak ... A.D Cheok
IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans | VOL. 34
M.N Kaynak, et. al.M.N Kaynak ... A.D Cheok
01 Jul 2004
IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans | VOL. 34

Enhancing quality and accuracy of speech recognition system by using multimodal audio-visual speech signal
Eslam E El Maghraby ... Amr M Gody
-
Eslam E El Maghraby, et. al.Eslam E El Maghraby ... Amr M Gody
01 Dec 2016
01 Dec 2016

Measuring the effect of high-speed video data on the audio-visual speech recognition accuracy
D V Ivanko ... D A Ryumin
Information and Control Systems | VOL. -
D V Ivanko, et. al.D V Ivanko ... D A Ryumin
19 Apr 2019
Information and Control Systems | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Design of Audio-Visual Interface for Aiding Driver’s Voice Commands in Automotive Environment

Abstract

Talk to us

Similar Papers