RBF neural network mouth tracking for audio-visual speech recognition system

Lim Ee Hui Lim Ee Hui,K.M Tse,K.P Seng

doi:10.1109/tencon.2004.1414362

Abstract

A great interest in the research of audio-visual speech recognition (AVSR) systems is driven by the increase in the number of multimedia applications that require robust speech recognition systems. The use of visual features in AVSR is justified by both the audio and visual modality of the speech generation and the need for features that are invariant to acoustic noise perturbation. The performance of the AVSR system relies on a robust set of visual features obtained from the accurate detection and tracking of the mouth region. Therefore the mouth tracking plays a major role in AVSR systems. This paper presents an improvement version of mouth tracking technique using radial basis function neural network (RBF NN) with its applications to AVSR systems. A modified extended Kalman filter (EKF) is used to adjust the parameters of the RBF NN. Simulation results have revealed good performance of the proposed method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

RBF neural network mouth tracking for audio-visual speech recognition system

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Speaker independent audio-visual continuous speech recognition
Luhong Liang ... Xiaoxing Liu
-
Luhong Liang, et. al. Luhong Liang ... Xiaoxing Liu
07 Nov 2002
07 Nov 2002

Audio-Visual Recognition of Overlapped Speech for the LRS2 Dataset
Jianwei Yu ... Xunying Liu
-
Jianwei Yu, et. al.Jianwei Yu ... Xunying Liu
01 May 2020
01 May 2020

발화구간 검출을 위해 학습된 CNN 기반 입 모양 인식 방법
Yong-Ki Kim ... Mi-Hye Kim
Journal of Digital Convergence | VOL. 14
Yong-Ki Kim, et. al.Yong-Ki Kim ... Mi-Hye Kim
28 Aug 2016
Journal of Digital Convergence | VOL. 14

Automatic Segmented-Syllable and Deep Learning-Based Indonesian Audiovisual Speech Recognition
Suyanto Suyanto ... Kurniawan Nur Ramadhani
-
Suyanto Suyanto, et. al.Suyanto Suyanto ... Kurniawan Nur Ramadhani
14 Dec 2020
14 Dec 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

RBF neural network mouth tracking for audio-visual speech recognition system

Abstract

Talk to us

Similar Papers