Multimodal Biometric Human Recognition for Perceptual Human–Computer Interaction

Richard M Jiang,Danny Crookes,Abdul H Sadka

doi:10.1109/tsmcc.2010.2050476

Abstract

In this paper, a novel video-based multimodal biometric verification scheme using the subspace-based low-level feature fusion of face and speech is developed for specific speaker recognition for perceptual human-computer interaction (HCI). In the proposed scheme, human face is tracked and face pose is estimated to weight the detected facelike regions in successive frames, where ill-posed faces and false-positive detections are assigned with lower credit to enhance the accuracy. In the audio modality, mel-frequency cepstral coefficients are extracted for voice-based biometric verification. In the fusion step, features from both modalities are projected into nonlinear Laplacian Eigenmap subspace for multimodal speaker recognition and combined at low level. The proposed approach is tested on the video database of ten human subjects, and the results show that the proposed scheme can attain better accuracy in comparison with the conventional multimodal fusion using latent semantic analysis as well as the single-modality verifications. The experiment on MATLAB shows the potential of the proposed scheme to attain the real-time performance for perceptual HCI applications.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multimodal Biometric Human Recognition for Perceptual Human–Computer Interaction

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews)

Lead the way for us

Journal: IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews)	Publication Date: Nov 1, 2010
Citations: 65

Similar Papers

Multimodal speaker/speech recognition using lip motion, lip texture and audio
H.E Çetingül ... A.M Tekalp
Signal Processing | VOL. 86
H.E Çetingül, et. al.H.E Çetingül ... A.M Tekalp
02 Jun 2006
Signal Processing | VOL. 86

Significance of analytic phase of speech signals in speaker verification
Karthika Vijayan ... K Sri Rama Murty
Speech Communication | VOL. 81
Karthika Vijayan, et. al.Karthika Vijayan ... K Sri Rama Murty
26 Feb 2016
Speech Communication | VOL. 81

Genetic Algorithm for Combined Speaker and Speech Recognition using Deep Neural Networks
Gurpreet Kaur ... Amod Kumar
Journal of Telecommunications and Information Technology | VOL. 2
Gurpreet Kaur, et. al.Gurpreet Kaur ... Amod Kumar
29 Jun 2018
Journal of Telecommunications and Information Technology | VOL. 2

Mel Frequency Cepstral Coefficients (MFCC) based speaker identification in noisy environment using wiener filter
Paresh M Chauhan ... Nikita P Desai
-
Paresh M Chauhan, et. al.Paresh M Chauhan ... Nikita P Desai
01 Mar 2014
01 Mar 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multimodal Biometric Human Recognition for Perceptual Human–Computer Interaction

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews)