Audio-visual integration in multimodal communication

Tsuhan Chen Tsuhan Chen,R.R Rao

doi:10.1109/5.664274

Audio-visual integration in multimodal communication

Tsuhan Chen Tsuhan Chen, R.R Rao

https://doi.org/10.1109/5.664274

Copy DOI

Journal: Proceedings of the IRE	Publication Date: May 1, 1998
Citations: 280

Affiliation: AT&T (United States), Carnegie Mellon University

#Multimodal Communication #Automated Lip Reading + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We review recent research that examines audio-visual integration in multimodal communication. The topics include bimodality in human speech, human and automated lip reading, facial animation, lip synchronization, joint audio-video coding, and bimodal speaker verification. We also study the enabling technologies for these research topics, including automatic facial-feature tracking and audio-to-visual mapping. Recent progress in audio-visual research shows that joint processing of audio and video provides advantages that are not available when the audio and video are processed independently.

Full Text