A Review of Audio-Visual Fusion with Machine Learning

Xiaoyu Song,Qing Wang,Hui Tang,Hong Chen,Yunqiang Chen,Mengxiao Tian

doi:10.1088/1742-6596/1237/2/022144

Xiaoyu Song, Qing Wang + Show 4 more

Open Access

https://doi.org/10.1088/1742-6596/1237/2/022144

Copy DOI

Abstract

For the study of single-modal recognition, for example, the research on speech signals, ECG signals, facial expressions, body postures and other physiological signals have made some progress. However, the diversity of human brain information sources and the uncertainty of single-modal recognition determine that the accuracy of single-modal recognition is not high. Therefore, building a multimodal recognition framework in combination with multiple modalities has become an effective means of improving performance. With the rise of multi-modal machine learning, multi-modal information fusion has become a research hotspot, and audio-visual fusion is the most widely used direction. The audio-visual fusion method has been successfully applied to various problems, such as emotion recognition and multimedia event detection, biometric and speech recognition applications. This paper firstly introduces multimodal machine learning briefly, and then summarizes the development and current situation of audio-visual fusion technology in some major areas, and finally puts forward the prospect for the future.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Physics: Conference Series	Publication Date: Jun 1, 2019
Citations: 10	License type: cc-by

R Discovery Prime

R Discovery Prime

A Review of Audio-Visual Fusion with Machine Learning

Abstract

Talk to us

Similar Papers

More From: Journal of Physics: Conference Series

Lead the way for us

Similar Papers

Research on Multi-modal Emotion Recognition Based on Speech, EEG and ECG Signals
Hui Guo ... Dongmei Shao
-
Hui Guo, et. al.Hui Guo ... Dongmei Shao
01 Jan 2020
01 Jan 2020

Emotion recognition using multi-modal data and machine learning techniques: A tutorial and review
Jianhua Zhang ... Stefano Nichele
Information Fusion | VOL. 59
Jianhua Zhang, et. al.Jianhua Zhang ... Stefano Nichele
31 Jan 2020
Information Fusion | VOL. 59

Multi-Modal Machine Learning in Engineering Design: A Review and Future Directions
Binyang Song ... Rui Zhou
Journal of Computing and Information Science in Engineering | VOL. 24
Binyang Song, et. al.Binyang Song ... Rui Zhou
24 Nov 2023
Journal of Computing and Information Science in Engineering | VOL. 24

ECG-based emotion recognition using random convolutional kernel method
Ancheng Fang ... Peiyu He
Biomedical Signal Processing and Control | VOL. 91
Ancheng Fang, et. al.Ancheng Fang ... Peiyu He
09 Jan 2024
Biomedical Signal Processing and Control | VOL. 91

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Review of Audio-Visual Fusion with Machine Learning

Abstract

Talk to us

Similar Papers

More From: Journal of Physics: Conference Series