Abstract

To address the low efficiency and poor robustness of single-modal speech emotion recognition, this paper employs a multimodal fusion mechanism that combines speech and visual information across modalities to build an audio-visual speech recognition (AVSR) system. The results show that the modal attention mechanism can automatically adjust to a more stable and reliable state according to the quality of each single-modality signal, making audio-visual multimodal perception accurate and recognition efficient.
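The abstract does not give implementation details, but the modal attention idea it describes can be illustrated with a minimal sketch: a scalar attention score is computed for each modality's feature vector, normalized with a softmax, and used to weight that modality's contribution to the fused representation, so a degraded signal receives less weight. All names and parameters below (modality_attention_fusion, w, b, the feature dimension) are hypothetical illustrations, not the authors' code.

```python
import numpy as np

def modality_attention_fusion(audio_feat: np.ndarray,
                              visual_feat: np.ndarray,
                              w: np.ndarray,
                              b: float) -> np.ndarray:
    """Fuse audio and visual feature vectors via modality attention.

    A degraded modality (e.g. noisy audio) tends to receive a lower
    attention score, so the fused vector leans on the cleaner signal.
    `w` and `b` stand in for learned scoring parameters.
    """
    feats = np.stack([audio_feat, visual_feat])      # (2, d): one row per modality
    scores = feats @ w + b                           # (2,): one scalar score per modality
    alphas = np.exp(scores - scores.max())           # softmax over the two modalities
    alphas /= alphas.sum()
    return (alphas[:, None] * feats).sum(axis=0)     # attention-weighted fusion, shape (d,)

# Usage: 128-d features with random scoring parameters, for illustration only
rng = np.random.default_rng(0)
fused = modality_attention_fusion(rng.normal(size=128),
                                  rng.normal(size=128),
                                  rng.normal(size=128), 0.0)
```

In a trained system the scoring parameters would be learned jointly with the recognizer, which is what lets the attention weights track signal quality automatically.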
