Speech recognition and intelligent translation under multimodal human–computer interaction system

Danhua Huang,Shuaiqiu Xiang

doi:10.1515/jisys-2023-0192

Abstract

Abstract The traditional translation robot is limited to the translation of single-mode text images and text videos, which has the problem of low translation accuracy. Therefore, speech recognition and intelligent translation in multimodal human–computer interaction (HCI) system are proposed. First, the network structure of speech recognition model in multi-channel HCI system is established, and the multi-head self-attention mechanism is constructed. Then, the artificial intelligence voice wake-up function is designed, and a multimodal machine translation model is constructed. On this basis, selective attention is added to obtain visual recognition of perceived text, and the decoder is used for multimodal gating fusion to realize the output of encoder translation results. Experimental results show that this method has high BLUE value and high translation accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speech recognition and intelligent translation under multimodal human–computer interaction system

Abstract

Talk to us

Similar Papers

More From: Journal of Intelligent Systems

Lead the way for us

Journal: Journal of Intelligent Systems	Publication Date: Sep 3, 2024
License type: CC BY 4.0

Similar Papers

Design Optimization of Pressure Sensing Floor for Multimodal Human-Computer Interaction
Sankar Rangarajan ... Stjepan Rajko
-
Sankar Rangarajan, et. al.Sankar Rangarajan ... Stjepan Rajko
01 Oct 2008
01 Oct 2008

Multimodal Human-Computer Interaction Based on Bayesian Classification Algorithm
Wang Xing ... Zuo Tao
-
Wang Xing, et. al.Wang Xing ... Zuo Tao
15 Aug 2022
15 Aug 2022

Multi-Agent Based Approach to Support HCI
Zhen Zhu ... Jing-Yan Wang
-
Zhen Zhu, et. al.Zhen Zhu ... Jing-Yan Wang
01 Jan 2006
01 Jan 2006

A Multimodal Human-Computer Interaction System and Its Application in Smart Learning Environments
Jiyou Jia ... Huixiao Le
-
Jiyou Jia, et. al.Jiyou Jia ... Huixiao Le
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speech recognition and intelligent translation under multimodal human–computer interaction system

Abstract

Talk to us

Similar Papers

More From: Journal of Intelligent Systems