Abstract

In complex surroundings, robots must recognize the same intention even when it is expressed in different ways. To help robots understand intentions more reliably, this paper proposes a self-tuning multimodal fusion algorithm that is not restricted by the expressions of interacting participants or by the environment. The algorithm can be transferred to different application platforms, and robots can acquire understanding competence and adapt to new tasks by changing the content of the robot knowledge base. In contrast to other multimodal fusion algorithms, this paper adapts the basic structure of feed-forward neural networks to discrete sets, which strengthens the consistency and improves the complementarity between modalities, and allows the self-tuning of the fusion operator and the intention search to run simultaneously. Three modalities are used: speech, gesture, and scene objects, with a single-modal classifier trained separately for each. The method was evaluated in a human-computer interaction experiment on the bionic robot Pepper platform, which showed that it effectively improves the accuracy and robustness of robot intention understanding and reduces the uncertainty of intention judgment compared with single-modal interaction.
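The abstract does not spell out the fusion operator itself, so the following is only a minimal illustrative sketch of one common way to combine separately trained single-modal classifiers over a discrete intention set: weighted late fusion whose weights are tuned from a few labelled interactions. All names here (`fuse`, `tune_weights`, the `INTENTIONS` labels) are hypothetical and stand in for the paper's actual self-tuning procedure.

```python
import numpy as np

INTENTIONS = ["bring_cup", "point_to_object", "stop"]  # hypothetical discrete intention set

def fuse(speech_p, gesture_p, scene_p, w):
    """Weighted late fusion of three per-modality distributions over INTENTIONS.

    speech_p, gesture_p, scene_p: arrays of shape (len(INTENTIONS),), each summing to 1.
    w: non-negative fusion weights of shape (3,), one per modality.
    Returns the fused distribution and the index of the most likely intention.
    """
    stacked = np.stack([speech_p, gesture_p, scene_p])  # (3, n_intentions)
    fused = w @ stacked                                  # weighted sum of distributions
    fused /= fused.sum()                                 # renormalise
    return fused, int(np.argmax(fused))

def tune_weights(examples, lr=0.1, epochs=200):
    """Tune fusion weights on labelled examples by gradient ascent on the
    log-probability of the true intention (a stand-in for the paper's
    self-tuning step, which the abstract does not describe in detail)."""
    w = np.ones(3) / 3.0
    for _ in range(epochs):
        for (speech_p, gesture_p, scene_p), true_idx in examples:
            stacked = np.stack([speech_p, gesture_p, scene_p])
            fused = w @ stacked
            # gradient of log(fused[true_idx] / fused.sum()) with respect to w
            grad = stacked[:, true_idx] / fused[true_idx] - stacked.sum(axis=1) / fused.sum()
            w = np.clip(w + lr * grad, 1e-6, None)
    return w / w.sum()

# Example: fuse one interaction's classifier outputs with fixed weights
speech  = np.array([0.6, 0.3, 0.1])
gesture = np.array([0.2, 0.7, 0.1])
scene   = np.array([0.5, 0.4, 0.1])
fused, idx = fuse(speech, gesture, scene, np.array([0.4, 0.4, 0.2]))
print(INTENTIONS[idx], fused)
```

In such a scheme the three classifiers stay independent, which matches the abstract's statement that the single-modal classifiers are trained separately; only the small set of fusion weights is adjusted when the robot moves to a new task or platform.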
