Multimodal Sentiment Research Articles

This study aims to improve the application of robots in museum settings, enhance their service capabilities, counter tourists' perception of the museum environment, manual explanations, and services as dull, and enrich the exhibition experience. A sentiment analysis method for humanoid robots in museums is proposed, informed by a study of how artificial intelligence (AI) technology is transforming museums and of the function and significance of museums in history education. First, the function of museums in history education and the role of AI in constructing intelligent museums are described. Second, a scenario model of museum visitors is established based on multimodal sentiment analysis of speech and emotion. An uncertainty reasoning method for robot service tasks based on a Multi-Entity Bayesian Network (MEBN) is also proposed. Finally, the proposed model is validated experimentally. The results show that, in recognizing the Arousal and Valence dimensions, the Kalman filter achieves the higher Concordance Correlation Coefficient (CCC) values: 0.703 for the Arousal dimension and 0.766 for the Valence dimension. In addition, the services that tourists want vary with their emotional state at different points in the tour. From time t1 to time t2, the proportion of tourists who want to hear explanations of cultural relics dropped by 11.5%, while the proportion who want tea service increased by 24%. This indicates that when the Kalman filter algorithm performs continuous emotion recognition with multimodal fusion, the final emotion recognition accuracy is higher, and that emotion analysis can help humanoid robots become more intelligent and humanized. The proposed multimodal sentiment analysis and the MEBN-based uncertainty reasoning method for robot service tasks not only broaden the practical application of intelligent robots in human–computer interaction but also have research significance for the innovative development of museum history education.
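
For readers unfamiliar with the CCC metric reported above, the following is a minimal sketch of how such a score can be computed. It assumes the abstract's CCC refers to the standard concordance correlation coefficient (Lin's CCC) used in continuous Arousal/Valence recognition; the function name `concordance_cc` and the sample values are hypothetical and are not taken from the article.

```python
from statistics import fmean


def concordance_cc(pred, gold):
    """Concordance correlation coefficient between two equal-length sequences.

    Assumed definition (Lin's CCC, population statistics):
        CCC = 2 * cov(pred, gold) / (var(pred) + var(gold) + (mean_pred - mean_gold)^2)
    """
    if len(pred) != len(gold) or len(pred) < 2:
        raise ValueError("pred and gold must have the same length (at least 2)")

    mean_p = fmean(pred)
    mean_g = fmean(gold)

    # Population variance and covariance (divide by n, not n - 1).
    var_p = fmean([(p - mean_p) ** 2 for p in pred])
    var_g = fmean([(g - mean_g) ** 2 for g in gold])
    cov = fmean([(p - mean_p) * (g - mean_g) for p, g in zip(pred, gold)])

    denom = var_p + var_g + (mean_p - mean_g) ** 2
    if denom == 0:
        # Both sequences are the same constant value: treat as perfect agreement.
        return 1.0
    return 2 * cov / denom


# Hypothetical per-frame arousal values, for illustration only.
gold_arousal = [0.1, 0.3, 0.5, 0.4, 0.2]
pred_arousal = [0.2, 0.3, 0.4, 0.4, 0.3]
score = concordance_cc(pred_arousal, gold_arousal)
```

Unlike plain correlation, this score also penalizes a constant offset between prediction and annotation, which is why it is often preferred for continuous emotion traces.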

The recent boom in artificial intelligence (AI) applications, e.g., affective robots, human-machine interfaces, and autonomous vehicles, has produced a great number of multi-modal records of human communication. Such data often carry users' latent subjective attitudes and opinions, which provide a practical and feasible path for connecting human emotion with intelligent services. Sentiment and emotion analysis of multi-modal records is therefore of great value for improving the intelligence of affective services. However, finding an optimal way to learn representations of people's sentiments and emotions remains difficult, since both involve subtle mental activity. Many approaches have been published, but most of them mine sentiment and emotion insufficiently because they treat sentiment analysis and emotion recognition as two separate tasks. The interaction between the two has been neglected, which limits the efficiency of sentiment and emotion representation learning. In this work, emotion is seen as the external expression of sentiment, while sentiment is the essential nature of emotion. We thus argue that the two are strongly related, with the judgment of one helping the decision of the other. The key challenges are multi-modal fused representation and the interaction between sentiment and emotion. To address them, we design an external-knowledge-enhanced multi-task representation learning network, termed KAMT. Its major elements are two attention mechanisms, inter-modal attention and inter-task attention, and an external knowledge augmentation layer. The external knowledge augmentation layer extracts vectors for the participant's gender, age, and occupation, and for overall color or shape. Inter-modal attention captures effective multi-modal fused features. Inter-task attention models the correlation between sentiment analysis and emotion classification. Experiments on three widely used datasets demonstrate the effectiveness of the KAMT model.
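
The abstract above does not specify how inter-modal attention is computed, so the following is only a generic sketch of the idea, assuming plain scaled dot-product attention in which features from one modality (here, text) attend over features from another (here, audio). The function names and toy feature vectors are hypothetical and do not reproduce the KAMT implementation.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_modal_attention(queries, keys, values):
    """Scaled dot-product attention across two modalities.

    queries: feature vectors from the attending modality (e.g. text tokens)
    keys, values: feature vectors from the attended modality (e.g. audio frames)
    Returns one fused vector per query, each a weighted mix of `values`.
    """
    dim = len(keys[0])
    out_dim = len(values[0])
    fused = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(feature size).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(dim) for k in keys
        ]
        weights = softmax(scores)
        # Weighted sum of the value vectors.
        fused.append(
            [sum(w * v[j] for w, v in zip(weights, values)) for j in range(out_dim)]
        )
    return fused


# Toy example: two text-token vectors attend over two audio-frame vectors.
text_feats = [[1.0, 0.0], [0.0, 0.0]]
audio_keys = [[1.0, 0.0], [0.0, 1.0]]
audio_vals = [[2.0, 0.0], [0.0, 4.0]]
fused_feats = cross_modal_attention(text_feats, audio_keys, audio_vals)
```

An inter-task variant could follow the same pattern, with sentiment-task features attending over emotion-task features, though the abstract gives no detail on that design.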

Articles published on Multimodal Sentiment

Correlations Between Positive or Negative Utterances and Basic Acoustic Features of Voice: a Preliminary Analysis

Transfer-based adaptive tree for multimodal sentiment analysis based on user latent aspects

PS-Mixer: A Polar-Vector and Strength-Vector Mixer Model for Multimodal Sentiment Analysis

TETFN: A text enhanced transformer fusion network for multimodal sentiment analysis

Joint multimodal sentiment analysis based on information relevance

StyleBERT: Text-audio sentiment analysis with Bi-directional Style Enhancement

Multimodal Sentiment Analysis of Online Product Information Based on Text Mining Under the Influence of Social Media

Video-Based Cross-Modal Auxiliary Network for Multimodal Sentiment Analysis

Visual Enhancement Capsule Network for Aspect-based Multimodal Sentiment Analysis

Multimodal Sentiment Analysis: A Systematic review of History, Datasets, Multimodal Fusion Methods, Applications, Challenges and Future Directions

Learning modality-fused representation based on transformer for emotion analysis

AOBERT: All-modalities-in-One BERT for multimodal sentiment analysis

The Application of Interactive Humanoid Robots in the History Education of Museums Under Artificial Intelligence

TEDT: Transformer-Based Encoding–Decoding Translation Network for Multimodal Sentiment Analysis

Heterogeneous graph convolution based on In-domain Self-supervision for Multimodal Sentiment Analysis

Excavating multimodal correlation for representation learning

Impact of Annotator Demographics on Sentiment Dataset Labeling

Modality-invariant temporal representation learning for multimodal sentiment classification

Affective Interaction: Attentive Representation Learning for Multi-Modal Sentiment Classification

MALO-LSTM: Multimodal Sentiment Analysis Using Modified Ant Lion Optimization with Long Short Term Memory Network
