Multimodal Task Research Articles

Abstract Abnormalities in the neural mechanisms of the stress response have the potential to serve as a transdiagnostic marker for stress-related disorders. This potential is rooted in the highly individual and dynamic stress response, which poses a challenge to classical experimentally informed models that focus on specific phases, conditions, or stimuli during a stress task. Here, we integrate individual heart rates (HR) as an immanent index of the stress-response and combine these with functional magnetic resonance imaging (fMRI) data. In this study, 83 healthy participants completed a multimodal psychosocial imaging stress task comprising three different task phases (PreStress, Stress, and PostStress), with each phase consisting of five 60 s blocks of active task interleaved with 40 s of rest, and simultaneous recordings of pulse plethysmography. Participants were asked to solve mental calculations and were exposed to negative social feedback during the Stress phase. We estimated a general linear model (GLM) with individual heart rates averaged per active block as a single parametric modulator of the task regressor across all 15 active blocks irrespective of the task phase. Results revealed a negative correlation between HR and activation in the bilateral amygdala and anterior hippocampus as well as deactivation in the default mode network. Positive correlations with HR were detected in the bilateral insular cortex, bilateral angular gyrus, and parts of the inferior and superior parietal lobes. In summary, our findings emphasize the utility of integrating the commonly assessed cardiovascular stress response (here: HR) as an immediate index of the participant’s stress status. We conclude that by such integration, brain regions involved in regulating the acute stress response, such as the anterior hippocampus and amygdala, are detected more sensitively by tracking the individual’s “stress wave” rather than treating every experimental block uniformly. Our approach may serve as a complementary analysis to the task-regressor based model.

Read full abstract

The misuse of social platforms and the difficulty in regulating post contents have culminated in a surge of negative sentiments, sarcasms, and the rampant spread of fake news. In response, Multimodal sentiment analysis, sarcasm detection and fake news detection based on image and text have attracted considerable attention recently. Due to that these areas share semantic and sentiment features and confront related fusion challenges in deciphering complex human expressions across different modalities, integrating these multimodal classification tasks that share commonalities across different scenarios into a unified framework is expected to simplify research in sentiment analysis, and enhance the effectiveness of classification tasks involving both semantic and sentiment modeling. Therefore, we consider integral components of a broader spectrum of research known as multimodal affective analysis towards semantics and sentiment, and propose a novel multimodal dual perception fusion framework (MDPF). Specifically, MDPF contains three core procedures: (1) Generating bootstrapping language-image Knowledge to enrich origin modality space, and utilizing cross-modal contrastive learning for aligning text and image modalities to understand underlying semantics and interactions. (2) Designing dynamic connective mechanism to adaptively match image-text pairs and jointly employing gaussian-weighted distribution to intensify semantic sequences. (3) Constructing a cross-modal graph to preserve the structured information of both image and text data and share information between modalities, while introducing sentiment knowledge to refine the edge weights of the graph to capture cross-modal sentiment interaction. We evaluate MDPF on three publicly available datasets across three tasks, and the empirical results demonstrate the superiority of our proposed model.

Read full abstract

Multimodal Task Research Articles

Related Topics

Articles published on Multimodal Task

Backpropagation-Free Multi-modal On-Device Model Adaptation via Cloud-Device Collaboration

Capturing inter-individual variability in stress dynamics with heart rate traces reveals activity in the bilateral hippocampus, amygdala, and insula

Integrating Neural-Symbolic Reasoning With Variational Causal Inference Network for Explanatory Visual Question Answering.

Vision + X: A Survey on Multimodal Learning in the Light of Data.

Backdoor Attacks and Defenses Targeting Multi-Domain AI Models: A Comprehensive Review

NeuroMechFly v2: simulating embodied sensorimotor control in adult Drosophila.

LRCN: Layer-residual Co-Attention Networks for visual question answering

A Multimodal conceptual framework to achieve automated software evolution for context-rich intelligent applications

Robot imitation from multimodal observation with unsupervised cross-modal representation

ME[formula omitted]A: A Multimodal Entity Entailment framework for multimodal Entity Alignment

Black-box backdoor attack with everyday physical objects in mobile crowdsourcing

GeoPredict-LLM: Intelligent tunnel advanced geological prediction by reprogramming large language models

WaterGPT: Training a Large Language Model to Become a Hydrology Expert

Multimodal dual perception fusion framework for multimodal affective analysis

Estrogen predicts multimodal emotion recognition accuracy across the menstrual cycle.

A novel mobile robot with origami wheels designed for navigating sandy terrains

Identifying Implicit Social Biases in Vision-Language Models

Wearable EEG-Based Brain-Computer Interface for Stress Monitoring.

A unified multimodal classification framework based on deep metric learning

Building a Multimodal Dataset of Academic Paper for Keyword Extraction

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Multimodal Task Research Articles

Related Topics

Articles published on Multimodal Task

Backpropagation-Free Multi-modal On-Device Model Adaptation via Cloud-Device Collaboration

Capturing inter-individual variability in stress dynamics with heart rate traces reveals activity in the bilateral hippocampus, amygdala, and insula

Integrating Neural-Symbolic Reasoning With Variational Causal Inference Network for Explanatory Visual Question Answering.

Vision + X: A Survey on Multimodal Learning in the Light of Data.

Backdoor Attacks and Defenses Targeting Multi-Domain AI Models: A Comprehensive Review

NeuroMechFly v2: simulating embodied sensorimotor control in adult Drosophila.

LRCN: Layer-residual Co-Attention Networks for visual question answering

A Multimodal conceptual framework to achieve automated software evolution for context-rich intelligent applications

Robot imitation from multimodal observation with unsupervised cross-modal representation

ME[formula omitted]A: A Multimodal Entity Entailment framework for multimodal Entity Alignment

Black-box backdoor attack with everyday physical objects in mobile crowdsourcing

GeoPredict-LLM: Intelligent tunnel advanced geological prediction by reprogramming large language models

WaterGPT: Training a Large Language Model to Become a Hydrology Expert

Multimodal dual perception fusion framework for multimodal affective analysis

Estrogen predicts multimodal emotion recognition accuracy across the menstrual cycle.

A novel mobile robot with origami wheels designed for navigating sandy terrains

Identifying Implicit Social Biases in Vision-Language Models

Wearable EEG-Based Brain-Computer Interface for Stress Monitoring.

A unified multimodal classification framework based on deep metric learning

Building a Multimodal Dataset of Academic Paper for Keyword Extraction