This study explores the potential of RGB image data for forest fire detection using deep learning models, evaluating their advantages and limitations and discussing their possible integration within a multi-modal data context. The research introduces a uniquely comprehensive wildfire dataset that captures a broad array of environmental conditions, forest types, geographical regions, and confounding elements, with the aim of reducing the high false alarm rates of current fire detection systems. To ensure data integrity, only public-domain images were included, and a detailed description of the dataset’s attributes, URL sources, and image resolutions is provided. The study also introduces a novel multi-task learning approach that integrates multi-class confounding elements within the framework. A pioneering strategy in forest fire detection, this method aims to enhance the model’s discriminatory ability and decrease false positives. When tested on the wildfire dataset, the multi-task learning approach achieved significantly superior performance on key metrics and lower false alarm rates than traditional binary classification methods, underscoring the effectiveness of the proposed methodology and its potential to address confounding elements. Recognizing the need for practical solutions, the study stresses the importance of future work on increasing the representativeness of training and testing datasets. The evolving, publicly available wildfire dataset is anticipated to inspire innovative solutions, marking a substantial contribution to the field.
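The multi-task formulation described above, which pairs binary fire detection with multi-class recognition of confounding elements, can be sketched as a joint loss over two prediction heads. The sketch below is illustrative only: the function names, the confounder categories in the comments, and the mixing coefficient `alpha` are assumptions, not details taken from the study.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def multi_task_loss(fire_logit, fire_label, conf_logits, conf_label, alpha=0.5):
    """Joint loss for a two-headed model sharing one backbone:
    - binary cross-entropy on the fire/no-fire head, plus
    - categorical cross-entropy on the confounding-element head
      (e.g. fog, sun glare, smoke-like clouds -- hypothetical classes).
    `alpha` is an assumed weighting between the two tasks."""
    # Binary cross-entropy on the sigmoid of the fire logit
    p_fire = 1.0 / (1.0 + math.exp(-fire_logit))
    bce = -(fire_label * math.log(p_fire) + (1 - fire_label) * math.log(1.0 - p_fire))
    # Categorical cross-entropy over the confounder classes
    probs = softmax(conf_logits)
    ce = -math.log(probs[conf_label])
    return alpha * bce + (1.0 - alpha) * ce
```

Training on the auxiliary confounder task forces the shared backbone to learn features that separate true fire cues from visually similar distractors, which is the mechanism by which such a formulation can lower false positives relative to a single binary head.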