Recently, image-text matching based on local region-word semantic alignment has attracted considerable research attention. Fine-grained interplay between the two modalities is typically achieved by aggregating the similarities of aligned region-word pairs. However, most cross-modal matching studies treat these pair similarities equally, without considering their respective importance. Moreover, local alignment methods are prone to global semantic drift because they ignore the overall theme of the image-text pair. In this paper, a novel Dual-View Semantic Inference (DVSI) network is proposed to leverage both local and global semantic matching in a holistic deep framework. For the local view, a region enhancement module is proposed to mine the priorities of different regions in the image, providing the discriminative ability needed to discover latent region-word relationships. For the global view, the overall semantics of the image are summarized for global semantic matching to avoid global semantic drift. The two views are unified for final image-text matching. Extensive experiments on MSCOCO and Flickr30K demonstrate the effectiveness of the proposed DVSI.
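The following is a minimal sketch of the dual-view idea described above, not the paper's actual DVSI implementation: the local view weights region-word similarities by assumed region priorities (a softmax stand-in for the region enhancement module), and the global view compares a summarized image embedding with a sentence embedding. Function names, the mean-pooling summary, and the balancing weight `alpha` are all illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def dual_view_similarity(region_feats, word_feats, sent_feat, alpha=0.5):
    """Illustrative dual-view image-text matching score (not the paper's exact method).

    region_feats: (R, D) image region embeddings
    word_feats:   (T, D) word embeddings
    sent_feat:    (D,)   sentence-level embedding
    alpha:        assumed weight balancing the local and global views
    """
    # Normalize embeddings so dot products act as cosine similarities.
    regions = F.normalize(region_feats, dim=-1)
    words = F.normalize(word_feats, dim=-1)
    sent = F.normalize(sent_feat, dim=-1)

    # ---- Local view: region-word alignment ----
    # Region "priorities" are approximated by a softmax over each region's
    # best-matching word similarity (a placeholder for the region
    # enhancement module described in the abstract).
    sim = regions @ words.t()                 # (R, T) region-word similarities
    best_per_region, _ = sim.max(dim=1)       # best-aligned word per region
    region_weights = torch.softmax(best_per_region, dim=0)
    local_score = (region_weights * best_per_region).sum()

    # ---- Global view: summarized image semantics vs. sentence ----
    # A simple mean over regions serves as the image summary here; the
    # paper's summarization scheme may differ.
    global_image = F.normalize(regions.mean(dim=0), dim=-1)
    global_score = global_image @ sent

    # Unify the two views into a single matching score.
    return alpha * local_score + (1 - alpha) * global_score

# Example usage with random features (36 regions, 12 words, 1024-d embeddings).
score = dual_view_similarity(torch.randn(36, 1024), torch.randn(12, 1024), torch.randn(1024))
```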