Semantic Vision Research Articles

Visible Thermal Person Re-Identification (VTReID) is a cross-modality retrieval problem in computer vision. Accurate VTReID is very challenging due to large modality discrepancies. In this work, we design a novel Multi-Patch Matching Network (MPMN) framework to simultaneously mitigate the heterogeneity of coarse-grained and fine-grained visual semantics. For cross-modality matching, we verify that aligning the modality distributions of the original features is likely to suffer from selective alignment behavior, i.e., the alignment focuses only on the easiest dimensions or subspaces. Inspired by adversarial learning, we propose a new Multi-Patch Modality Alignment (MPMA) loss to jointly balance and reduce the modality discrepancies of multi-patch features by mining hard subspaces and abandoning easy subspaces. Since multi-patch features are potentially complementary to each other, the semantic correlations between different patches should be exploited during training. Motivated by knowledge distillation, we put forward a new Cross-Patch Correlation Distillation (CPCD) loss to transfer semantic knowledge across different patches. To balance multi-patch tasks, an effective Patch-Aware Priority Attention (PAPA) method is further introduced to dynamically prioritize hard patch tasks during training. Experiments demonstrate the effectiveness of the proposed methods, which achieve superior performance over state-of-the-art methods on the RegDB and SYSU-MM01 datasets.

The article examines the experience of building a school media environment as a system of self-education and self-development for modern teenagers. The authors present a model for developing teenagers’ creative self-actualization in the media educational environment, a technology for using media culture in the educational environment, and an original diagnostic methodology for determining the degree of development of school students’ creative self-actualization. The methodology assesses independent work carried out by teenagers (the creation of media projects) and evaluates its quality by taking into account the emotional and semantic visualization of information and the symbolism of associative-semantic connections. As a result of the experiment, the authors identified personal characteristics shown by school students while working on media projects and developed a typology of dominant activities comprising four types: analytical, informational, proactive and synthetic. Based on this typology, the authors determined the relevance of and need for differentiating work with teenagers of different ages across various forms of the media educational environment and for developing their individual educational paths. Accordingly, a four-stage mechanism for introducing a teenager into a media educational environment is proposed, which provides for the integration of art into the educational environment. The article presents the results of a pilot project carried out in state budgetary educational institutions in Moscow (School no. 1253, School no. 171 and School no. 89) and in the children's creative center Trubetskoy Estate in Khamovniki. The article is intended for scientists, teachers, pedagogues and methodological experts who deal with media education.

Articles published on Semantic Vision

Learning Dual Encoding Model for Adaptive Visual Understanding in Visual Dialogue.

The Historical Change in the Role of the Two Hungarian Words for Red (Piros and Vörös) and Their Visual Semantics

RETRACTED: Basketball action recognition based on FPGA and particle image

Cortical network responses map onto data-driven features that capture visual semantics of movie fragments

A novel deep quality-aware CNN for image edge smoothening

Real distorted images quality assessment based on multi-layer visual perception mechanism and high-level semantics

B5G and Explainable Deep Learning Assisted Healthcare Vertical at the Edge: COVID-19 Perspective

Interactive Natural Language Grounding via Referring Expression Comprehension and Scene Graph Parsing.

MRA-Net: Improving VQA Via Multi-Modal Relation Attention Network.

Deep Multi-Patch Matching Network for Visible Thermal Person Re-Identification

DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue

Deep hard modality alignment for visible thermal person re-identification

Reasoning on the Relation: Enhancing Visual Representation for Visual Question Answering and Cross-Modal Retrieval

Hand gesture understanding by weakly-supervised fusing shallow/deep image attributes

Classification and Risk-Mapping of River Water Quality in Surabaya with Semantic Visualization

SeDAR: Reading Floorplans Like a Human—Using Deep Learning to Enable Human-Inspired Localisation

Find and Seek

The Technology of Media Culture usage in the Educational Environment: Development of Creative Self-Actualization

Object manipulation with a variable-stiffness robotic mechanism using deep neural networks for visual semantics and load estimation

Synthesis of Image from Text using Generative Adversarial Networks
