Semantic Object Research Articles

Images contain rich information and can induce various emotions in the audience. Image emotion classification aims to identify the emotion categories that images can evoke. It is widely used in mental health assessment, human–computer interaction, etc. There are two main problems in existing image emotion classification methods: (1) Most of them only focus on a single emotion label; (2) The global structural relationship among semantic objects in the image is ignored. Therefore, this paper proposes an Image Emotion classification method based on Multi-Graph Multi-Label learning (IE-MGML). In contrast to the existing approaches, the image is transformed into a graph-based representation by extracting the emotional features of semantic objects and calculating the similarity between the features. The local (semantic objects) features and global structure (relationship among semantic objects) features of the image are fused by the relationship between nodes. Furthermore, the graph representation of an image from the perspective of multiple emotional features is pooled and modeled as a graph bag containing multiple graphs (i.e., multi-graph). In multi-graph learning, the graph kernel directly evaluates a graph-label dependency score to avoid the loss of structural information caused by graph-instance degradation. The bag(image)-label dependency score is obtained by aggregating the graph-label dependency score from different perspectives through the aggregation function. The problem of error accumulation in the learning process is handled by proposing a threshold-based ranking loss objective function. Moreover, the non-convex optimization problem is addressed using a subgradient descent algorithm to deal with the required high-dimensional space computation. Experimental results on three general image emotion datasets show that the proposed method outperforms the state-of-the-art methods.

The Point-of-Interest (POI) transition behaviors could hold absolute sparsity and relative sparsity very differently for different cities. Hence, it is intuitive to transfer knowledge across cities to alleviate those data sparsity and imbalance problems for next POI recommendation. Recently, pre-training over a large-scale dataset has achieved great success in many relevant fields, like computer vision and natural language processing. By devising various self-supervised objectives, pre-training models can produce more robust representations for downstream tasks. However, it is not trivial to directly adopt such existing pre-training techniques for next POI recommendation, due to the lacking of common semantic objects (users or items) across different cities . Thus in this paper, we tackle such a new research problem of pre-training across different cities for next POI recommendation. Specifically, to overcome the key challenge that different cities do not share any common object, we propose a novel pre-training model named CATUS , by transferring the cat egory-level u niversal tran s ition knowledge over different cities. Firstly, we build two self-supervised objectives in CATUS : next category prediction and next POI prediction , to obtain the universal transition-knowledge across different cities and POIs. Then, we design a category-transition oriented sampler on the data level and an implicit and explicit transfer strategy on the encoder level to enhance this transfer process. At the fine-tuning stage, we propose a distance oriented sampler to better align the POI representations into the local context of each city. Extensive experiments on two large datasets consisting of four cities demonstrate the superiority of our proposed CATUS over the state-of-the-art alternatives. The code and datasets are available at https://github.com/NLPWM-WHU/CATUS.

Semantic Object Research Articles

Related Topics

Articles published on Semantic Object

Visual object detection using audio data

Low-Resolution Steel Surface Defects Classification Network Based on Autocorrelation Semantic Enhancement

Recurrent connectivity supports higher-level visual and semantic object representations in the brain

Semantic object processing is modulated by prior scene context

CDS-Net: Cooperative dual-stream network for image manipulation detection

Image emotion multi-label classification based on multi-graph learning

Accurate Perception and Association of Objects for Humanoid Robots Under Dynamic Visual SLAM

Inferring Attention Shifts for Salient Instance Ranking

Spatiotemporal characteristics of the neural representation of event concepts

Types and Type Theories in Natural Language Analysis

Pre-Training Across Different Cities for Next POI Recommendation

Deep Surveillance System

MISL: Multi-grained image-text semantic learning for text-guided image inpainting

Global enhancement network underwater archaeology scene parsing method

S3R: Shape and Semantics-Based Selective Regularization for Explainable Continual Segmentation Across Multiple Sites.

Probabilistic Approach for Road-Users Detection

A Comparative Study of Object Detection using YOLO and SSD Algorithms

Semantic-Aware Contrastive Learning for Multi-Object Medical Image Segmentation.

Class key feature extraction and fusion for 2D medical image segmentation.

“I hate all the children, especially mine”: Applying relational dialectics theory to examine the experiences of formerly childfree regretful parents

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Semantic Object Research Articles

Related Topics

Articles published on Semantic Object

Visual object detection using audio data

Low-Resolution Steel Surface Defects Classification Network Based on Autocorrelation Semantic Enhancement

Recurrent connectivity supports higher-level visual and semantic object representations in the brain

Semantic object processing is modulated by prior scene context

CDS-Net: Cooperative dual-stream network for image manipulation detection

Image emotion multi-label classification based on multi-graph learning

Accurate Perception and Association of Objects for Humanoid Robots Under Dynamic Visual SLAM

Inferring Attention Shifts for Salient Instance Ranking

Spatiotemporal characteristics of the neural representation of event concepts

Types and Type Theories in Natural Language Analysis

Pre-Training Across Different Cities for Next POI Recommendation

Deep Surveillance System

MISL: Multi-grained image-text semantic learning for text-guided image inpainting

Global enhancement network underwater archaeology scene parsing method

S3R: Shape and Semantics-Based Selective Regularization for Explainable Continual Segmentation Across Multiple Sites.

Probabilistic Approach for Road-Users Detection

A Comparative Study of Object Detection using YOLO and SSD Algorithms

Semantic-Aware Contrastive Learning for Multi-Object Medical Image Segmentation.

Class key feature extraction and fusion for 2D medical image segmentation.

“I hate all the children, especially mine”: Applying relational dialectics theory to examine the experiences of formerly childfree regretful parents