Visual relation detection based on pyramidal convolution and gated recurrent unit

Guanhua Zhong,Huang Wei

doi:10.1088/1742-6596/2303/1/012064

Visual relation detection based on pyramidal convolution and gated recurrent unit

Guanhua Zhong, Huang Wei

https://doi.org/10.1088/1742-6596/2303/1/012064

Copy DOI

Journal: Journal of Physics: Conference Series	Publication Date: Jul 1, 2022
License type: cc-by

Affiliation: Wuhan Institute of Technology

#Visual Relation Detection #Recurrent Neural Network + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Visual relationship detection, not only needs to identify the targets in the image and their position, but also to identify the interrelationship between the targets, is a comprehensive task including object detection, positioning, image classification. In this paper, pyramid convolutional networks are embedded in the image feature extraction module, and three convolutional kernels of different scales and depths are used to increase the sensing field of the network and ensure multi-scale feature extraction. At the same time, a recurrent neural network is introduced, which combines the background information in the image to improve the accuracy of semantic relationship detection.

Full Text