Abstract

Many machine learning tasks require dealing with graph data, and among them scene graph generation is a challenging one that calls for the potential of graph neural networks. In this paper, we present a definition of the graph neural network (GNN) consisting of node, edge, and global attributes, together with their corresponding update and aggregate functions. Based on this definition, we propose a GNN realization called Graph-LSTM and apply it to scene graph generation. The model first extracts item features from the image as the initial states of a node-LSTM representing subjects/objects and an edge-LSTM representing predicates. The two LSTMs update their states through LSTM timesteps and aggregate information via message passing, repeating the update-aggregate cycle until convergence. Meanwhile, the tag feature, i.e., the predicted probability distribution over the image's semantic concepts, is fed to the LSTMs through a semantic compositional network (SCN). The SCN-LSTM is trained in an ensemble style, which allows the tag feature to serve as the global attribute providing context information to all individual nodes and edges. The LSTMs' final states are passed to inference modules that generate the (subject, predicate, object) triplets of the scene graph. Experimental results show that Graph-LSTM outperforms the Message Passing and attention Graph Convolutional Network methods, demonstrating the effectiveness of the proposed scheme.
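To make the update-aggregate cycle concrete, the following is a minimal sketch of how a node-LSTM and an edge-LSTM could alternate LSTM timesteps with message passing over a graph of detected objects. All tensor dimensions, layer names, the mean/concatenation aggregation scheme, and the fixed number of steps are illustrative assumptions, not the paper's exact implementation (the SCN tag-feature pathway is omitted for brevity).

```python
# Hypothetical sketch of the Graph-LSTM update-aggregate loop.
import torch
import torch.nn as nn

class GraphLSTMSketch(nn.Module):
    def __init__(self, feat_dim=512, hidden_dim=512, num_steps=3):
        super().__init__()
        self.num_steps = num_steps
        # node-LSTM holds subject/object states, edge-LSTM holds predicate states
        self.node_lstm = nn.LSTMCell(feat_dim, hidden_dim)
        self.edge_lstm = nn.LSTMCell(feat_dim, hidden_dim)
        # linear maps that turn neighbouring states into messages (assumed form)
        self.edge_to_node = nn.Linear(hidden_dim, feat_dim)
        self.node_to_edge = nn.Linear(2 * hidden_dim, feat_dim)

    def forward(self, node_feats, edge_feats, edge_index):
        # node_feats: (N, feat_dim) region features of detected objects
        # edge_feats: (E, feat_dim) features of subject-object pairs
        # edge_index: (E, 2) subject/object node indices of each edge
        n_h = torch.zeros(node_feats.size(0), self.node_lstm.hidden_size)
        n_c = torch.zeros_like(n_h)
        e_h = torch.zeros(edge_feats.size(0), self.edge_lstm.hidden_size)
        e_c = torch.zeros_like(e_h)
        node_in, edge_in = node_feats, edge_feats
        for _ in range(self.num_steps):
            # update: one LSTM timestep per node and per edge
            n_h, n_c = self.node_lstm(node_in, (n_h, n_c))
            e_h, e_c = self.edge_lstm(edge_in, (e_h, e_c))
            # aggregate: edge states -> nodes (mean over incident edges)
            msg_to_node = torch.zeros_like(n_h)
            count = torch.zeros(n_h.size(0), 1)
            for e, (s, o) in enumerate(edge_index.tolist()):
                msg_to_node[s] += e_h[e]; count[s] += 1
                msg_to_node[o] += e_h[e]; count[o] += 1
            node_in = self.edge_to_node(msg_to_node / count.clamp(min=1))
            # aggregate: node states -> edges (concatenate subject and object)
            edge_in = self.node_to_edge(
                torch.cat([n_h[edge_index[:, 0]], n_h[edge_index[:, 1]]], dim=1))
        # final states would be passed to the subject/object and predicate classifiers
        return n_h, e_h
```

In this sketch the "aggregate" step simply averages incident edge states into each node and concatenates endpoint node states for each edge; the paper's actual message-passing and convergence criteria may differ.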
