Finding Images by Dialoguing with Image

Lejian Ren, Shuicheng Yan, Jizhong Han, Han Huang, Bo Li, Si Liu

doi:10.1145/3343031.3350907

Abstract

Image retrieval in complicated scenes is a challenging task that requires a comprehensive understanding of an image. In this paper, we propose a scene-graph-based image retrieval framework that combines scene graph generation with image retrieval and fine-tunes the search results via a dialogue mechanism. Specifically, we propose an image-retrieval-oriented scene graph generation model that takes an image and a text describing the image as inputs. The additional text input is used to control the generated scene graph: it provides information for a newly introduced attribute head to better predict attributes, and it also helps construct an adjacency matrix. A Graph Convolutional Network is then used to gather information among nodes for precise relation estimation. Moreover, the scene graph can be modified by changing the text. Our approach achieves state-of-the-art performance in both scene-graph-based image retrieval and scene graph generation on the Visual Genome dataset.
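
To make the adjacency matrix and graph convolution steps concrete, here is a minimal sketch of one generic graph-convolution layer applied to a small hand-built graph. It is an illustration under stated assumptions, not the model described in the abstract: the helper names (`adjacency_from_pairs`, `matmul`, `gcn_layer`), the example objects, the feature values, and the use of the common symmetric normalization are all hypothetical choices, and the abstract does not specify how the text is turned into an adjacency matrix.

```python
import math

def adjacency_from_pairs(num_nodes, pairs):
    """Symmetric adjacency matrix with self-loops, built from (i, j) index pairs.

    Assumption: the pairs stand in for object pairs that a text description
    links together; the real construction is not given in the abstract.
    """
    adj = [[1.0 if i == j else 0.0 for j in range(num_nodes)]
           for i in range(num_nodes)]
    for i, j in pairs:
        adj[i][j] = 1.0
        adj[j][i] = 1.0
    return adj

def matmul(a, b):
    """Plain matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def gcn_layer(adj, features, weight):
    """One propagation step: ReLU(D^-1/2 A D^-1/2 X W)."""
    n = len(adj)
    deg = [sum(row) for row in adj]  # node degrees, self-loops included
    norm = [[adj[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]
    hidden = matmul(matmul(norm, features), weight)
    return [[max(v, 0.0) for v in row] for row in hidden]

# Hypothetical toy graph: node 0 = "man", node 1 = "hat", node 2 = "horse".
# The text is assumed to link man-hat and man-horse, but not hat-horse.
adj = adjacency_from_pairs(3, [(0, 1), (0, 2)])
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # made-up node features
weight = [[1.0, 0.0], [0.0, 1.0]]                # identity weights for clarity
out = gcn_layer(adj, features, weight)
```

After one step, each node's output mixes its own features with those of its linked neighbors, so changing the pairs (for example, by editing the text) changes which nodes exchange information.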

