Visual Entity Linking via Multi-modal Learning

Qiushuo Zheng, Meng Wang, Guilin Qi, Hao Wen

doi:10.1162/dint_a_00114

Abstract

Existing visual scene understanding methods mainly focus on identifying coarse-grained concepts about visual objects and their relationships, largely neglecting fine-grained scene understanding. In fact, many data-driven applications on the Web (e.g., news reading and e-shopping) require accurately recognizing much finer-grained concepts as entities and properly linking them to a knowledge graph (KG), which can take their performance to the next level. In light of this, we identify a new research task: visual entity linking for fine-grained scene understanding. To accomplish the task, we first extract features of candidate entities from different modalities, i.e., visual features, textual features, and KG features. Then, we design a deep modal-attention neural network-based learning-to-rank method that aggregates all features and maps visual objects to entities in the KG. Extensive experimental results on a newly constructed dataset show that our proposed method is effective, significantly improving accuracy from 66.46% to 83.16% compared with baselines.
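The pipeline described in the abstract (per-modality features, attention-based aggregation, then ranking of candidate entities) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration only: the abstract does not specify the network architecture, the form of the attention, or the training loss, so the function names, the dot-product attention, the linear scorer, and the pairwise hinge loss are hypothetical stand-ins, not the authors' method.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def fuse(modal_features, attn_vector):
    """Aggregate per-modality vectors with attention weights.

    modal_features maps a modality name ("visual", "textual", "kg") to a
    feature vector; all vectors share the dimension of attn_vector.
    Assumption: attention logits are plain dot products with attn_vector.
    """
    names = sorted(modal_features)
    weights = softmax([dot(attn_vector, modal_features[n]) for n in names])
    fused = [
        sum(w * modal_features[n][i] for w, n in zip(weights, names))
        for i in range(len(attn_vector))
    ]
    return fused, dict(zip(names, weights))


def rank_candidates(candidates, attn_vector, score_vector):
    """Score each candidate entity for one visual object, best first.

    Assumption: the ranking score is a linear function of the fused vector.
    """
    scored = []
    for entity, feats in candidates.items():
        fused, _ = fuse(feats, attn_vector)
        scored.append((dot(score_vector, fused), entity))
    scored.sort(reverse=True)
    return [entity for _, entity in scored]


def pairwise_hinge(pos_score, neg_score, margin=1.0):
    """A common learning-to-rank loss (assumed here, not from the paper):
    zero once the correct entity outscores a wrong one by `margin`."""
    return max(0.0, margin - pos_score + neg_score)


# Toy usage with made-up 2-dimensional features for two candidate entities.
toy_candidates = {
    "entity_a": {"visual": [1.0, 2.0], "textual": [0.0, 0.0], "kg": [0.0, 0.0]},
    "entity_b": {"visual": [0.0, 0.0], "textual": [0.0, 0.0], "kg": [0.0, 0.0]},
}
ranking = rank_candidates(toy_candidates, attn_vector=[1.0, 0.0], score_vector=[0.0, 1.0])
```

In a trained model, `attn_vector` and `score_vector` would be learned parameters (here they are fixed by hand), and the loss would be minimized over pairs of correct and incorrect candidate entities.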


Journal: Data Intelligence
Publication Date: Feb 3, 2022
Citations: 4
License type: CC BY 4.0


Similar Papers

Knowledge graph entity typing via learning connecting embeddings
Yu Zhao ... Fuji Ren
Knowledge-Based Systems | VOL. 196
25 Mar 2020

Connecting Embeddings for Knowledge Graph Entity Typing
Yu Zhao ... Anxiang Zhang
01 Jan 2020

Cross-modal Knowledge Transfer
Fabian Both ... Steffen Thoma
04 Dec 2017

Research Progress of Knowledge Graph Embedding
Chengyuan Duan ... Hongliang You
01 Jan 2020
