A Survey on Graph Neural Networks and Graph Transformers in Computer Vision: A Task-Oriented Perspective.

Chaoqi Chen, Yushuang Wu, Qiyuan Dai, Hong-Yu Zhou, Mutian Xu, Sibei Yang, Xiaoguang Han, Yizhou Yu

doi:10.1109/tpami.2024.3445463

Abstract

Graph Neural Networks (GNNs) have gained momentum in graph representation learning and boosted the state of the art in a variety of areas, such as data mining (e.g., social network analysis and recommender systems), computer vision (e.g., object detection and point cloud learning), and natural language processing (e.g., relation extraction and sequence learning), to name a few. With the emergence of Transformers in natural language processing and computer vision, graph Transformers embed a graph structure into the Transformer architecture to overcome the limitations of local neighborhood aggregation while avoiding strict structural inductive biases. In this paper, we present a comprehensive review of GNNs and graph Transformers in computer vision from a task-oriented perspective. Specifically, we divide their applications in computer vision into five categories according to the modality of input data, i.e., 2D natural images, videos, 3D data, vision + language, and medical images. In each category, we further divide the applications according to a set of vision tasks. Such a task-oriented taxonomy allows us to examine how each task is tackled by different GNN-based approaches and how well these approaches perform. Based on the necessary preliminaries, we provide the definitions and challenges of the tasks, in-depth coverage of the representative approaches, as well as discussions regarding insights, limitations, and future directions.

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence
Publication Date: Jan 1, 2024
Citations: 2
