FICOM: an effective and scalable active learning framework for GNNs on semi-supervised node classification

Xingyi Zhang, Jinchao Huang, Fangyuan Zhang, Sibo Wang

doi:10.1007/s00778-024-00870-z

Abstract

Active learning for graph neural networks (GNNs) aims to select B nodes to label so that the resulting GNN performs as well as possible. Carefully selected labeled nodes can improve GNN performance, which has motivated a line of research. Unfortunately, existing methods either still yield inferior GNN performance or cannot scale to large networks. Motivated by these limitations, in this paper we present FICOM, an effective and scalable GNN active learning framework. First, we formulate node selection as an optimization problem in which a node is scored by (i) its importance during feature propagation, which has a connection to personalized PageRank (PPR), and (ii) the diversity it brings to the embedding space generated by feature propagation. We show that the defined problem is submodular, so a greedy solution provides a $(1-1/e)$-approximate solution. However, a standard greedy solution must find the node with the maximum marginal gain of the objective score in each iteration, which incurs a prohibitive running cost and cannot scale to large datasets. As our main contribution, we present FICOM, an efficient and scalable solution that provides a $(1-1/e)$-approximation guarantee and scales to graphs with millions of nodes on a single machine. The main idea is to adaptively maintain lower and upper bounds on the marginal gain of each node v. In each iteration, we first derive a small subset of candidate nodes from these bounds and then compute the exact score only for this subset, so that the node with the maximum marginal gain can be found efficiently. Extensive experiments on six benchmark datasets using four GNNs (GCN, SGC, APPNP, and GCNII) show that FICOM consistently outperforms existing active learning approaches on semi-supervised node classification tasks across these GNNs. Moreover, our solution finishes within 5 hours on a million-node graph.
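To make the bound-based pruning idea concrete, here is a minimal sketch of the classic lazy greedy heuristic for monotone submodular maximization. It is not the FICOM algorithm: the abstract does not specify how FICOM's lower and upper bounds are built, so this sketch uses only the standard observation that, under submodularity, a marginal gain computed in an earlier round is an upper bound on the current one. The coverage objective, the `covers` toy data, and the function names are all assumptions made for illustration and stand in for the paper's importance-plus-diversity score.

```python
import heapq

# Hypothetical toy data: node -> set of nodes it "covers".
# This is a stand-in objective, not the objective defined in the paper.
covers = {0: {0, 1, 2, 3}, 1: {2, 3}, 2: {4, 5}, 3: {5}, 4: {0, 4}}


def coverage_gain(v, selected):
    """Marginal gain of adding v: number of newly covered nodes."""
    covered = set().union(*(covers[u] for u in selected))
    return len(covers[v] - covered)


def lazy_greedy(candidates, gain, budget):
    """Pick up to `budget` nodes for a monotone submodular objective.

    Each heap entry is (-upper_bound, node, round), where `round` is the
    size of the selected set when the bound was computed. A stale gain is
    still a valid upper bound because marginal gains only shrink as the
    selected set grows.
    """
    selected = []
    heap = [(-gain(v, selected), v, 0) for v in candidates]
    heapq.heapify(heap)
    while heap and len(selected) < budget:
        neg_bound, v, stamp = heapq.heappop(heap)
        if stamp == len(selected):
            # The bound is exact for the current set and no other
            # candidate's upper bound exceeds it, so v is the argmax.
            selected.append(v)
        else:
            # Stale bound: recompute the exact gain and re-insert.
            heapq.heappush(heap, (-gain(v, selected), v, len(selected)))
    return selected


print(lazy_greedy(list(covers), coverage_gain, budget=2))  # prints [0, 2]
```

In this toy run, nodes 3 and 4 are never re-evaluated in the second round because their stale upper bounds cannot beat the exact gain of node 2. This is the same kind of saving the abstract describes, where exact scores are computed only for a small candidate subset.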

Journal: The VLDB Journal
Publication Date: Jul 22, 2024
License type: CC BY 4.0

Similar Papers

The interplay between communities and homophily in semi-supervised classification using graph neural networks
Hussain Hussain ... Elisabeth Lex
Applied Network Science | VOL. 6
26 Oct 2021

Semi-Supervised Node Classification on Graphs: Markov Random Fields vs. Graph Neural Networks
Binghui Wang ... Neil Zhenqiang Gong
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35
18 May 2021

Towards an Optimal Asymmetric Graph Structure for Robust Semi-supervised Node Classification
Zixing Song ... Irwin King
14 Aug 2022

Semantic graph neural network with multi-measure learning for semi-supervised classification
Junchao Lin ... Xingchen Qi
Engineering Applications of Artificial Intelligence | VOL. 140
29 Nov 2024
