Enhancing Out-of-distribution Generalization on Graphs via Causal Attention Learning

Yongduo Sui,Jiancan Wu,Xiang Wang,Xiangnan He,Shuyao Wang,Wenyu Mao,Tat-Seng Chua

doi:10.1145/3644392

Abstract

In graph classification, attention- and pooling-based graph neural networks (GNNs) predominate to extract salient features from the input graph and support the prediction. They mostly follow the paradigm of “learning to attend,” which maximizes the mutual information between the attended graph and the ground-truth label. However, this paradigm causes GNN classifiers to indiscriminately absorb all statistical correlations between input features and labels in the training data without distinguishing the causal and noncausal effects of features. Rather than emphasizing causal features, the attended graphs tend to rely on noncausal features as shortcuts to predictions. These shortcut features may easily change outside the training distribution, thereby leading to poor generalization for GNN classifiers. In this article, we take a causal view on GNN modeling. Under our causal assumption, the shortcut feature serves as a confounder between the causal feature and prediction. It misleads the classifier into learning spurious correlations that facilitate prediction in in-distribution (ID) test evaluation while causing significant performance drop in out-of-distribution (OOD) test data. To address this issue, we employ the backdoor adjustment from causal theory—combining each causal feature with various shortcut features, to identify causal patterns and mitigate the confounding effect. Specifically, we employ attention modules to estimate the causal and shortcut features of the input graph. Then, a memory bank collects the estimated shortcut features, enhancing the diversity of shortcut features for combination. Simultaneously, we apply the prototype strategy to improve the consistency of intra-class causal features. We term our method as CAL+, which can promote stable relationships between causal estimation and prediction, regardless of distribution changes. Extensive experiments on synthetic and real-world OOD benchmarks demonstrate our method’s effectiveness in improving OOD generalization. Our codes are released at https://github.com/shuyao-wang/CAL-plus .

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Enhancing Out-of-distribution Generalization on Graphs via Causal Attention Learning

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Knowledge Discovery from Data

Lead the way for us

Journal: ACM Transactions on Knowledge Discovery from Data	Publication Date: Mar 26, 2024
Citations: 1

Similar Papers

Causal Attention for Interpretable and Generalizable Graph Classification
Yongduo Sui ... Xiangnan He
-
Yongduo Sui, et. al.Yongduo Sui ... Xiangnan He
14 Aug 2022
14 Aug 2022

Robustness to Spurious Correlations in Text Classification via Automatically Generated Counterfactuals
Zhao Wang ... Aron Culotta
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35
Zhao Wang, et. al.Zhao Wang ... Aron Culotta
18 May 2021
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35

Debiased Graph Neural Networks With Agnostic Label Selection Bias.
Shaohua Fan ... Nian Liu
IEEE Transactions on Neural Networks and Learning Systems | VOL. 35
Shaohua Fan, et. al.Shaohua Fan ... Nian Liu
01 Apr 2024
IEEE Transactions on Neural Networks and Learning Systems | VOL. 35

Imbalanced Graph Classification via Graph-of-Graph Neural Networks
Yu Wang ... Yuying Zhao
-
Yu Wang, et. al.Yu Wang ... Yuying Zhao
17 Oct 2022
17 Oct 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Enhancing Out-of-distribution Generalization on Graphs via Causal Attention Learning

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Knowledge Discovery from Data