A Survey of Deep Graph Learning under Distribution Shifts: from Graph Out-of-Distribution Generalization to Adaptation

📅 2024-10-25
🏛️ arXiv.org
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
Graph machine learning often suffers from distributional shifts (out-of-distribution, OOD) between training and test data in real-world scenarios, leading to significant performance degradation. To address this, this work systematically investigates the robustness of graph deep learning under distribution shifts and proposes the first unified taxonomy comprising three task paradigms: graph OOD generalization, training-time adaptation, and test-time adaptation. Methodologically, it categorizes approaches into “model-centric” (e.g., invariant learning, causal reasoning) and “data-centric” (e.g., graph augmentation, reweighting, subgraph reconstruction) strategies. Contributions include formal modeling of covariate and concept drift on graphs, comprehensive curation of benchmark datasets and evaluation protocols, identification of key challenges—particularly structural-attribute coupling shifts—and maintenance of an open-source, dynamically updated reading list. Collectively, this work establishes foundational principles and directions for scalable, theoretically grounded OOD adaptation in graph learning.

Technology Category

Application Category

📝 Abstract
Distribution shifts on graphs -- the discrepancies in data distribution between training and employing a graph machine learning model -- are ubiquitous and often unavoidable in real-world scenarios. These shifts may severely deteriorate model performance, posing significant challenges for reliable graph machine learning. Consequently, there has been a surge in research on graph machine learning under distribution shifts, aiming to train models to achieve satisfactory performance on out-of-distribution (OOD) test data. In our survey, we provide an up-to-date and forward-looking review of deep graph learning under distribution shifts. Specifically, we cover three primary scenarios: graph OOD generalization, training-time graph OOD adaptation, and test-time graph OOD adaptation. We begin by formally formulating the problems and discussing various types of distribution shifts that can affect graph learning, such as covariate shifts and concept shifts. To provide a better understanding of the literature, we introduce a systematic taxonomy that classifies existing methods into model-centric and data-centric approaches, investigating the techniques used in each category. We also summarize commonly used datasets in this research area to facilitate further investigation. Finally, we point out promising research directions and the corresponding challenges to encourage further study in this vital domain. We also provide a continuously updated reading list at https://github.com/kaize0409/Awesome-Graph-OOD.
Problem

Research questions and friction points this paper is trying to address.

Addressing performance degradation in graph learning due to distribution shifts
Surveying deep graph learning methods for OOD generalization and adaptation
Classifying approaches to handle covariate and concept shifts in graph data
Innovation

Methods, ideas, or system contributions that make the work stand out.

Deep graph learning under distribution shifts
Model-centric and data-centric approaches
Graph OOD generalization and adaptation
🔎 Similar Papers
No similar papers found.
Kexin Zhang
Kexin Zhang
Tsinghua University
Data MiningMachine Learning
S
Shuhan Liu
Department of Statistics and Data Science, Northwestern University
S
Song Wang
Department of Electrical and Computer Engineering, University of Virginia
Weili Shi
Weili Shi
University of Virginia
adversarial learning vision tranformer imbalanced learning
C
Chen Chen
Department of Computer Science, University of Virginia
P
Pan Li
School of Electrical and Computer Engineering, Georgia Institute of Technology, and Department of Computer Science, Purdue University
S
Sheng Li
School of Data Science, University of Virginia
Jundong Li
Jundong Li
Associate Professor, University of Virginia
AIMachine LearningData MiningGraph Learning
Kaize Ding
Kaize Ding
Assistant Professor of Stats & Data Science, Northwestern University
Reliable Machine LearningData-Efficient LearningAnomaly/OOD DetectionLLMs and GNNs