🤖 AI Summary
Existing graph neural network (GNN) pre-training and prompt learning methods overlook the dynamic evolution of node similarity/dissimilarity, particularly under non-homophilous graph structures—where homophilous and heterophilous patterns coexist—leading to suboptimal performance. Method: We propose ProNoG, the first pre-training and prompt learning framework tailored for non-homophilous graphs. It systematically analyzes how non-homophily affects pre-training objectives, introduces a conditional node-aware prompting mechanism to capture node-level heterophilous patterns, and adopts a dual-path design integrating theoretically grounded interpretability analysis with data-driven optimization. Contribution/Results: Evaluated on 10 public benchmarks, ProNoG consistently outperforms state-of-the-art methods, achieving an average accuracy gain of 7.2% on strongly heterophilous graphs. Ablation studies validate both the effectiveness and generalizability of node-level prompts, establishing ProNoG as a principled and empirically robust solution for non-homophilous graph representation learning.
📝 Abstract
Graphs are ubiquitous for modeling complex relationships between objects across various fields. Graph neural networks (GNNs) have become a mainstream technique for graph-based applications, but their performance heavily relies on abundant labeled data. To reduce labeling requirement, pre-training and prompt learning has become a popular alternative. However, most existing prompt methods do not differentiate homophilic and heterophilic characteristics of real-world graphs. In particular, many real-world graphs are non-homophilic, not strictly or uniformly homophilic with mixing homophilic and heterophilic patterns, exhibiting varying non-homophilic characteristics across graphs and nodes. In this paper, we propose ProNoG, a novel pre-training and prompt learning framework for such non-homophilic graphs. First, we analyze existing graph pre-training methods, providing theoretical insights into the choice of pre-training tasks. Second, recognizing that each node exhibits unique non-homophilic characteristics, we propose a conditional network to characterize the node-specific patterns in downstream tasks. Finally, we thoroughly evaluate and analyze ProNoG through extensive experiments on ten public datasets.