🤖 AI Summary
Graph neural networks (GNNs) struggle to capture pair-specific subgraph structures in link prediction, as their message-passing mechanisms inherently ignore topological details of connecting paths; existing structural enhancement approaches either incur prohibitive computational overhead or rely solely on shallow heuristics (e.g., common neighbors), failing to model multi-hop dependencies. This paper proposes PathGNN: the first GNN-based link prediction framework that explicitly incorporates shortest-path sequence modeling—first generating node embeddings via a base GNN, then enumerating all shortest paths between candidate node pairs, and finally aggregating node embeddings along each path using a sequence model (e.g., Transformer). We theoretically prove that PathGNN’s expressive power strictly surpasses both standard GNNs and mainstream structural feature methods. Empirically, PathGNN achieves state-of-the-art performance across multiple benchmark datasets, while maintaining linear time complexity and high representational capacity.
📝 Abstract
Graph Neural Networks (GNNs) often struggle to capture the link-specific structural patterns crucial for accurate link prediction, as their node-centric message-passing schemes overlook the subgraph structures connecting a pair of nodes. Existing methods to inject such structural context either incur high computational cost or rely on simplistic heuristics (e.g., common neighbor counts) that fail to model multi-hop dependencies. We introduce SP4LP (Shortest Path for Link Prediction), a novel framework that combines GNN-based node encodings with sequence modeling over shortest paths. Specifically, SP4LP first applies a GNN to compute representations for all nodes, then extracts the shortest path between each candidate node pair and processes the resulting sequence of node embeddings using a sequence model. This design enables SP4LP to capture expressive multi-hop relational patterns with computational efficiency. Empirically, SP4LP achieves state-of-the-art performance across link prediction benchmarks. Theoretically, we prove that SP4LP is strictly more expressive than standard message-passing GNNs and several state-of-the-art structural features methods, establishing it as a general and principled approach for link prediction in graphs.