Transductive Generalization via Optimal Transport and Its Application to Graph Node Classification

📅 2026-03-10

📈 Citations: 0

✨ Influential: 0

career value

172K/year

🤖 AI Summary

This work addresses the limitations of existing transductive learning generalization bounds, which rely on classical complexity measures that are computationally intractable and poorly aligned with empirical performance. Under a distribution-free transductive setting, the authors propose a novel generalization bound grounded in optimal transport theory, leveraging the Wasserstein distance to quantify the discrepancy between encoded feature distributions. This bound is efficiently computable, exhibits strong correlation with empirical generalization error, and reveals a representation trade-off in graph neural networks (GNNs)—namely, the balance between intra-class aggregation and inter-class separation during message passing. The proposed measure significantly outperforms conventional metrics, effectively explaining the non-monotonic relationship between GNN depth and generalization error, and demonstrates strong predictive power on node classification tasks.

Technology Category

Application Category

📝 Abstract

Many existing transductive bounds rely on classical complexity measures that are computationally intractable and often misaligned with empirical behavior. In this work, we establish new representation-based generalization bounds in a distribution-free transductive setting, where learned representations are dependent, and test features are accessible during training. We derive global and class-wise bounds via optimal transport, expressed in terms of Wasserstein distances between encoded feature distributions. We demonstrate that our bounds are efficiently computable and strongly correlate with empirical generalization in graph node classification, improving upon classical complexity measures. Additionally, our analysis reveals how the GNN aggregation process transforms the representation distributions, inducing a trade-off between intra-class concentration and inter-class separation. This yields depth-dependent characterizations that capture the non-monotonic relationship between depth and generalization error observed in practice. The code is available at https://github.com/ml-postech/Transductive-OT-Gen-Bound.

Problem

Research questions and friction points this paper is trying to address.

transductive generalization

graph node classification

generalization bounds

optimal transport

Wasserstein distance

Innovation

Methods, ideas, or system contributions that make the work stand out.

Optimal Transport

Transductive Generalization

Wasserstein Distance