Posterior Label Smoothing for Node Classification

📅 2024-06-01
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address overfitting in graph node classification when transitioning from homophilous to heterophilous graphs, this paper proposes Posterior Label Smoothing (PLS). Under a semi-supervised setting, PLS employs Bayesian posterior inference to jointly model neighborhood label likelihoods and global graph-level statistical priors, dynamically generating adaptive soft labels for structure-aware label regularization. Moreover, it is the first work to theoretically and empirically reveal that iterative pseudo-label refinement enhances global statistical consistency. Extensive experiments across 10 benchmark datasets and 8 backbone models demonstrate significant accuracy improvements. Empirical analysis confirms that PLS effectively mitigates overfitting and strengthens generalization. By integrating structural information and global statistics into label smoothing, PLS establishes a novel, interpretable, and robust paradigm for heterophilous graph learning.

Technology Category

Application Category

📝 Abstract
Label smoothing is a widely studied regularization technique in machine learning. However, its potential for node classification in graph-structured data, spanning homophilic to heterophilic graphs, remains largely unexplored. We introduce posterior label smoothing, a novel method for transductive node classification that derives soft labels from a posterior distribution conditioned on neighborhood labels. The likelihood and prior distributions are estimated from the global statistics of the graph structure, allowing our approach to adapt naturally to various graph properties. We evaluate our method on 10 benchmark datasets using eight baseline models, demonstrating consistent improvements in classification accuracy. The following analysis demonstrates that soft labels mitigate overfitting during training, leading to better generalization performance, and that pseudo-labeling effectively refines the global label statistics of the graph. Our code is available at https://github.com/ml-postech/PosteL.
Problem

Research questions and friction points this paper is trying to address.

Develops posterior label smoothing for transductive node classification tasks
Adapts to diverse graph properties from homophilic to heterophilic structures
Addresses overfitting through soft labels derived from neighborhood distributions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Posterior label smoothing for node classification
Soft labels from neighborhood-based posterior distribution
Adapts to graph properties using global statistics
🔎 Similar Papers
No similar papers found.
J
Jaeseung Heo
Graduate School of Artificial Intelligence, POSTECH, South Korea
M
M. Park
Graduate School of Artificial Intelligence, POSTECH, South Korea
D
Dongwoo Kim
Graduate School of Artificial Intelligence, POSTECH, South Korea; Department of Computer Science & Engineering, POSTECH, South Korea