Exploring Heterophily in Graph-level Tasks

📅 2025-09-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work investigates the impact mechanism of heterophily (disassortativity) in graph-level tasks, addressing a critical gap in both theoretical understanding and methodological development for graph-level learning. We propose a graph-level label pattern taxonomy, grounded in motif-based local structural signatures, and theoretically establish—via spectral analysis—for the first time that motif detection relies on a dynamic mixture of multiple spectral components, contradicting conventional global-frequency-dominant mechanisms. Building on this insight, we design a frequency-adaptive GNN architecture and conduct rigorous theoretical analysis based on energy gradient flow. Experiments on synthetic benchmarks and real-world molecular property prediction tasks demonstrate that our model significantly outperforms frequency-dominant baselines under controlled heterophilous settings, empirically validating the essential role of spectral adaptivity in modeling graph-level heterophily.

Technology Category

Application Category

📝 Abstract
While heterophily has been widely studied in node-level tasks, its impact on graph-level tasks remains unclear. We present the first analysis of heterophily in graph-level learning, combining theoretical insights with empirical validation. We first introduce a taxonomy of graph-level labeling schemes, and focus on motif-based tasks within local structure labeling, which is a popular labeling scheme. Using energy-based gradient flow analysis, we reveal a key insight: unlike frequency-dominated regimes in node-level tasks, motif detection requires mixed-frequency dynamics to remain flexible across multiple spectral components. Our theory shows that motif objectives are inherently misaligned with global frequency dominance, demanding distinct architectural considerations. Experiments on synthetic datasets with controlled heterophily and real-world molecular property prediction support our findings, showing that frequency-adaptive model outperform frequency-dominated models. This work establishes a new theoretical understanding of heterophily in graph-level learning and offers guidance for designing effective GNN architectures.
Problem

Research questions and friction points this paper is trying to address.

Analyzing heterophily's impact on graph-level learning tasks
Investigating motif detection requirements for mixed-frequency dynamics
Designing frequency-adaptive GNN architectures for graph-level tasks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Analyzed heterophily in graph-level tasks theoretically and empirically
Used energy-based gradient flow for motif detection analysis
Proposed frequency-adaptive models outperform frequency-dominated architectures
🔎 Similar Papers
Q
Qinhan Hou
Doctoral Program of Computer Science, University of Helsinki, Helsinki, Finland
Y
Yilun Zheng
Centre for Information Sciences and Systems, Nanyang Technological University, Singapore
X
Xichun Zhang
Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
Sitao Luan
Sitao Luan
University of Montreal, Mila
Graph LearningAI4ScienceGraph for LLMLLM for GraphRL Reasoning
J
Jing Tang
Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland