A Self-Explainable Heterogeneous GNN for Relational Deep Learning

📅 2024-11-30

🏛️ arXiv.org

📈 Citations: 0

✨ Influential: 0

career value

212K/year

🤖 AI Summary

Existing heterogeneous graph neural networks (HGNNs) struggle to efficiently and interpretably mine discriminative meta-paths from multi-table relational databases: conventional approaches rely on expert priors or exhaustive enumeration, suffering poor scalability; recent unsupervised methods erroneously assume class labels depend solely on meta-path *existence*, ignoring multiplicity and semantics. This paper proposes the first self-explaining HGNN framework supporting *multi-instance meta-path aggregation*. It integrates differentiable meta-path discovery, attention-driven importance scoring, and an aggregation-based semantic reasoning module—enabling node classification decisions to explicitly depend on the joint semantic aggregation of multiple meta-paths. Evaluated on synthetic and real-world datasets, our method significantly outperforms state-of-the-art baselines, achieving up to a 12.6% accuracy gain. Moreover, it automatically identifies highly discriminative meta-paths that faithfully reflect the model’s reasoning logic, enhancing both performance and interpretability.

Technology Category

Application Category

📝 Abstract

Recently, significant attention has been given to the idea of viewing relational databases as heterogeneous graphs, enabling the application of graph neural network (GNN) technology for predictive tasks. However, existing GNN methods struggle with the complexity of the heterogeneous graphs induced by databases with numerous tables and relations. Traditional approaches either consider all possible relational meta-paths, thus failing to scale with the number of relations, or rely on domain experts to identify relevant meta-paths. A recent solution does manage to learn informative meta-paths without expert supervision, but assumes that a node's class depends solely on the existence of a meta-path occurrence. In this work, we present a self-explainable heterogeneous GNN for relational data, that supports models in which class membership depends on aggregate information obtained from multiple occurrences of a meta-path. Experimental results show that in the context of relational databases, our approach effectively identifies informative meta-paths that faithfully capture the model's reasoning mechanisms. It significantly outperforms existing methods in both synthetic and real-world scenario.

Problem

Research questions and friction points this paper is trying to address.

addresses complexity in heterogeneous graphs

learns meta-paths without expert supervision

improves relational database predictive accuracy

Innovation

Methods, ideas, or system contributions that make the work stand out.

Self-explainable heterogeneous GNN

Aggregate meta-path information

Automated informative meta-path identification

🔎 Similar Papers

No similar papers found.