Uncertain Knowledge Graph Completion via Semi-Supervised Confidence Distribution Learning

📅 2025-10-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the severe class imbalance in triple confidence distributions within Uncertain Knowledge Graphs (UKGs)—which degrades embedding quality and limits completion performance—this paper proposes a semi-supervised confidence distribution learning framework. Methodologically, it models scalar confidences as distributional representations, leverages meta-learning to generate high-quality pseudo-labels, and jointly optimizes embeddings for both labeled and unlabeled data via iterative semi-supervised training. This approach effectively mitigates confidence distribution skew and substantially enhances model generalization to unseen triples. Experiments on two real-world UKG benchmarks demonstrate that our method significantly outperforms existing state-of-the-art baselines on both triple completion and confidence prediction—achieving more robust and accurate joint completion.

Technology Category

Application Category

📝 Abstract
Uncertain knowledge graphs (UKGs) associate each triple with a confidence score to provide more precise knowledge representations. Recently, since real-world UKGs suffer from the incompleteness, uncertain knowledge graph (UKG) completion attracts more attention, aiming to complete missing triples and confidences. Current studies attempt to learn UKG embeddings to solve this problem, but they neglect the extremely imbalanced distributions of triple confidences. This causes that the learnt embeddings are insufficient to high-quality UKG completion. Thus, in this paper, to address the above issue, we propose a new semi-supervised Confidence Distribution Learning (ssCDL) method for UKG completion, where each triple confidence is transformed into a confidence distribution to introduce more supervision information of different confidences to reinforce the embedding learning process. ssCDL iteratively learns UKG embedding by relational learning on labeled data (i.e., existing triples with confidences) and unlabeled data with pseudo labels (i.e., unseen triples with the generated confidences), which are predicted by meta-learning to augment the training data and rebalance the distribution of triple confidences. Experiments on two UKG datasets demonstrate that ssCDL consistently outperforms state-of-the-art baselines in different evaluation metrics.
Problem

Research questions and friction points this paper is trying to address.

Addresses imbalanced confidence distributions in uncertain knowledge graphs
Completes missing triples and confidence scores via semi-supervised learning
Enhances embedding quality using confidence distributions and pseudo-labels
Innovation

Methods, ideas, or system contributions that make the work stand out.

Semi-supervised confidence distribution learning for UKG completion
Transforms triple confidence into distribution for embedding reinforcement
Meta-learning generates pseudo labels to rebalance confidence distribution
🔎 Similar Papers
No similar papers found.
Tianxing Wu
Tianxing Wu
Ph.D. Student, Nanyang technological university
Computer Vision
S
Shutong Zhu
School of Computer Science and Engineering, Southeast University, China
J
Jingting Wang
School of Computer Science and Engineering, Southeast University, China
N
Ning Xu
School of Computer Science and Engineering, Southeast University, China
Guilin Qi
Guilin Qi
Southeast University
Artificial Intelligenceontology
Haofen Wang
Haofen Wang
Tongji University
Knowledge GraphNatural Language ProcessingRetrieval Augmented Generation