Pseudo-Knowledge Graph: Meta-Path Guided Retrieval and In-Graph Text for RAG-Equipped LLM

📅 2025-03-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address incomplete retrieval and weak relational awareness—leading to fragmented answers—in Retrieval-Augmented Generation (RAG) applied to high-volume, low-information-density data, this paper proposes the Pseudo-Knowledge Graph (PKG) framework. PKG eschews explicit triple extraction; instead, it leverages meta-paths to guide multi-hop semantic retrieval and natively embeds raw text into a graph structure. It jointly employs intra-graph textual modeling and hybrid vector retrieval to construct relation-enhanced contexts. Its core innovation lies in the first-of-its-kind dual-mechanism design: meta-path-driven navigation coupled with native text preservation—enabling implicit entity relationship modeling without additional annotation cost. Evaluated on Open Compass and MultiHop-RAG benchmarks, PKG achieves significant improvements in multi-hop question answering accuracy and long-tail fact recall, demonstrating its effectiveness for high-precision relational reasoning over massive, sparse datasets.

Technology Category

Application Category

📝 Abstract
The advent of Large Language Models (LLMs) has revolutionized natural language processing. However, these models face challenges in retrieving precise information from vast datasets. Retrieval-Augmented Generation (RAG) was developed to combining LLMs with external information retrieval systems to enhance the accuracy and context of responses. Despite improvements, RAG still struggles with comprehensive retrieval in high-volume, low-information-density databases and lacks relational awareness, leading to fragmented answers. To address this, this paper introduces the Pseudo-Knowledge Graph (PKG) framework, designed to overcome these limitations by integrating Meta-path Retrieval, In-graph Text and Vector Retrieval into LLMs. By preserving natural language text and leveraging various retrieval techniques, the PKG offers a richer knowledge representation and improves accuracy in information retrieval. Extensive evaluations using Open Compass and MultiHop-RAG benchmarks demonstrate the framework's effectiveness in managing large volumes of data and complex relationships.
Problem

Research questions and friction points this paper is trying to address.

Enhance LLM information retrieval accuracy and context.
Address fragmented answers in high-volume, low-density databases.
Improve relational awareness in Retrieval-Augmented Generation systems.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrates Meta-path Retrieval for enhanced data navigation
Combines In-graph Text with Vector Retrieval techniques
Utilizes Pseudo-Knowledge Graph for improved relational awareness
🔎 Similar Papers
No similar papers found.
Y
Yuxin Yang
Peking University, Beijing, China
Haoyang Wu
Haoyang Wu
Intel Labs China
wireless sensingwireless network architecturesoftware-defined radio
T
Tao Wang
Peking University, Beijing, China
J
Jia Yang
Peking University, Beijing, China
H
Hao Ma
Peking University, Beijing, China
Guojie Luo
Guojie Luo
Peking University
Electronic Design AutomationReconfigurable Architecture