TAdaRAG: Task Adaptive Retrieval-Augmented Generation via On-the-Fly Knowledge Graph Construction

📅 2025-11-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traditional RAG suffers from context-window limitations that truncate knowledge snippets, inducing hallucinations and breaking reasoning chains; moreover, retrieving unstructured text introduces noise that impairs reasoning. To address these issues, we propose a dynamic, task-adaptive knowledge graph construction framework that integrates intent-driven routing with implicit structured knowledge extraction, enabling domain-specific, concise, and non-redundant knowledge fusion. Our method jointly optimizes retrieval and generation via supervised fine-tuning and reinforcement learning, supporting real-time construction of structured knowledge representations from external sources. Evaluated on six public benchmarks and the real-world business dataset NowNewsQA, our approach achieves significant improvements over state-of-the-art methods, demonstrating superior performance in long-context understanding, cross-domain generalization, and reasoning coherence.

Technology Category

Application Category

📝 Abstract
Retrieval-Augmented Generation (RAG) improves large language models by retrieving external knowledge, often truncated into smaller chunks due to the input context window, which leads to information loss, resulting in response hallucinations and broken reasoning chains. Moreover, traditional RAG retrieves unstructured knowledge, introducing irrelevant details that hinder accurate reasoning. To address these issues, we propose TAdaRAG, a novel RAG framework for on-the-fly task-adaptive knowledge graph construction from external sources. Specifically, we design an intent-driven routing mechanism to a domain-specific extraction template, followed by supervised fine-tuning and a reinforcement learning-based implicit extraction mechanism, ensuring concise, coherent, and non-redundant knowledge integration. Evaluations on six public benchmarks and a real-world business benchmark (NowNewsQA) across three backbone models demonstrate that TAdaRAG outperforms existing methods across diverse domains and long-text tasks, highlighting its strong generalization and practical effectiveness.
Problem

Research questions and friction points this paper is trying to address.

Addresses information loss from truncated knowledge chunks in RAG
Solves irrelevant detail introduction from unstructured knowledge retrieval
Improves reasoning chains and reduces hallucinations in LLM responses
Innovation

Methods, ideas, or system contributions that make the work stand out.

On-the-fly task-adaptive knowledge graph construction
Intent-driven routing to domain-specific extraction templates
Reinforcement learning-based implicit extraction mechanism
🔎 Similar Papers
No similar papers found.
J
Jie Zhang
College of Computer and Data Science, Fuzhou University, Fuzhou 350116, China
B
Bo Tang
AIDS and SIAR, University of Science and Technology of China, Suzhou 215123, China
W
Wanzi Shao
College of Computer and Data Science, Fuzhou University, Fuzhou 350116, China
W
Wenqiang Wei
MemTensor (Shanghai) Technology Co., Ltd., Shanghai 201306, China
J
Jihao Zhao
MemTensor (Shanghai) Technology Co., Ltd., Shanghai 201306, China; School of Information, Renmin University of China, Beijing 100086, China
J
Jianqing Zhu
China Haisum Engineering Co., Ltd., Shanghai 200031, China
Zhiyu Li
Zhiyu Li
Tianjin University
Robust controlattitude control
W
Wen Xi
China Haisum Engineering Co., Ltd., Shanghai 200031, China
Z
Zehao Lin
MemTensor (Shanghai) Technology Co., Ltd., Shanghai 201306, China
Feiyu Xiong
Feiyu Xiong
MemTensor (Shanghai) Technology Co., Ltd.
Machine LearningNLPLLM
Yanchao Tan
Yanchao Tan
Fuzhou University
Data miningRecommender systemHealthcare