AI Summary
Retrieval-Augmented Generation (RAG) systems struggle to ensure response trustworthiness when internal parametric knowledge conflicts with, or is less reliable than, external retrieved knowledge; existing approaches address only isolated scenarios and lack a unified modeling framework. Method: We introduce the Trustworthiness Response Dataset (TRD), comprising 36,266 diverse questions, and propose BRIDGE, a dynamic response strategy framework featuring a novel soft-bias adaptive weighting mechanism and a maximum-soft-bias decision tree. BRIDGE jointly assesses source credibility and selects optimal response strategies across four realistic RAG scenarios, enabling LLM-based trustworthiness intervention between retrieval and generation. Contribution/Results: On TRD, BRIDGE achieves 5-15% higher accuracy than strong baselines, demonstrates balanced and stable performance across all scenarios, and significantly enhances the reliability of RAG responses in open-domain settings.
Abstract
Retrieval-augmented generation (RAG) systems face critical challenges in balancing internal (parametric) and external (retrieved) knowledge, especially when these sources conflict or are unreliable. To analyze these scenarios comprehensively, we construct the Trustworthiness Response Dataset (TRD) with 36,266 questions spanning four RAG settings. We reveal that existing approaches address only isolated scenarios: prioritizing one knowledge source, naively merging both, or refusing to answer. They lack a unified framework that handles different real-world conditions simultaneously. Therefore, we propose the BRIDGE framework, which dynamically determines a comprehensive response strategy for large language models (LLMs). BRIDGE leverages an adaptive weighting mechanism named soft bias to guide knowledge collection, followed by a Maximum Soft-bias Decision Tree to evaluate knowledge and select the optimal response strategy (trust internal knowledge, trust external knowledge, or refuse). Experiments show BRIDGE outperforms baselines by 5-15% in accuracy while maintaining balanced performance across all scenarios. Our work provides an effective solution for LLMs' trustworthy responses in real-world RAG applications.
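The strategy-selection step described above can be sketched as a minimal toy in Python. This is an illustrative placeholder only: the paper's actual soft-bias weighting and Maximum Soft-bias Decision Tree are not specified here, so the `choose_strategy` function, its confidence inputs, and the `refuse_threshold` parameter are all hypothetical names and values chosen for the sketch.

```python
# Hypothetical sketch of a BRIDGE-style decision step: given credibility
# scores for internal (parametric) and external (retrieved) knowledge,
# pick one of the three response strategies named in the abstract.
# Thresholds and scoring are illustrative, not the paper's method.

def choose_strategy(internal_conf: float, external_conf: float,
                    refuse_threshold: float = 0.35) -> str:
    """Select a response strategy from two credibility scores in [0, 1]."""
    # If neither knowledge source is credible enough, refuse to answer.
    if max(internal_conf, external_conf) < refuse_threshold:
        return "refuse"
    # Otherwise trust whichever source carries the larger weight.
    if internal_conf >= external_conf:
        return "trust_internal"
    return "trust_external"

print(choose_strategy(0.20, 0.10))  # refuse
print(choose_strategy(0.40, 0.80))  # trust_external
print(choose_strategy(0.90, 0.50))  # trust_internal
```

In the real framework these scores would come from the soft-bias weighting over collected knowledge, and the decision would traverse a learned tree rather than a single threshold comparison.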