First Token Probability Guided RAG for Telecom Question Answering

📅 2025-01-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address inaccurate retrieval and frequent hallucinations in Retrieval-Augmented Generation (RAG) for multi-choice question answering (MCQA) in the telecommunications domain, this paper proposes a first-token-probability-guided dynamic RAG framework. The method leverages the probability distribution over the first token of the large language model’s (LLM) answer as a lightweight, interpretable confidence signal to dynamically adjust key RAG parameters—including retrieved passage count, sliding window size, and context composition—enabling online hyperparameter tuning and adaptive context reconstruction. To our knowledge, this is the first work to employ first-token probability as a real-time control signal for RAG optimization. Evaluated on a telecom-specific MCQA benchmark, our approach achieves significant improvements in answer accuracy while substantially reducing hallucination rates, demonstrating that probability-driven adaptation consistently enhances the robustness and reliability of domain-specific RAG systems.

Technology Category

Application Category

📝 Abstract
Large Language Models (LLMs) have garnered significant attention for their impressive general-purpose capabilities. For applications requiring intricate domain knowledge, Retrieval-Augmented Generation (RAG) has shown a distinct advantage in incorporating domain-specific information into LLMs. However, existing RAG research has not fully addressed the challenges of Multiple Choice Question Answering (MCQA) in telecommunications, particularly in terms of retrieval quality and mitigating hallucinations. To tackle these challenges, we propose a novel first token probability guided RAG framework. This framework leverages confidence scores to optimize key hyperparameters, such as chunk number and chunk window size, while dynamically adjusting the context. Our method starts by retrieving the most relevant chunks and generates a single token as the potential answer. The probabilities of all options are then normalized to serve as confidence scores, which guide the dynamic adjustment of the context. By iteratively optimizing the hyperparameters based on these confidence scores, we can continuously improve RAG performance. We conducted experiments to validate the effectiveness of our framework, demonstrating its potential to enhance accuracy in domain-specific MCQA tasks.
Problem

Research questions and friction points this paper is trying to address.

Telecommunication
Retrieval-Augmented Generation
Accuracy Improvement
Innovation

Methods, ideas, or system contributions that make the work stand out.

First-Character Probability Guided RAG
Confidence Score-based Strategy Adjustment
Enhanced Accuracy in Professional MCQs
🔎 Similar Papers
No similar papers found.
T
Tingwei Chen
Shenzhen Research Institute of Big Data, Shenzhen, China
J
Jiayi Chen
Shenzhen Research Institute of Big Data, Shenzhen, China
Z
Zijian Zhao
School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
Haolong Chen
Haolong Chen
The Chinese University of Hong Kong, Shenzhen
Artificial IntelligenceComputer Science
L
Liang Zhang
Shenzhen Research Institute of Big Data, Shenzhen, China
G
Guangxu Zhu
Shenzhen Research Institute of Big Data, Shenzhen, China