Knowing What's Missing: Assessing Information Sufficiency in Question Answering

📅 2025-12-06

📈 Citations: 0

✨ Influential: 0

career value

182K/year

🤖 AI Summary

This work addresses the challenge of assessing contextual sufficiency in question-answering systems, particularly for inferential questions requiring multi-hop reasoning. We propose a two-stage structured judgment framework: first, generating verifiable hypotheses about missing information; second, validating their truthfulness via semantic clustering and re-examination of source passages. The method integrates large language model–driven hypothesis generation, reasoning-chain-guided justification, and consensus-driven semantic verification, substantially improving detection of implicit information gaps. Evaluated on multiple multi-hop and factoid QA benchmarks, our approach outperforms existing baselines in contextual sufficiency classification accuracy. Moreover, it enables precise localization of critical information deficits within reasoning chains, enhancing interpretability and diagnostic capability.

Technology Category

Application Category

📝 Abstract

Determining whether a provided context contains sufficient information to answer a question is a critical challenge for building reliable question-answering systems. While simple prompting strategies have shown success on factual questions, they frequently fail on inferential ones that require reasoning beyond direct text extraction. We hypothesize that asking a model to first reason about what specific information is missing provides a more reliable, implicit signal for assessing overall sufficiency. To this end, we propose a structured Identify-then-Verify framework for robust sufficiency modeling. Our method first generates multiple hypotheses about missing information and establishes a semantic consensus. It then performs a critical verification step, forcing the model to re-examine the source text to confirm whether this information is truly absent. We evaluate our method against established baselines across diverse multi-hop and factual QA datasets. The results demonstrate that by guiding the model to justify its claims about missing information, our framework produces more accurate sufficiency judgments while clearly articulating any information gaps.

Problem

Research questions and friction points this paper is trying to address.

Assess information sufficiency for reliable question-answering systems

Address failure on inferential questions needing reasoning beyond extraction

Propose a framework to identify missing information and verify its absence

Innovation

Methods, ideas, or system contributions that make the work stand out.

Identify-then-Verify framework for sufficiency modeling

Generate hypotheses about missing information for consensus

Verify by re-examining source text for confirmation

🔎 Similar Papers

No similar papers found.