DualResearch: Entropy-Gated Dual-Graph Retrieval for Answer Reconstruction

📅 2025-10-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address context contamination, weak evidential support, and fragile execution paths in multi-tool collaborative scientific reasoning, this paper proposes a dual-graph retrieval framework integrated with entropy-controlled fusion. The framework constructs a semantic breadth graph—modeling tool coverage—and a causal depth graph—capturing causal interpretability of reasoning chains. An entropy-controlled gating mechanism dynamically weights heterogeneous evidence in log-space, enabling reliability-driven path selection and consistency enhancement. Additionally, we introduce seed-anchored semantic diffusion, causal path matching, and layer-native relevance functions, augmented by global calibration. Evaluated on the HLE and GPQA benchmarks, our method achieves substantial improvements: +7.7% accuracy on HLE and +6.06% on GPQA, demonstrating robustness and effectiveness for complex, multi-step scientific reasoning.

Technology Category

Application Category

📝 Abstract
The deep-research framework orchestrates external tools to perform complex, multi-step scientific reasoning that exceeds the native limits of a single large language model. However, it still suffers from context pollution, weak evidentiary support, and brittle execution paths. To address these issues, we propose DualResearch, a retrieval and fusion framework that matches the epistemic structure of tool-intensive reasoning by jointly modeling two complementary graphs: a breadth semantic graph that encodes stable background knowledge, and a depth causal graph that captures execution provenance. Each graph has a layer-native relevance function, seed-anchored semantic diffusion for breadth, and causal-semantic path matching with reliability weighting for depth. To reconcile their heterogeneity and query-dependent uncertainty, DualResearch converts per-layer path evidence into answer distributions and fuses them in log space via an entropy-gated rule with global calibration. The fusion up-weights the more certain channel and amplifies agreement. As a complement to deep-research systems, DualResearch compresses lengthy multi-tool execution logs into a concise reasoning graph, and we show that it can reconstruct answers stably and effectively. On the scientific reasoning benchmarks HLE and GPQA, DualResearch achieves competitive performance. Using log files from the open-source system InternAgent, its accuracy improves by 7.7% on HLE and 6.06% on GPQA.
Problem

Research questions and friction points this paper is trying to address.

Addresses context pollution in multi-step scientific reasoning
Improves evidentiary support through dual-graph retrieval framework
Stabilizes execution paths via entropy-gated answer reconstruction
Innovation

Methods, ideas, or system contributions that make the work stand out.

Models two complementary graphs for tool reasoning
Uses entropy-gated fusion to reconcile graph heterogeneity
Compresses execution logs into concise reasoning graphs
🔎 Similar Papers
No similar papers found.
Jinxin Shi
Jinxin Shi
East China Normal Unversity
Z
Zongsheng Cao
Shanghai Artificial Intelligence Laboratory, East China Normal University
Runmin Ma
Runmin Ma
Shanghai AI Lab
Y
Yusong Hu
Shanghai Artificial Intelligence Laboratory
J
Jie Zhou
Shanghai Artificial Intelligence Laboratory, East China Normal University
X
Xin Li
Shanghai Artificial Intelligence Laboratory, East China Normal University
Lei Bai
Lei Bai
Shanghai AI Laboratory
Foundation ModelScience IntelligenceMulti-Agent SystemAutonomous Discovery
L
Liang He
Shanghai Artificial Intelligence Laboratory, East China Normal University
B
Bo Zhang
Shanghai Artificial Intelligence Laboratory