🤖 AI Summary
Large language models often exhibit fragile and unstable performance in multi-step logical reasoning due to high-entropy bifurcations at logical connectives. This work identifies logical connectives as critical vulnerability points and introduces a novel multi-layer framework that applies precise interventions exclusively at these logical junctions. The approach integrates gradient-guided logical steering, localized look-ahead branch search, and preference optimization via reinforcement learning tailored to logical turning points. Without substantially increasing computational overhead, the method achieves a superior trade-off between accuracy and efficiency compared to global strategies such as beam search and self-consistency.
📝 Abstract
While LLMs demonstrate impressive reasoning capabilities, they remain fragile in multi-step logical deduction, where a single transition error can propagate through the entire reasoning chain, leading to unstable performance. In this work, we identify logical connectives as primary points of this structural fragility. Through empirical analysis, we show that connective tokens function as high entropy forking points, at which models frequently struggle to determine the correct logical direction. Motivated by this observation, we hypothesize that intervening in logical connective selection can guide LLMs toward more correct logical direction, thereby improving the overall reasoning chain. To validate this hypothesis, we propose a multi-layered framework that intervenes specifically at these logic-critical junctions in the reasoning process. Our framework includes (1) Gradient-based Logical Steering to guide LLMs internal representations towards valid reasoning subspaces, (2) Localized Branching to resolve ambiguity via targeted look-ahead search, and (3) Targeted Transition Preference Optimization, a surgical reinforcement learning objective that selectively optimizes single-token preferences at logical pivots. Crucially, by concentrating intervention solely on logic-critical transitions, our framework achieves a favorable accuracy--efficiency trade-off compared to global inference time scaling methods like beam search and self-consistency.