Neural Chain-of-Thought Search: Searching the Optimal Reasoning Path to Enhance Large Language Models

πŸ“… 2026-01-16
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Large language models often fall into suboptimal or redundant reasoning paths due to a lack of foresight. This work proposes a heuristic search–based neural chain-of-thought framework that formulates reasoning as a dynamic exploration of sparse, high-quality thought trajectories. The approach introduces a dual-factor heuristic strategy that jointly optimizes accuracy and computational cost to actively evaluate and select superior reasoning operators. Empirical results demonstrate that the method achieves an average accuracy improvement of over 3.5% across multiple reasoning benchmarks while reducing reasoning generation length by more than 22%, thereby attaining a Pareto improvement in both efficiency and performance.

Technology Category

Application Category

πŸ“ Abstract
Chain-of-Thought reasoning has significantly enhanced the problem-solving capabilities of Large Language Models. Unfortunately, current models generate reasoning steps sequentially without foresight, often becoming trapped in suboptimal reasoning paths with redundant steps. In contrast, we introduce Neural Chain-of-Thought Search (NCoTS), a framework that reformulates reasoning as a dynamic search for the optimal thinking strategy. By quantitatively characterizing the solution space, we reveal the existence of sparse superior reasoning paths that are simultaneously more accurate and concise than standard outputs. Our method actively navigates towards these paths by evaluating candidate reasoning operators using a dual-factor heuristic that optimizes for both correctness and computational cost. Consequently, NCoTS achieves a Pareto improvement across diverse reasoning benchmarks, boosting accuracy by over 3.5% while reducing generation length by over 22%. Our code and data are available at https://github.com/MilkThink-Lab/Neural-CoT-Search.
Problem

Research questions and friction points this paper is trying to address.

Chain-of-Thought
reasoning path
Large Language Models
suboptimal reasoning
redundant steps
Innovation

Methods, ideas, or system contributions that make the work stand out.

Neural Chain-of-Thought Search
reasoning path optimization
dual-factor heuristic
Pareto improvement
large language models
πŸ”Ž Similar Papers
No similar papers found.
G
Guoming Ling
Sun Yat-sen University
Z
Zhongzhan Huang
Sun Yat-sen University
Y
Yupei Lin
Sun Yat-sen University
J
Junxin Li
Sun Yat-sen University
S
Shan Zhong
Sun Yat-sen University
Hefeng Wu
Hefeng Wu
Sun Yat-sen University
Computer visionMachine LearningArtificial Intelligence
Liang Lin
Liang Lin
Fellow of IEEE/IAPR, Professor of Computer Science, Sun Yat-sen University
Embodied AICausal Inference and LearningMultimodal Data Analysis