Chain of Draft: Thinking Faster by Writing Less

📅 2025-02-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Large language models (LLMs) incur high computational overhead and latency in complex reasoning due to reliance on lengthy chain-of-thought (CoT) prompting. Method: This paper introduces Chain of Drafting (CoD), a novel reasoning paradigm inspired by human cognitive refinement of drafts. CoD guides LLMs to generate minimal yet information-complete intermediate reasoning steps—retaining only essential logical elements—via prompt engineering, without fine-tuning or additional parameters, and with broad compatibility across open- and closed-source LLMs. Contribution/Results: On diverse reasoning benchmarks, CoD matches or exceeds CoT accuracy while reducing inference token consumption to 7.6%–32% of CoT’s usage, substantially lowering computational cost and response latency. To our knowledge, this is the first work to formalize human draft-based reasoning cognition into a lightweight, deployable LLM inference paradigm.

Technology Category

Application Category

📝 Abstract
Large Language Models (LLMs) have demonstrated remarkable performance in solving complex reasoning tasks through mechanisms like Chain-of-Thought (CoT) prompting, which emphasizes verbose, step-by-step reasoning. However, humans typically employ a more efficient strategy: drafting concise intermediate thoughts that capture only essential information. In this work, we propose Chain of Draft (CoD), a novel paradigm inspired by human cognitive processes, where LLMs generate minimalistic yet informative intermediate reasoning outputs while solving tasks. By reducing verbosity and focusing on critical insights, CoD matches or surpasses CoT in accuracy while using as little as only 7.6% of the tokens, significantly reducing cost and latency across various reasoning tasks.
Problem

Research questions and friction points this paper is trying to address.

Enhance reasoning efficiency
Reduce token usage
Minimize computational cost
Innovation

Methods, ideas, or system contributions that make the work stand out.

Minimizes token usage
Enhances reasoning efficiency
Reduces computational costs
🔎 Similar Papers
No similar papers found.