🤖 AI Summary
This work systematically investigates the multi-step chain-of-thought (CoT) mechanism in DeepSeek-R1, focusing on its reasoning behavior, controllability, contextual robustness, cultural adaptability, and safety vulnerabilities. We propose “Thoughtology”—a novel paradigm integrating interpretability analysis, statistical modeling of reasoning traces, cognitive comparative experiments, and red-teaming. Key findings include: (1) a non-monotonic relationship between reasoning length and performance, with a distinct “sweet spot”; (2) an intrinsic tendency toward thought rumination—repetitive restatement of the problem impedes deep reasoning; and (3) reasoning augmentation unexpectedly introduces novel safety vulnerabilities, with alignment strength significantly weaker than that of non-reasoning baseline models. Our study establishes the first taxonomy of reasoning behaviors in large language models, providing both theoretical foundations and empirical benchmarks for trustworthy reasoning in foundation models.
📝 Abstract
Large Reasoning Models like DeepSeek-R1 mark a fundamental shift in how LLMs approach complex problems. Instead of directly producing an answer for a given input, DeepSeek-R1 creates detailed multi-step reasoning chains, seemingly"thinking"about a problem before providing an answer. This reasoning process is publicly available to the user, creating endless opportunities for studying the reasoning behaviour of the model and opening up the field of Thoughtology. Starting from a taxonomy of DeepSeek-R1's basic building blocks of reasoning, our analyses on DeepSeek-R1 investigate the impact and controllability of thought length, management of long or confusing contexts, cultural and safety concerns, and the status of DeepSeek-R1 vis-`a-vis cognitive phenomena, such as human-like language processing and world modelling. Our findings paint a nuanced picture. Notably, we show DeepSeek-R1 has a 'sweet spot' of reasoning, where extra inference time can impair model performance. Furthermore, we find a tendency for DeepSeek-R1 to persistently ruminate on previously explored problem formulations, obstructing further exploration. We also note strong safety vulnerabilities of DeepSeek-R1 compared to its non-reasoning counterpart, which can also compromise safety-aligned LLMs.