AI Summary
This study addresses the significant degradation in numerical reasoning performance of large language models when they are confronted with numeral systems or formats rarely seen in their training data. We systematically evaluate mainstream large language models across diverse numeral representations and, for the first time, reveal the critical impact of numeric format on model capabilities. To mitigate this limitation, we propose a targeted strategy that combines few-shot prompting with explicit numeral mapping, effectively enhancing the models' cross-format generalization. Experimental results demonstrate that our approach substantially narrows the performance gap observed under non-standard numeral formats, offering a novel pathway toward improving numerical robustness in large language models.
Abstract
Large language models (LLMs) have achieved impressive proficiency in basic arithmetic, rivaling human-level performance on standard numerical tasks. However, little attention has been given to how these models perform when numerical expressions deviate from the prevailing conventions present in their training corpora. In this work, we investigate numerical reasoning across a wide range of numeral scripts and formats. We show that LLM accuracy drops substantially when numerical inputs are rendered in underrepresented scripts or formats, despite the underlying mathematical reasoning being identical. We further demonstrate that targeted prompting strategies, such as few-shot prompting and explicit numeral mapping, can greatly narrow this gap. Our findings highlight an overlooked challenge in multilingual numerical reasoning and provide actionable insights for working with LLMs to reliably interpret, manipulate, and generate numbers across diverse numeral scripts and formatting styles.
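The "explicit numeral mapping" idea can be illustrated with a small preprocessing step that converts digits from underrepresented scripts to ASCII before a prompt reaches the model. The sketch below is not from the paper; the script choices (Eastern Arabic and Devanagari) and function names are illustrative assumptions.

```python
# Illustrative sketch (assumed, not the paper's implementation):
# normalize numerals from non-ASCII scripts to Western Arabic digits
# before querying an LLM.

EASTERN_ARABIC = "٠١٢٣٤٥٦٧٨٩"  # U+0660 .. U+0669
DEVANAGARI = "०१२३४५६७८९"       # U+0966 .. U+096F

def build_digit_map(*scripts: str) -> dict[int, str]:
    """Map each non-ASCII digit codepoint to its ASCII equivalent.

    Each script string lists its digits in value order 0..9, so the
    index of a character is also its numeric value.
    """
    table: dict[int, str] = {}
    for digits in scripts:
        for value, ch in enumerate(digits):
            table[ord(ch)] = str(value)
    return table

DIGIT_MAP = build_digit_map(EASTERN_ARABIC, DEVANAGARI)

def normalize_numerals(text: str) -> str:
    """Rewrite any mapped digit to its ASCII form; other text is untouched."""
    return text.translate(DIGIT_MAP)

print(normalize_numerals("٤٢ + ७ = ?"))  # → "42 + 7 = ?"
```

Pairing such a mapping with a few in-context examples of the target script, as the paper's prompting strategy suggests, gives the model both the familiar digit forms and demonstrations of the conversion.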