Understanding the Prompt Sensitivity

๐Ÿ“… 2026-04-20
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF

career value

197K/year
๐Ÿค– AI Summary
This work investigates the sensitivity of large language models to semantically equivalent yet lexically distinct prompts, which undermines output stability. Treating the model as a multivariate function, the authors derive a theoretical upper bound on the difference in log-probabilities of the next token between semantics-preserving prompts by combining first-order Taylor expansion, gradient analysis, and the Cauchyโ€“Schwarz inequality. Their analysis reveals that models do not internally cluster similar semantic inputs but instead represent them in a dispersed manner, preventing convergence in output behavior. Furthermore, prompt templates exert a stronger influence on logits than the underlying question content itself. The derived upper bound exhibits high correlation with PromptSensiScore, offering the first theoretical framework to explain prompt sensitivity in large language models.

Technology Category

Application Category

๐Ÿ“ Abstract
Prompt sensitivity, which refers to how strongly the output of a large language model (LLM) depends on the exact wording of its input prompt, raises concerns among users about the LLM's stability and reliability. In this work, we consider LLMs as multivariate functions and perform a first-order Taylor expansion, thereby analyzing the relationship between meaning-preserving prompts, their gradients, and the log probabilities of the model's next token. We derive an upper bound on the difference between log probabilities using the Cauchy-Schwarz inequality. We show that LLMs do not internally cluster similar inputs like smaller neural networks do, but instead disperse them. This dispersing behavior leads to an excessively high upper bound on the difference of log probabilities between two meaning-preserving prompts, making it difficult to effectively reduce to 0. In our analysis, we also show which types of meaning-preserving prompt variants are more likely to introduce prompt sensitivity risks in LLMs. In addition, we demonstrate that the upper bound is strongly correlated with an existing prompt sensitivity metric, PromptSensiScore. Moreover, by analyzing the logit variance, we find that prompt templates typically exert a greater influence on logits than the questions themselves. Overall, our results provide a general interpretation for why current LLMs can be highly sensitive to prompts with the same meaning, offering crucial evidence for understanding the prompt sensitivity of LLMs. Code for experiments is available at https://github.com/ku-nlp/Understanding_the_Prompt_Sensitivity.
Problem

Research questions and friction points this paper is trying to address.

prompt sensitivity
large language models
meaning-preserving prompts
output stability
log probability variance
Innovation

Methods, ideas, or system contributions that make the work stand out.

prompt sensitivity
Taylor expansion
log probability bound
meaning-preserving prompts
logit variance
๐Ÿ”Ž Similar Papers
No similar papers found.