Answer, Refuse, or Guess? Investigating Risk-Aware Decision Making in Language Models

2025-03-03
AI Summary
Current language models lack intrinsic risk-awareness mechanisms, leaving them unable to decide autonomously when to answer, refuse, or guess under uncertainty. Method: We formally define the "risk-aware decision-making" task and propose a skill-decomposition framework that uses chain-of-thought prompting and prompt chaining to model reasoning and refusal behavior jointly, complemented by a risk-sensitive evaluation protocol. Contribution/Results: Experiments show that even state-of-the-art reasoning models make risk-informed decisions reliably only when guided by explicit prompt chains, and that static refusal strategies have fundamental limitations. The work establishes a principled benchmark for risk-adaptive decision-making and outlines the challenges that remain on the way to autonomous, safety-aware agents.

Abstract
Knowing when to answer or refuse is crucial for safe and reliable decision-making language agents. Although prior work has introduced refusal strategies to boost LMs' reliability, how these models adapt their decisions to different risk levels remains underexplored. We formalize the task of risk-aware decision-making, expose critical weaknesses in existing LMs, and propose skill-decomposition solutions to mitigate them. Our findings show that even cutting-edge LMs (both regular and reasoning models) still require explicit prompt chaining to handle the task effectively, revealing the challenges that must be overcome to achieve truly autonomous decision-making agents.
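To make the idea of adapting decisions to different risk levels concrete, here is a minimal sketch of an expected-value decision rule. It is an illustration, not the paper's implementation: the function name `decide` and the parameters `reward_correct`, `penalty_wrong`, and `reward_refuse` are hypothetical, and the sketch assumes the model can supply a confidence estimate in [0, 1].

```python
def decide(confidence: float,
           reward_correct: float = 1.0,
           penalty_wrong: float = -1.0,
           reward_refuse: float = 0.0) -> str:
    """Return "answer" if answering has a higher expected payoff than refusing.

    All payoffs are hypothetical placeholders for a risk setting.
    """
    if not 0.0 <= confidence <= 1.0:
        raise ValueError("confidence must lie in [0, 1]")
    # Expected payoff of answering: weighted mix of the correct and wrong outcomes.
    expected_answer = (confidence * reward_correct
                       + (1.0 - confidence) * penalty_wrong)
    return "answer" if expected_answer > reward_refuse else "refuse"


# Same confidence, two risk levels: only the penalty for a wrong answer changes.
print(decide(0.6, penalty_wrong=-1.0))  # low risk
print(decide(0.6, penalty_wrong=-4.0))  # high risk
```

With a confidence of 0.6, answering has a positive expected payoff when a wrong answer costs 1 point but a negative one when it costs 4 points, so the same belief leads to different decisions as the risk level changes.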
Problem

Research questions and friction points this paper is trying to address.

Investigating risk-aware decision-making in language models
Exposing weaknesses in existing language models' refusal strategies
Proposing skill-decomposition solutions for better decision-making
Innovation

Methods, ideas, or system contributions that make the work stand out.

Formalize the risk-aware decision-making task
Propose skill-decomposition solutions
Use explicit prompt chaining to handle the task effectively
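The prompt-chaining idea listed above can be sketched as a sequence of separate model calls, each handling one skill. This is a hedged illustration under stated assumptions: the three-step split (answer, estimate confidence, decide), the prompt wording, the function `chained_decision`, and the stand-in `stub_lm` are all hypothetical and are not taken from the paper.

```python
from typing import Callable


def chained_decision(question: str, penalty_wrong: float,
                     lm: Callable[[str], str]) -> str:
    """Run a hypothetical three-step prompt chain and return an answer or "REFUSE"."""
    # Step 1 (assumed skill): attempt the underlying task.
    draft = lm(f"Answer the question concisely.\nQuestion: {question}")

    # Step 2 (assumed skill): estimate confidence in the drafted answer.
    raw = lm(f"Question: {question}\nProposed answer: {draft}\n"
             "Give the probability (0 to 1) that the answer is correct. "
             "Reply with a number only.")
    try:
        confidence = min(max(float(raw.strip()), 0.0), 1.0)
    except ValueError:
        confidence = 0.0  # unparseable reply: treat as no confidence

    # Step 3 (assumed skill): decide under the stated risk level.
    verdict = lm(f"A correct answer scores +1, a wrong answer scores "
                 f"{penalty_wrong}, and refusing scores 0.\n"
                 f"Your confidence is {confidence:.2f}. "
                 "Reply 'answer' or 'refuse'.")
    return draft if verdict.strip().lower() == "answer" else "REFUSE"


def stub_lm(prompt: str) -> str:
    """Canned stand-in for a real model call, used only to exercise the chain."""
    if prompt.startswith("Answer the question"):
        return "Paris"
    if "probability" in prompt:
        return "0.9"
    return "answer"


print(chained_decision("What is the capital of France?", -4.0, stub_lm))
```

Passing the model as a plain callable keeps the sketch independent of any particular API; a real client would replace `stub_lm`.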