OTora: A Unified Red Teaming Framework for Reasoning-Level Denial-of-Service in LLM Agents

📅 2026-05-09

📈 Citations: 0

✨ Influential: 0

career value

187K/year

🤖 AI Summary

This work addresses reasoning-layer denial-of-service (R-DoS) attacks against large language model agents in multi-step tool-augmented tasks, where adversaries significantly inflate inference costs without compromising task correctness. To this end, the paper introduces OTora, the first unified red-teaming framework for R-DoS. OTora integrates adversarial trigger optimization with in-context learning (ICL)-guided genetic search, leveraging perception-aware scoring, dynamic target co-evolution, and agent-aware payload generation to craft stealthy attacks efficiently in both black-box and white-box settings. Experiments demonstrate that OTora induces up to a 10× increase in inference tokens and an order-of-magnitude latency overhead on agent benchmarks such as WebShop, Email, and OS, while preserving task accuracy nearly on par with baseline performance, using foundation models including LLaMA-70B and GPT-OSS-120B.

📝 Abstract

Large Language Models (LLMs) are increasingly deployed as autonomous agents that execute tool-augmented, multi-step tasks, where latency is a critical factor for real-world applications. Yet an overlooked threat is Reasoning-Level Denial-of-Service (R-DoS), in which an attacker preserves task correctness but degrades availability by inflating an agent's reasoning depth or tool-use budget. We introduce OTora, the first unified, two-stage red-teaming framework for instantiating R-DoS attacks. Stage I optimizes an adversarial trigger that induces targeted tool invocations using insertion-aware scoring and dynamic target co-evolution, supporting both black-box and white-box settings. Stage II generates agent-aware reasoning payloads via an ICL-guided genetic search that amplifies overthinking while maintaining correct task outcomes. Across WebShop, Email, and OS agents built on multiple backbone models such as LLaMA-70B and GPT-OSS-120B, OTora achieves up to 10 times increases in reasoning tokens and order-of-magnitude latency slowdowns, all while preserving near-baseline task accuracy. Finally, we discuss mitigation strategies for detecting and constraining abnormal reasoning and latency spikes. The code is available at https://github.com/llm2409/OTora.

Problem

Research questions and friction points this paper is trying to address.

Reasoning-Level Denial-of-Service

LLM Agents

Red Teaming

Availability Attack

Tool-Augmented Reasoning

Innovation

Methods, ideas, or system contributions that make the work stand out.

Reasoning-Level Denial-of-Service

Red Teaming

Adversarial Triggers