Adversarial Intent is a Latent Variable: Stateful Trust Inference for Securing Multimodal Agentic RAG

πŸ“… 2026-02-24
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the challenge that existing stateless defense mechanisms struggle to counter adversarial semantic attacks dispersed across retrieval, planning, and generation components in multimodal agent-based RAG systems. The authors formulate the security problem as a partially observable Markov decision process (POMDP), treating adversarial intent as a latent variable for the first time and enabling stateful, layered defense through belief-state maintenance. They propose MMA-RAG^T, a model-agnostic runtime control framework that integrates modular trusted agents with structured large language model reasoning and configurable internal checkpoints. Evaluated on 43,774 test instances, the approach reduces average attack success rates by 6.5Γ— with negligible impact on system utility. Ablation studies confirm that both statefulness and spatial coverage are critical to defense effectiveness.

Technology Category

Application Category

πŸ“ Abstract
Current stateless defences for multimodal agentic RAG fail to detect adversarial strategies that distribute malicious semantics across retrieval, planning, and generation components. We formulate this security challenge as a Partially Observable Markov Decision Process (POMDP), where adversarial intent is a latent variable inferred from noisy multi-stage observations. We introduce MMA-RAG^T, an inference-time control framework governed by a Modular Trust Agent (MTA) that maintains an approximate belief state via structured LLM reasoning. Operating as a model-agnostic overlay, MMA-RAGT mediates a configurable set of internal checkpoints to enforce stateful defence-in-depth. Extensive evaluation on 43,774 instances demonstrates a 6.50x average reduction factor in Attack Success Rate relative to undefended baselines, with negligible utility cost. Crucially, a factorial ablation validates our theoretical bounds: while statefulness and spatial coverage are individually necessary (26.4 pp and 13.6 pp gains respectively), stateless multi-point intervention can yield zero marginal benefit under homogeneous stateless filtering when checkpoint detections are perfectly correlated.
Problem

Research questions and friction points this paper is trying to address.

Adversarial Intent
Multimodal Agentic RAG
Stateful Trust Inference
Security
Latent Variable
Innovation

Methods, ideas, or system contributions that make the work stand out.

stateful trust inference
adversarial intent as latent variable
modular trust agent
defense-in-depth for RAG
partially observable MDP
Inderjeet Singh
Inderjeet Singh
Fujitsu
Generative AIRobust AIPrivate AIDeep LearningCyber Security
V
Vikas Pahuja
Fujitsu Research of Europe, UK
A
Aishvariya Priya Rathina Sabapathy
Fujitsu Research of Europe, UK
C
Chiara Picardi
Fujitsu Research of Europe, UK
Amit Giloni
Amit Giloni
Ben-Gurion University of the Negev
Roman Vainshtein
Roman Vainshtein
Ph.D.
GenAI Security and TrustMachine LearningAutoMLData ScienceAI Robustness and Security
A
AndrΓ©s Murillo
Fujitsu Research of Europe, UK
H
Hisashi Kojima
Fujitsu Limited, Japan
M
Motoyoshi Sekiya
Fujitsu Limited, Japan
Y
Yuki Unno
Fujitsu Limited, Japan
J
Junichi Suga
Fujitsu Limited, Japan