PolarMem: A Training-Free Polarized Latent Graph Memory for Verifiable Multimodal Agents

📅 2026-01-31
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Current vision-language models struggle to distinguish between semantic similarity and factual existence and lack the ability to explicitly represent negation constraints, leading to unverifiable multimodal reasoning. This work proposes a training-free polarized implicit graph memory mechanism that, for the first time, explicitly models negated facts as core cognitive states. By leveraging non-parametric distribution partitioning and a polarized graph structure with inhibitory connections, the method transforms ambiguous perceptual likelihoods into discrete logical constraints, enabling logic-driven, verifiable retrieval. Experiments across eight frozen vision-language models and six benchmarks demonstrate that the approach significantly suppresses hallucinations and enhances the verifiability and robustness of multimodal agent reasoning.

Technology Category

Application Category

📝 Abstract
As multimodal agents evolve from passive observers to long-horizon decision-makers, they require memory systems that provide not just information availability but logical verifiability. A fundamental limitation of current architectures is the epistemic asymmetry inherent in probabilistic vision-language models and dense associative memories: they conflate semantic affinity with factual existence and structurally fail to encode negative constraints. To this end, we introduce PolarMem, a training-free Polarized Latent Graph Memory designed to ground agent reasoning in verifiable evidence. PolarMem transforms fuzzy perceptual likelihoods into discrete logical constraints through non-parametric distributional partitioning. Furthermore, it employs a polarized graph topology with orthogonal inhibitory connections to explicitly store verified negation as a primary cognitive state. At inference time, we enforce a logic-dominant retrieval paradigm, suppressing hallucinatory patterns that violate negative constraints. Extensive evaluation across eight frozen Vision--Language Models and six benchmarks demonstrates that PolarMem functions as a robust cognitive system, establishing a foundation for verifiable multimodal agents. Our code is available at https://github.com/czs-ict/PolarMem.
Problem

Research questions and friction points this paper is trying to address.

verifiable multimodal agents
epistemic asymmetry
negative constraints
logical verifiability
memory systems
Innovation

Methods, ideas, or system contributions that make the work stand out.

Polarized Latent Graph
Verifiable Multimodal Agents
Logical Constraints
Verified Negation
Training-Free Memory
🔎 Similar Papers
No similar papers found.
Z
Zhisheng Chen
Beijing Key Laboratory of Mobile Computing and Pervasive Device, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
T
Tingyu Wu
Beijing Key Laboratory of Mobile Computing and Pervasive Device, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Z
Zijie Zhou
China University of Petroleum, Beijing
Z
Zhengwei Xie
University of Science and Technology of China
Z
Ziyan Weng
City University of Hong Kong
Yingwei Zhang
Yingwei Zhang
Institue of Computing Technology, Chinese Academic of Sciences
Brain Computer InterfaceArtificial IntelligenceUbiquitous Computing