Composition of Memory Experts for Diffusion World Models

📅 2026-05-12
📈 Citations: 0
Influential: 0
📄 PDF

career value

212K/year
🤖 AI Summary
This work addresses a fundamental trade-off in existing world models between preserving fine-grained local details and compressing long-term historical context. The authors propose a novel diffusion-based framework that integrates heterogeneous memory experts—specialized for short-term dynamics, long-term episodic memory, and spatial consistency—through a contrastive product-of-experts mechanism. This approach enables efficient long-context modeling without quadratic computational overhead and avoids mode collapse. Augmented with lightweight test-time adaptation and an external memory store, the method significantly enhances temporal coherence, historical recall, and navigation performance across multiple simulated and real-world benchmarks.
📝 Abstract
World models aim to predict plausible futures consistent with past observations, a capability central to planning and decision-making in reinforcement learning. Yet, existing architectures face a fundamental memory trade-off: transformers preserve local detail but are bottlenecked by quadratic attention, while recurrent and state-space models scale more efficiently but compress history at the cost of fidelity. To overcome this trade-off, we suggest decoupling future-past consistency from any single architecture and instead leveraging a set of specialized experts. We introduce a diffusion-based framework that integrates heterogeneous memory models through a contrastive product-of-experts formulation. Our approach instantiates three complementary roles: a short-term memory expert that captures fine local dynamics, a long-term memory expert that stores episodic history in external diffusion weights via lightweight test-time finetuning, and a spatial long-term memory expert that enforces geometric and spatial coherence. This compositional design avoids mode collapse and scales to long contexts without incurring a quadratic cost. Across simulated and real-world benchmarks, our method improves temporal consistency, recall of past observations, and navigation performance, establishing a novel paradigm for building and operating memory-augmented diffusion world models.
Problem

Research questions and friction points this paper is trying to address.

world models
memory trade-off
diffusion models
long-term memory
temporal consistency
Innovation

Methods, ideas, or system contributions that make the work stand out.

diffusion world models
memory experts
product-of-experts
long-term memory
spatial coherence
🔎 Similar Papers
2022-09-02ACM Computing SurveysCitations: 1628
2024-05-10arXiv.orgCitations: 2