Chameleon: Episodic Memory for Long-Horizon Robotic Manipulation

📅 2026-03-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of non-Markovian observations in long-horizon robotic tasks, where occlusions or environmental changes hinder reliable decision-making. To this end, the authors propose Chameleon, a system inspired by human episodic memory that constructs a geometry-anchored, multimodal memory mechanism to preserve fine-grained perceptual cues. Chameleon incorporates a differentiable memory stack enabling goal-directed episodic recall, thereby circumventing the loss of critical contextual information inherent in conventional semantic compression approaches. Experimental evaluations on the Camo-Dataset and a real-world UR5e robotic platform demonstrate that Chameleon substantially outperforms strong baselines, significantly enhancing decision reliability and control performance in perceptually ambiguous scenarios.

Technology Category

Application Category

📝 Abstract
Robotic manipulation often requires memory: occlusion and state changes can make decision-time observations perceptually aliased, making action selection non-Markovian at the observation level because the same observation may arise from different interaction histories. Most embodied agents implement memory via semantically compressed traces and similarity-based retrieval, which discards disambiguating fine-grained perceptual cues and can return perceptually similar but decision-irrelevant episodes. Inspired by human episodic memory, we propose Chameleon, which writes geometry-grounded multimodal tokens to preserve disambiguating context and produces goal-directed recall through a differentiable memory stack. We also introduce Camo-Dataset, a real-robot UR5e dataset spanning episodic recall, spatial tracking, and sequential manipulation under perceptual aliasing. Across tasks, Chameleon consistently improves decision reliability and long-horizon control over strong baselines in perceptually confusable settings.
Problem

Research questions and friction points this paper is trying to address.

perceptual aliasing
episodic memory
robotic manipulation
non-Markovian decision making
long-horizon control
Innovation

Methods, ideas, or system contributions that make the work stand out.

episodic memory
perceptual aliasing
multimodal tokens
differentiable memory stack
long-horizon manipulation
🔎 Similar Papers
No similar papers found.
X
Xinying Guo
MARS Lab, Nanyang Technological University
C
Chenxi Jiang
MARS Lab, Nanyang Technological University
H
Hyun Bin Kim
MARS Lab, Nanyang Technological University
Ying Sun
Ying Sun
Institute for Infocomm Research, Agency for Science, Technology and Research (A*STAR), Singapore
Image AnalysisMachine Learning
Y
Yang Xiao
MARS Lab, Nanyang Technological University
Yuhang Han
Yuhang Han
Northwestern Polytechnical University
Event-based taskEffcient MLLM
Jianfei Yang
Jianfei Yang
Assistant Professor, Director of MARS Lab, Nanyang Technological University
Physical AIEmbodied AIMultimodal AI