Hierarchical Successor Representation for Robust Transfer

๐Ÿ“… 2026-02-13
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF

Technology Category

Application Category

๐Ÿ“ Abstract
The successor representation (SR) provides a powerful framework for decoupling predictive dynamics from rewards, enabling rapid generalisation across reward configurations. However, the classical SR is limited by its inherent policy dependence: policies change due to ongoing learning, environmental non-stationarities, and changes in task demands, making established predictive representations obsolete. Furthermore, in topologically complex environments, SRs suffer from spectral diffusion, leading to dense and overlapping features that scale poorly. Here we propose the Hierarchical Successor Representation (HSR) for overcoming these limitations. By incorporating temporal abstractions into the construction of predictive representations, HSR learns stable state features which are robust to task-induced policy changes. Applying non-negative matrix factorisation (NMF) to the HSR yields a sparse, low-rank state representation that facilitates highly sample-efficient transfer to novel tasks in multi-compartmental environments. Further analysis reveals that HSR-NMF discovers interpretable topological structures, providing a policy-agnostic hierarchical map that effectively bridges model-free optimality and model-based flexibility. Beyond providing a useful basis for task-transfer, we show that HSR's temporally extended predictive structure can also be leveraged to drive efficient exploration, effectively scaling to large, procedurally generated environments.
Problem

Research questions and friction points this paper is trying to address.

successor representation
policy dependence
spectral diffusion
transfer learning
temporal abstraction
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical Successor Representation
Temporal Abstraction
Non-negative Matrix Factorisation
Policy-agnostic Representation
Sample-efficient Transfer
๐Ÿ”Ž Similar Papers
No similar papers found.
C
Changmin Yu
Computational and Biological Learning Lab, Department of Engineering, University of Cambridge, Cambridge, United Kingdom
Mรกtรฉ Lengyel
Mรกtรฉ Lengyel
Professor of Computational Neuroscience, University of Cambridge; Senior Research Fellow, CEU
computational neurosciencelearningmemorycomputational cognitive science