Hierarchical Successor Representation for Robust Transfer

📅 2026-02-13

📈 Citations: 0

✨ Influential: 0

📄 PDF

career value

217K/year

Technology Category

Application Category

📝 Abstract

The successor representation (SR) provides a powerful framework for decoupling predictive dynamics from rewards, enabling rapid generalisation across reward configurations. However, the classical SR is limited by its inherent policy dependence: policies change due to ongoing learning, environmental non-stationarities, and changes in task demands, making established predictive representations obsolete. Furthermore, in topologically complex environments, SRs suffer from spectral diffusion, leading to dense and overlapping features that scale poorly. Here we propose the Hierarchical Successor Representation (HSR) for overcoming these limitations. By incorporating temporal abstractions into the construction of predictive representations, HSR learns stable state features which are robust to task-induced policy changes. Applying non-negative matrix factorisation (NMF) to the HSR yields a sparse, low-rank state representation that facilitates highly sample-efficient transfer to novel tasks in multi-compartmental environments. Further analysis reveals that HSR-NMF discovers interpretable topological structures, providing a policy-agnostic hierarchical map that effectively bridges model-free optimality and model-based flexibility. Beyond providing a useful basis for task-transfer, we show that HSR's temporally extended predictive structure can also be leveraged to drive efficient exploration, effectively scaling to large, procedurally generated environments.

Problem

Research questions and friction points this paper is trying to address.

successor representation

policy dependence

spectral diffusion

transfer learning

temporal abstraction

Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical Successor Representation

Temporal Abstraction

Non-negative Matrix Factorisation