🤖 AI Summary
This work addresses the identifiability problem in latent action policy learning (LAPO): learning action representations from video data, where existing methods lack theoretical guarantees that the recovered representations are meaningful. We formally state sufficient conditions for the identifiability of latent action representations and prove that, under mild assumptions, the entropy-regularized LAPO objective uniquely recovers action representations satisfying desirable properties, including discreteness, causal interpretability, and statistical robustness. Our analysis shows that entropy regularization implicitly imposes structural constraints on the latent action distribution, resolving representation ambiguity; this mechanism helps explain the empirical success of discrete action representations. By combining information-theoretic principles with statistical learning theory, this work establishes the first identifiability guarantee for unsupervised action representation learning, filling a foundational theoretical gap in LAPO.
📝 Abstract
We study the identifiability of latent action policy learning (LAPO), a recently introduced framework for discovering representations of actions from video data. We formally describe desiderata for such representations, their statistical benefits, and potential sources of unidentifiability. Finally, we prove that, under suitable conditions, an entropy-regularized LAPO objective identifies action representations satisfying our desiderata. Our analysis helps explain why discrete action representations perform well in practice.
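To make the role of the entropy term concrete, one illustrative form of an entropy-regularized LAPO-style objective is sketched below. The specific notation ($q_\phi$ for the inverse-dynamics encoder, $p_\theta$ for the forward model, $\lambda$ for the regularization weight) is assumed for illustration and is not taken from the source.

```latex
% Hypothetical sketch, not the paper's exact objective:
% q_phi infers a latent action z_t from consecutive observations,
% p_theta predicts the next observation from (o_t, z_t), and an
% entropy penalty H(.) pushes the latent-action posterior toward
% low entropy, i.e. near-discrete action representations.
\mathcal{L}(\theta, \phi)
  = \mathbb{E}_{(o_t,\, o_{t+1})}\!\left[
      -\log p_\theta\!\left(o_{t+1} \mid o_t, z_t\right)
      + \lambda\, H\!\left(q_\phi\!\left(z_t \mid o_t, o_{t+1}\right)\right)
    \right],
  \qquad z_t \sim q_\phi(\,\cdot \mid o_t, o_{t+1})
```

Under this reading, minimizing the entropy term concentrates the posterior over latent actions, which is one way the structural constraint on the latent action distribution described in the summary can arise.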