Unsupervised Hierarchical Skill Discovery

๐Ÿ“… 2026-01-30
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses the challenge of automatically discovering reusable skills and their hierarchical structure from raw trajectories without any labels, rewards, or action supervision. To this end, we propose an unsupervised grammar inductionโ€“based approach that, for the first time, enables hierarchical skill segmentation and modeling in the complete absence of supervisory signals. Our method effectively identifies low-level behavioral primitives and their compositional higher-level skills in high-dimensional pixel-based environments such as Craftax and vanilla Minecraft, yielding substantially improved semantic clarity and structural coherence. Experimental results demonstrate that the discovered skills outperform existing baselines in segmentation accuracy, reusability, and hierarchical quality, and further facilitate faster and more stable training in downstream reinforcement learning tasks.

Technology Category

Application Category

๐Ÿ“ Abstract
We consider the problem of unsupervised skill segmentation and hierarchical structure discovery in reinforcement learning. While recent approaches have sought to segment trajectories into reusable skills or options, most rely on action labels, rewards, or handcrafted annotations, limiting their applicability. We propose a method that segments unlabelled trajectories into skills and induces a hierarchical structure over them using a grammar-based approach. The resulting hierarchy captures both low-level behaviours and their composition into higher-level skills. We evaluate our approach in high-dimensional, pixel-based environments, including Craftax and the full, unmodified version of Minecraft. Using metrics for skill segmentation, reuse, and hierarchy quality, we find that our method consistently produces more structured and semantically meaningful hierarchies than existing baselines. Furthermore, as a proof of concept for utility, we demonstrate that these discovered hierarchies accelerate and stabilise learning on downstream reinforcement learning tasks.
Problem

Research questions and friction points this paper is trying to address.

unsupervised skill discovery
hierarchical structure
skill segmentation
reinforcement learning
trajectory analysis
Innovation

Methods, ideas, or system contributions that make the work stand out.

unsupervised skill discovery
hierarchical reinforcement learning
grammar-based segmentation
skill composition
trajectory segmentation
๐Ÿ”Ž Similar Papers
No similar papers found.