HELM: Hierarchical and Explicit Label Modeling with Graph Learning for Multi-Label Image Classification

📅 2026-03-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenges of modeling multi-path hierarchical structures and underutilizing unlabeled data in multi-label remote sensing image classification. To this end, the authors propose an end-to-end scalable framework that integrates hierarchy-specific class tokens into a Vision Transformer and explicitly captures multi-path dependencies among labels via a graph convolutional network. Additionally, a self-supervised learning branch is embedded to effectively leverage unlabeled data. This approach represents the first attempt to jointly incorporate hierarchical label structure and semi-supervised learning in remote sensing multi-label classification. It achieves state-of-the-art performance across four benchmark datasets—UCM, AID, DFC-15, and MLRSNet—and demonstrates particularly significant improvements over existing methods in label-scarce scenarios.

Technology Category

Application Category

📝 Abstract
Hierarchical multi-label classification (HMLC) is essential for modeling complex label dependencies in remote sensing. Existing methods, however, struggle with multi-path hierarchies where instances belong to multiple branches, and they rarely exploit unlabeled data. We introduce HELM (\textit{Hierarchical and Explicit Label Modeling}), a novel framework that overcomes these limitations. HELM: (i) uses hierarchy-specific class tokens within a Vision Transformer to capture nuanced label interactions; (ii) employs graph convolutional networks to explicitly encode the hierarchical structure and generate hierarchy-aware embeddings; and (iii) integrates a self-supervised branch to effectively leverage unlabeled imagery. We perform a comprehensive evaluation on four remote sensing image (RSI) datasets (UCM, AID, DFC-15, MLRSNet). HELM achieves state-of-the-art performance, consistently outperforming strong baselines in both supervised and semi-supervised settings, demonstrating particular strength in low-label scenarios.
Problem

Research questions and friction points this paper is trying to address.

Hierarchical Multi-Label Classification
Multi-Path Hierarchies
Unlabeled Data
Remote Sensing Image
Label Dependencies
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical Multi-Label Classification
Vision Transformer
Graph Convolutional Network
Self-Supervised Learning
Remote Sensing Image
🔎 Similar Papers
No similar papers found.
M
Marjan Stoimchev
Jožef Stefan Institute, Ljubljana, Slovenia; Jožef Stefan International Postgraduate School, Ljubljana, Slovenia
Boshko Koloski
Boshko Koloski
Researcher, Jozef Stefan Institute
Machine LearningNatural Language ProcessingKnowledge Graphs
J
Jurica Levatić
Jožef Stefan Institute, Ljubljana, Slovenia
Dragi Kocev
Dragi Kocev
Jožef Stefan Institute
Machine learningEnsemble methodsStructured output predictionComputer vision
S
Sašo Džeroski
Jožef Stefan Institute, Ljubljana, Slovenia