A Novel Large-scale Crop Dataset and Dual-stream Transformer Method for Fine-grained Hierarchical Crop Classification from Integrated Hyperspectral EnMAP Data and Multispectral Sentinel-2 Time Series

πŸ“… 2025-06-06
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Fine-grained crop classification faces challenges in jointly modeling multi-temporal phenological dynamics and nanoscale spectral variations. Method: We propose a dual-stream Transformer framework coupling a spectral-spatial Transformer (for EnMAP hyperspectral data) with a temporal Swin Transformer (for Sentinel-2 time-series imagery), integrated via a hierarchical fusion classification head supporting joint prediction across a four-level crop taxonomy. Contribution/Results: We introduce H2Cropβ€”the first million-scale, fine-grained, multimodally aligned hyperspectral crop benchmark dataset. Experiments show our method improves mean F1-score by 4.2% (up to +6.3%) over a Sentinel-2-only baseline on the four-level classification task. Hyperspectral gains remain consistently significant across varying temporal windows and crop change scenarios, establishing a scalable, multi-source synergistic modeling paradigm for precision agriculture and food security monitoring.

Technology Category

Application Category

πŸ“ Abstract
Fine-grained crop classification is crucial for precision agriculture and food security monitoring. It requires simultaneous capture of both phenological dynamics (obtained from multi-temporal satellite data like Sentinel-2) and subtle spectral variations (demanding nanometer-scale spectral resolution from hyperspectral imagery). Research combining these two modalities remains scarce currently due to challenges in hyperspectral data acquisition and crop types annotation costs. To address these issues, we construct a hierarchical hyperspectral crop dataset (H2Crop) by integrating 30m-resolution EnMAP hyperspectral data with Sentinel-2 time series. With over one million annotated field parcels organized in a four-tier crop taxonomy, H2Crop establishes a vital benchmark for fine-grained agricultural crop classification and hyperspectral image processing. We propose a dual-stream Transformer architecture that synergistically processes these modalities. It coordinates two specialized pathways: a spectral-spatial Transformer extracts fine-grained signatures from hyperspectral EnMAP data, while a temporal Swin Transformer extracts crop growth patterns from Sentinel-2 time series. The designed hierarchy classification heads with hierarchical fusion then simultaneously delivers multi-level classification across all taxonomic tiers. Experiments demonstrate that adding hyperspectral EnMAP data to Sentinel-2 time series yields a 4.2% average F1-scores improvement (peaking at 6.3%). Extensive comparisons also confirming our method's higher accuracy over existing deep learning approaches for crop type classification and the consistent benefits of hyperspectral data across varying temporal windows and crop change scenarios. Codes and dataset will be available at https://github.com/flyakon/H2Crop and www.glass.hku.hk Keywords: Crop type classification, precision agriculture, remote sensing, deep learning, hyperspectral data, Sentinel-2 time series, fine-grained crops
Problem

Research questions and friction points this paper is trying to address.

Develops a dual-stream Transformer for fine-grained crop classification
Integrates hyperspectral and multispectral data to improve classification accuracy
Addresses challenges in hyperspectral data acquisition and crop annotation costs
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dual-stream Transformer for multi-modal data fusion
Hierarchical hyperspectral crop dataset (H2Crop) integration
Hierarchical classification heads with multi-level fusion
πŸ”Ž Similar Papers
No similar papers found.
W
Wenyuan Li
Jockey Club STEM Lab of Quantitative Remote Sensing, Department of Geography, The University of Hong Kong, Hong Kong, China
Shunlin Liang
Shunlin Liang
Chair Professor, Department of Geography, The University of Hong Kong
Y
Yuxiang Zhang
Jockey Club STEM Lab of Quantitative Remote Sensing, Department of Geography, The University of Hong Kong, Hong Kong, China
Liqin Liu
Liqin Liu
Department of Aerospace Intelligent Science and Technology, School of Astronautics, Beihang University, Beijing, China
K
Keyan Chen
Department of Aerospace Intelligent Science and Technology, School of Astronautics, Beihang University, Beijing, China
Y
Yongzhe Chen
Jockey Club STEM Lab of Quantitative Remote Sensing, Department of Geography, The University of Hong Kong, Hong Kong, China
Han Ma
Han Ma
The Chinese university of hong kong
RoboticsPath Planning
J
Jianglei Xu
Jockey Club STEM Lab of Quantitative Remote Sensing, Department of Geography, The University of Hong Kong, Hong Kong, China
Yichuan Ma
Yichuan Ma
Fudan University
LLMSynthetic Data
S
Shikang Guan
Jockey Club STEM Lab of Quantitative Remote Sensing, Department of Geography, The University of Hong Kong, Hong Kong, China
Z
Zhenwei Shi
Department of Aerospace Intelligent Science and Technology, School of Astronautics, Beihang University, Beijing, China