🤖 AI Summary
To address the fundamental trade-off between spectral resolution and spatiotemporal coverage in satellite-based greenhouse gas (GHG) monitoring, this paper proposes an end-to-end multispectral-to-hyperspectral spectral synthesis method based on a vision Transformer (ViT). Our approach innovatively introduces band-masked autoencoding (BMAE) pretraining into satellite spectral reconstruction, enabling fully self-supervised learning without labeled data. We further integrate spatiotemporally aligned multispectral–hyperspectral paired fine-tuning with a spectral-aware ViT architecture to enhance fine-grained spectral fidelity. Experimental results demonstrate that the synthesized hyperspectral data reduce the mean retrieval error of CH₄ and CO₂ column concentrations by 37%, significantly improving the accuracy and resolution of global, high-temporal-frequency GHG monitoring. This work advances scalable, label-efficient spectral super-resolution for operational atmospheric remote sensing.
📝 Abstract
Hyperspectral imaging provides detailed spectral information and holds significant potential for monitoring of greenhouse gases (GHGs). However, its application is constrained by limited spatial coverage and infrequent revisit times. In contrast, multispectral imaging offers broader spatial and temporal coverage but often lacks the spectral detail that can enhance GHG detection. To address these challenges, this study proposes a spectral transformer model that synthesizes hyperspectral data from multispectral inputs. The model is pre-trained via a band-wise masked autoencoder and subsequently fine-tuned on spatio-temporally aligned multispectral-hyperspectral image pairs. The resulting synthetic hyperspectral data retain the spatial and temporal benefits of multispectral imagery and improve GHG prediction accuracy relative to using multispectral data alone. This approach effectively bridges the trade-off between spectral resolution and coverage, highlighting its potential to advance atmospheric monitoring by combining the strengths of hyperspectral and multispectral systems with self-supervised deep learning.