ChessMamba: Structure-Aware Interleaving of State Spaces for Change Detection in Remote Sensing Images

📅 2025-11-24

📈 Citations: 0

✨ Influential: 0

career value

191K/year

🤖 AI Summary

Multi-temporal remote sensing change detection suffers from fine-grained recognition difficulties caused by heterogeneity and spatiotemporal misalignment, while existing sequential modeling approaches often compromise local structural consistency. To address this, we propose a structure-aware interleaved state-space modeling framework. First, we introduce a novel checkerboard serpentine scanning strategy that preserves local structural integrity across multi-temporal features and enables single-pass forward alignment. Second, a multi-dilation convolution fusion module explicitly captures center-to-corner contextual relationships, enhancing robustness to misalignment. Third, we integrate a SpatialMamba encoder with a lightweight cross-source interaction module for efficient heterogeneous temporal feature fusion. Our method achieves state-of-the-art performance on binary change detection, semantic change detection, and multimodal building damage assessment—demonstrating significant improvements in change localization accuracy and cross-scenario generalization.

Technology Category

Application Category

📝 Abstract

Change detection (CD) in multitemporal remote sensing imagery presents significant challenges for fine-grained recognition, owing to heterogeneity and spatiotemporal misalignment. However, existing methodologies based on vision transformers or state-space models typically disrupt local structural consistency during temporal serialization, obscuring discriminative cues under misalignment and hindering reliable change localization. To address this, we introduce ChessMamba, a structure-aware framework leveraging interleaved state-space modeling for robust CD with multi-temporal inputs. ChessMamba integrates a SpatialMamba encoder with a lightweight cross-source interaction module, featuring two key innovations: (i) Chessboard interleaving with snake scanning order, which serializes multi-temporal features into a unified sequence within a single forward pass, thereby shortening interaction paths and enabling direct comparison for accurate change localization; and (ii) Structure-aware fusion via multi-dilated convolutions, selectively capturing center-and-corner neighborhood contexts within each mono-temporal. Comprehensive evaluations on three CD tasks, including binary CD, semantic CD and multimodal building damage assessment, demonstrate that ChessMamba effectively fuses heterogeneous features and achieves substantial accuracy improvements over state-of-the-art methods.The relevant code will be available at: github.com/DingLei14/ChessMamba.

Problem

Research questions and friction points this paper is trying to address.

Addresses change detection challenges in multitemporal remote sensing imagery

Solves structural consistency disruption during temporal serialization processes

Improves accurate change localization under spatiotemporal misalignment conditions

Innovation

Methods, ideas, or system contributions that make the work stand out.

Interleaved state-space modeling for robust change detection

Chessboard interleaving with snake scanning order

Structure-aware fusion using multi-dilated convolutions

🔎 Similar Papers

Hierarchical Attention Diffusion Networks with Object Priors for Video Change Detection