UniRSCD: A Unified Novel Architectural Paradigm for Remote Sensing Change Detection

📅 2025-11-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing remote sensing change detection (RSCD) methods rely on expert-designed, task-specific decoders to compensate for encoding information loss, resulting in high model selection uncertainty and poor cross-task generalization. To address this, we propose UniRSCD—a unified framework built upon a state-space model backbone. We introduce the first frequency-aware change prompt generator as a unified encoder that dynamically fuses high- and low-frequency features to construct a task-adaptive shared representation space, eliminating the need for task-customized decoders. Furthermore, hierarchical feature interaction and task-adaptive output mapping enable end-to-end unified modeling across multi-granularity tasks—including binary change detection, semantic change detection, and building damage assessment. UniRSCD achieves state-of-the-art performance on five benchmark datasets (LEVIR-CD, SECOND, xBD, WHU-CD, and DSIFN-CD), significantly enhancing both generality and practicality of RSCD methods.

Technology Category

Application Category

📝 Abstract
In recent years, remote sensing change detection has garnered significant attention due to its critical role in resource monitoring and disaster assessment. Change detection tasks exist with different output granularities such as BCD, SCD, and BDA. However, existing methods require substantial expert knowledge to design specialized decoders that compensate for information loss during encoding across different tasks. This not only introduces uncertainty into the process of selecting optimal models for abrupt change scenarios (such as disaster outbreaks) but also limits the universality of these architectures. To address these challenges, this paper proposes a unified, general change detection framework named UniRSCD. Building upon a state space model backbone, we introduce a frequency change prompt generator as a unified encoder. The encoder dynamically scans bitemporal global context information while integrating high-frequency details with low-frequency holistic information, thereby eliminating the need for specialized decoders for feature compensation. Subsequently, the unified decoder and prediction head establish a shared representation space through hierarchical feature interaction and task-adaptive output mapping. This integrating various tasks such as binary change detection and semantic change detection into a unified architecture, thereby accommodating the differing output granularity requirements of distinct change detection tasks. Experimental results demonstrate that the proposed architecture can adapt to multiple change detection tasks and achieves leading performance on five datasets, including the binary change dataset LEVIR-CD, the semantic change dataset SECOND, and the building damage assessment dataset xBD.
Problem

Research questions and friction points this paper is trying to address.

Unified architecture eliminates specialized decoders for different change detection tasks
Dynamic encoder integrates high-frequency details with low-frequency global information
Framework accommodates varying output granularities across multiple remote sensing datasets
Innovation

Methods, ideas, or system contributions that make the work stand out.

State space model backbone for global context scanning
Frequency change prompt generator integrating high-low frequency details
Unified decoder with hierarchical interaction and task-adaptive mapping
🔎 Similar Papers
No similar papers found.
Y
Yuan Qu
College of Artificial Intelligence, Southwest University
Zhipeng Zhang
Zhipeng Zhang
School of Artificial Intelligence, Shanghai Jiao Tong University
Computer Vision,Object Tracking and Segmentation
C
Chaojun Xu
College of Artificial Intelligence, Southwest University
Q
Qiao Wan
LIESMARS, Wuhan University
M
Mengying Xie
CS, Chongqing University
Y
Yuzeng Chen
School of Geodesy and Geomatics, Wuhan University
Z
Zhenqi Liu
College of Artificial Intelligence, Southwest University
Yanfei Zhong
Yanfei Zhong
Full Professor, RSIDEA, LIESMARS, Wuhan University, China
hyperspectralhigh spatial resolutionremote sensingimage processingcomputational intelligence