STARS: Shared-specific Translation and Alignment for missing-modality Remote Sensing Semantic Segmentation

📅 2026-01-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the significant performance degradation in remote sensing multimodal semantic segmentation when modalities such as optical, SAR, or DSM are missing, a challenge exacerbated by feature collapse and over-generalized feature recovery in existing methods. To tackle this, the authors propose the STARS framework, which employs a shared-specific translation mechanism coupled with an asymmetric alignment strategy. Specifically, bidirectional modality translation with gradient stopping is introduced to prevent feature collapse, while a pixel-level semantic alignment (PSA) strategy—integrating class-balanced sampling and a cross-modal semantic alignment loss—is designed to mitigate alignment failure caused by class imbalance. Experiments demonstrate that the proposed method substantially improves segmentation accuracy under missing-modality conditions, particularly enhancing recognition of minority classes, while also reducing sensitivity to hyperparameters and exhibiting superior robustness and generalization capability.

Technology Category

Application Category

📝 Abstract
Multimodal remote sensing technology significantly enhances the understanding of surface semantics by integrating heterogeneous data such as optical images, Synthetic Aperture Radar (SAR), and Digital Surface Models (DSM). However, in practical applications, the missing of modality data (e.g., optical or DSM) is a common and severe challenge, which leads to performance decline in traditional multimodal fusion models. Existing methods for addressing missing modalities still face limitations, including feature collapse and overly generalized recovered features. To address these issues, we propose \textbf{STARS} (\textbf{S}hared-specific \textbf{T}ranslation and \textbf{A}lignment for missing-modality \textbf{R}emote \textbf{S}ensing), a robust semantic segmentation framework for incomplete multimodal inputs. STARS is built on two key designs. First, we introduce an asymmetric alignment mechanism with bidirectional translation and stop-gradient, which effectively prevents feature collapse and reduces sensitivity to hyperparameters. Second, we propose a Pixel-level Semantic sampling Alignment (PSA) strategy that combines class-balanced pixel sampling with cross-modality semantic alignment loss, to mitigate alignment failures caused by severe class imbalance and improve minority-class recognition.
Problem

Research questions and friction points this paper is trying to address.

missing-modality
remote sensing
semantic segmentation
multimodal fusion
feature collapse
Innovation

Methods, ideas, or system contributions that make the work stand out.

missing-modality
asymmetric alignment
bidirectional translation
pixel-level semantic alignment
multimodal remote sensing
🔎 Similar Papers
No similar papers found.
Tong Wang
Tong Wang
武汉大学
Remote SensingDeep Learning
Xiaodong Zhang
Xiaodong Zhang
The Ohio State University
Memory SystemsStorage SystemsComputer Systems
Guanzhou Chen
Guanzhou Chen
Shanghai Jiao Tong University; Shanghai AI Laboratory
Jiaqi Wang
Jiaqi Wang
Wuhan University
Remote SensingCV
C
Chenxi Liu
Electronic Information School, Wuhan University, 299 Bayi Road, Wuchang District, Wuhan, 430072, China
X
Xiaoliang Tan
State Key Laboratory of information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, 299 Bayi Road, Wuchang District, Wuhan, 430079, China
W
Wenchao Guo
State Key Laboratory of information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, 299 Bayi Road, Wuchang District, Wuhan, 430079, China
Xuyang Li
Xuyang Li
University of North Carolina at Charlotte
Scientific Machine LearningStructural Health MonitoringSystem IdentificationFEA
X
Xuanrui Wang
State Key Laboratory of information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, 299 Bayi Road, Wuchang District, Wuhan, 430079, China
Z
Zifan Wang
State Key Laboratory of information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, 299 Bayi Road, Wuchang District, Wuhan, 430079, China; Hubei FreerTech Co. Ltd, No.2 Wenhua Road, Guandong Industrial Park, East Lake High-tech Development Zone, Wuhan, 430073, Hubei, China