Transferring Physical Priors into Remote Sensing Segmentation via Large Language Models

๐Ÿ“… 2026-03-28
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This study addresses two key challenges in semantic segmentation of remote sensing imagery: the difficulty of effectively fusing optical images with misaligned physical variables (e.g., DEM, SAR, NDVI), and the prohibitive cost of retraining foundation models when integrating new sensors. To overcome these issues, the authors propose PriorSeg, a novel framework that leverages a large language model to construct a Physics-Centric Knowledge Graph (PCKG), which enables the generation of a heterogeneous yet aligned dataset, Phy-Sky-SA. By incorporating physical priors through visionโ€“physics joint training and a residual refinement mechanism guided by physics-consistency loss, PriorSeg enhances segmentation accuracy and physical plausibility without requiring retraining of the underlying foundation model. Ablation studies confirm the effectiveness of each proposed component.
๐Ÿ“ Abstract
Semantic segmentation of remote sensing imagery is fundamental to Earth observation. Achieving accurate results requires integrating not only optical images but also physical variables such as the Digital Elevation Model (DEM), Synthetic Aperture Radar (SAR) and Normalized Difference Vegetation Index (NDVI). Recent foundation models (FMs) leverage pre-training to exploit these variables but still depend on spatially aligned data and costly retraining when involving new sensors. To overcome these limitations, we introduce a novel paradigm for integrating domain-specific physical priors into segmentation models. We first construct a Physical-Centric Knowledge Graph (PCKG) by prompting large language models to extract physical priors from 1,763 vocabularies, and use it to build a heterogeneous, spatial-aligned dataset, Phy-Sky-SA. Building on this foundation, we develop PriorSeg, a physics-aware residual refinement model trained with a joint visual-physical strategy that incorporates a novel physics-consistency loss. Experiments on heterogeneous settings demonstrate that PriorSeg improves segmentation accuracy and physical plausibility without retraining the FMs. Ablation studies verify the effectiveness of the Phy-Sky-SA dataset, the PCKG, and the physics-consistency loss.
Problem

Research questions and friction points this paper is trying to address.

remote sensing segmentation
physical priors
foundation models
semantic segmentation
physics consistency
Innovation

Methods, ideas, or system contributions that make the work stand out.

Physical Priors
Large Language Models
Knowledge Graph
Physics-Consistency Loss
Remote Sensing Segmentation
๐Ÿ”Ž Similar Papers
No similar papers found.
Y
Yuxi Lu
University of Technology Sydney, New South Wales, Australia
K
Kunqi Li
University of Technology Sydney, New South Wales, Australia
Zhidong Li
Zhidong Li
UTS
Machine LearningData science
X
Xiaohan Su
University of Technology Sydney, New South Wales, Australia
Biao Wu
Biao Wu
Artificial intelligence , University of Technology Sydney
Multimodal Information Processing
C
Chenya Huang
University of Technology Sydney, New South Wales, Australia
Bin Liang
Bin Liang
Senior Lecturer at UTS
Computer VisionGraph Neural NetworkMulti-Modal LearningLLMReinforcement Learning