HeatPrompt: Zero-Shot Vision-Language Modeling of Urban Heat Demand from Satellite Images

📅 2026-02-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the widespread lack of building-level thermal load data in urban areas, which hinders accurate heating demand mapping and decarbonization planning. It proposes a novel approach that leverages zero-shot large vision-language models (VLMs) to extract semantic features—such as roof age and building density—from satellite imagery using natural language prompts. These features are integrated with GIS and building metadata to train an MLP regressor for predicting annual heating demand, entirely without labeled thermal data. Evaluated in data-scarce regions, the method achieves substantially higher prediction accuracy than baseline models, improving R² by 93.7% and reducing MAE by 30%. Moreover, the high-impact semantic features identified by the model align closely with spatial patterns of elevated heating demand.

Technology Category

Application Category

📝 Abstract
Accurate heat-demand maps play a crucial role in decarbonizing space heating, yet most municipalities lack detailed building-level data needed to calculate them. We introduce HeatPrompt, a zero-shot vision-language energy modeling framework that estimates annual heat demand using semantic features extracted from satellite images, basic Geographic Information System (GIS), and building-level features. We feed pretrained Large Vision Language Models (VLMs) with a domain-specific prompt to act as an energy planner and extract the visual attributes such as roof age, building density, etc, from the RGB satellite image that correspond to the thermal load. A Multi-Layer Perceptron (MLP) regressor trained on these captions shows an $R^2$ uplift of 93.7% and shrinks the mean absolute error (MAE) by 30% compared to the baseline model. Qualitative analysis shows that high-impact tokens align with high-demand zones, offering lightweight support for heat planning in data-scarce regions.
Problem

Research questions and friction points this paper is trying to address.

heat demand estimation
urban decarbonization
building-level data scarcity
satellite imagery
thermal load mapping
Innovation

Methods, ideas, or system contributions that make the work stand out.

zero-shot vision-language modeling
urban heat demand estimation
satellite image analysis
Large Vision Language Models (VLMs)
prompt engineering
🔎 Similar Papers
No similar papers found.
K
Kundan Thota
Institute for Automation and Applied Informatics (IAI), Karlsruhe Institute of Technology (KIT), Germany
X
Xuanhao Mu
Institute for Automation and Applied Informatics (IAI), Karlsruhe Institute of Technology (KIT), Germany
T
Thorsten Schlachter
Institute for Automation and Applied Informatics (IAI), Karlsruhe Institute of Technology (KIT), Germany
Veit Hagenmeyer
Veit Hagenmeyer
KIT
energy informaticsnonlinear controlsmart grids