Slum Detection and Density Mapping with AlphaEarth Foundations: A Representation Learning Evaluation Across 12 Global Cities

📅 2026-05-11
📈 Citations: 0
Influential: 0
📄 PDF

career value

190K/year
🤖 AI Summary
This study addresses the challenges of weak cross-city generalization, absence of continuous density estimation, and poor global comparability in slum mapping by presenting the first validation of AlphaEarth Foundations—a globally consistent 64-dimensional land surface embedding—for the indirectly related task of slum identification. Leveraging data from 12 cities, the authors systematically evaluate its performance in both classification and sub-pixel density estimation using GRAM pseudo-labels, POI-derived auxiliary features, spatially blocked cross-validation, and multiple model baselines. Results demonstrate optimal within-city cross-year transfer (F1=0.616, R²=0.466), a significant improvement in density modeling from POI integration (ΔR²=+0.064), and strong spatial structural consistency in six-city full-area inference (SSIM=0.926), while also revealing city-scale representation drift.
📝 Abstract
Pixel-level slum mapping has long been constrained by limited cross-city generalisation, the absence of continuous density estimation, and weak global comparability. AlphaEarth Foundations (AEF), a globally consistent 64-dimensional annual surface embedding at 10 m, offers a new analysis-ready basis for lightweight slum monitoring, but its applicability to slum detection - an indirectly coupled task shaped by both built form and socio-economic processes - remains untested. We evaluate AEF on slum classification and sub-pixel density estimation across 12 cities and 69 city-year pairs (2017-2024), using GRAM pseudo-masks as supervisory labels. The evaluation spans four training strategies, two protocols (random split and 3x3 spatial block cross-validation), six auxiliary feature configurations, and five baseline models, complemented by representation-level analyses (PCA, SHAP) and full-AOI mapping. Five findings emerge. (1) Same-city cross-year training is optimal under both protocols (median spatial F1 = 0.616, R^2 = 0.466); temporal expansion outperforms cross-city transfer, indicating city-scale representational drift. (2) Regression R^2 is driven primarily by zero/non-zero boundary discrimination: positive-pixel R^2 is consistently negative across all cities, revealing limited capacity to model intra-pixel density gradients at 10 m. (3) PC36 is consistently top-ranked across tasks; classification saturates at k = 32 while regression remains unsaturated at k = 64. (4) POI features yield the largest density gain (Delta R^2 = +0.064). (5) For six cities meeting dual-task usability thresholds, full-AOI inference across 2017-2024 preserves slum cluster structure (mean SSIM = 0.926). The study delineates the capabilities and complementarity needs of foundation-model embeddings for slum monitoring.
Problem

Research questions and friction points this paper is trying to address.

slum detection
density mapping
cross-city generalisation
global comparability
representation learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

foundation model embeddings
slum density mapping
cross-city generalization
sub-pixel regression
representation learning
S
Shuyang Hou
State Key Laboratory of Information Engineering in Surveying, Mapping, and Remote Sensing, Wuhan University, Wuhan, China
Ziqi Liu
Ziqi Liu
Wuhan University
LLM、GeoAI
Haoyue Jiao
Haoyue Jiao
Wuhan University
GeoAILarge Language ModelCode Generation
Z
Zhangyan Xu
State Key Laboratory of Information Engineering in Surveying, Mapping, and Remote Sensing, Wuhan University, Wuhan, China
X
Xiaopu Zhang
State Key Laboratory of Information Engineering in Surveying, Mapping, and Remote Sensing, Wuhan University, Wuhan, China
L
Lutong Xie
State Key Laboratory of Information Engineering in Surveying, Mapping, and Remote Sensing, Wuhan University, Wuhan, China
Y
Yaxian Qing
State Key Laboratory of Information Engineering in Surveying, Mapping, and Remote Sensing, Wuhan University, Wuhan, China
Jianyuan Liang
Jianyuan Liang
Wuhan University
GIS SystemGIServiceSpatial Data MiningGraph RAG
Xuefeng Guan
Xuefeng Guan
Professor, Wuhan University
High-performance GeoComputationBig-data AnalyticsSpatial Data Mining
Huayi Wu
Huayi Wu
Wuhan University
GISremote sensingcartographyGeomatics