A Large-Scale Remote Sensing Dataset and VLM-based Algorithm for Fine-Grained Road Hierarchy Classification

📅 2026-03-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the challenge of fine-grained hierarchical classification of roads in remote sensing imagery by proposing RoadReasoner, a novel framework that uniquely integrates vision-language models with geometric descriptors to jointly perform road segmentation, topological reconstruction, and three-level hierarchical classification. Key innovations include frequency-sensitive feature enhancement, multi-scale contextual modeling, skeleton-segment-level geometric descriptors, and geometry-aware textual prompts, enabling semantically interpretable hierarchical mapping. The authors also introduce SYSU-HiRoads, the first large-scale hierarchical road dataset, featuring dense masks, vectorized centerlines, and three-tier semantic labels. Experimental results demonstrate that the proposed method achieves 72.6% overall accuracy, 64.2% F1 score, and 60.6% SegAcc on SYSU-HiRoads and CHN6-CUG, respectively, significantly outperforming existing approaches.

Technology Category

Application Category

📝 Abstract
In this work, we present SYSU-HiRoads, a large-scale hierarchical road dataset, and RoadReasoner, a vision-language-geometry framework for automatic multi-grade road mapping from remote sensing imagery. SYSU-HiRoads is built from GF-2 imagery covering 3631 km2 in Henan Province, China, and contains 1079 image tiles at 0.8 m spatial resolution. Each tile is annotated with dense road masks, vectorized centerlines, and three-level hierarchy labels, enabling the joint training and evaluation of segmentation, topology reconstruction, and hierarchy classification. Building on this dataset, RoadReasoner is designed to generate robust road surface masks, topology-preserving road networks, and semantically coherent hierarchy assignments. We strengthen road feature representation and network connectivity by explicitly enhancing frequency-sensitive cues and multi-scale context. Moreover, we perform hierarchy inference at the skeleton-segment level with geometric descriptors and geometry-aware textual prompts, queried by vision-language models to obtain linguistically interpretable grade decisions. Experiments on SYSU-HiRoads and the CHN6-CUG dataset show that RoadReasoner surpasses state-of-the-art road extraction baselines and produces accurate and semantically consistent road hierarchy maps with 72.6% OA, 64.2% F1 score, and 60.6% SegAcc. The dataset and code will be publicly released to support automated transport infrastructure mapping, road inventory updating, and broader infrastructure management applications.
Problem

Research questions and friction points this paper is trying to address.

road hierarchy classification
remote sensing
fine-grained
automated mapping
infrastructure management
Innovation

Methods, ideas, or system contributions that make the work stand out.

vision-language model
road hierarchy classification
remote sensing
topology-preserving road extraction
geometric-aware prompting
🔎 Similar Papers
No similar papers found.
Ting Han
Ting Han
Sun Yat-sen University
point cloudremote sensing
X
Xiangyi Xie
School of Geospatial Engineering and Science, Sun Yat-Sen University, Zhuhai, 519082, China; Transport Bureau of Nanhai District, Foshan, 528200, China
Yiping Chen
Yiping Chen
Sun Yat-sen University
Point CloudsMobile MappingGeomaticsLiDAR3D Vision
Y
Yumeng Du
School of Geospatial Engineering and Science, Sun Yat-Sen University, Zhuhai, 519082, China
J
Jin Ma
School of Geospatial Engineering and Science, Sun Yat-Sen University, Zhuhai, 519082, China
A
Aiguang Li
Laboratory of Intelligent Collaborative Computing, University of Electronic Science and Technology of China, Chengdu, 611731, China
J
Jiaan Liu
United Nations University Institute in Macau, Macau S.A.R., China
Y
Yin Gao