GridNet-HD: A High-Resolution Multi-Modal Dataset for LiDAR-Image Fusion on Power Line Infrastructure

πŸ“… 2026-01-19
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the absence of publicly available multimodal datasets that simultaneously incorporate high-density LiDAR, high-resolution oblique imagery, and fine-grained 3D semantic annotations for power line infrastructure. To bridge this gap, we introduce GridNet-HD, a novel benchmark dataset comprising 250 million LiDAR points and 7,694 oblique images, annotated with 11 semantic classes in 3Dβ€”the first of its kind to be released. Leveraging this dataset, we develop both single-modality and multimodal fusion baselines for 3D semantic segmentation, demonstrating the complementary nature of geometric and appearance cues. Experimental results show that the multimodal model achieves a 5.55% improvement in mean Intersection over Union (mIoU) over the best single-modality baseline, significantly enhancing 3D semantic understanding of power line assets.

Technology Category

Application Category

πŸ“ Abstract
This paper presents GridNet-HD, a multi-modal dataset for 3D semantic segmentation of overhead electrical infrastructures, pairing high-density LiDAR with high-resolution oblique imagery. The dataset comprises 7,694 images and 2.5 billion points annotated into 11 classes, with predefined splits and mIoU metrics. Unimodal (LiDAR-only, image-only) and multi-modal fusion baselines are provided. On GridNet-HD, fusion models outperform the best unimodal baseline by +5.55 mIoU, highlighting the complementarity of geometry and appearance. As reviewed in Sec. 2, no public dataset jointly provides high-density LiDAR and high-resolution oblique imagery with 3D semantic labels for power-line assets. Dataset, baselines, and codes are available: https://huggingface.co/collections/heig-vd-geo/gridnet-hd.
Problem

Research questions and friction points this paper is trying to address.

LiDAR-image fusion
power line infrastructure
3D semantic segmentation
multi-modal dataset
high-resolution imagery
Innovation

Methods, ideas, or system contributions that make the work stand out.

multi-modal fusion
high-density LiDAR
high-resolution oblique imagery
3D semantic segmentation
power line infrastructure
πŸ”Ž Similar Papers
No similar papers found.
A
Antoine Carreaud
ESO lab. EPFL, 1015 Lausanne, Switzerland
S
Shanci Li
University of Applied Sciences Western Switzerland (HES-SO / HEIG-VD), Yverdon-les-Bains, Switzerland
M
Malo De Lacour
University of Applied Sciences Western Switzerland (HES-SO / HEIG-VD), Yverdon-les-Bains, Switzerland
D
Digre Frinde
University of Applied Sciences Western Switzerland (HES-SO / HEIG-VD), Yverdon-les-Bains, Switzerland
Jan Skaloud
Jan Skaloud
Prof. titulaire EPFL
Adrien Gressin
Adrien Gressin
HEIG-VD
PhotogrammetryRemote sensingchange detection3d point cloud registration