Beyond Segmentation: Structurally Informed Facade Parsing from Imperfect Images

📅 2026-04-10

📈 Citations: 0

✨ Influential: 0

career value

168K/year

🤖 AI Summary

Existing methods for building facade parsing struggle to maintain structural consistency under occlusion and perspective distortion, often yielding geometrically irregular layouts. This work proposes a lightweight alignment loss integrated into the YOLOv8 training objective, which injects grid-alignment geometric priors without altering the inference pipeline. By guiding bounding boxes to adhere to regular spatial arrangements, the method effectively corrects alignment errors caused by occlusion and perspective effects on the CMP dataset. It achieves a controllable trade-off between detection accuracy and geometric regularity, significantly enhancing the structural plausibility of parsed facades while preserving high detection performance.

Technology Category

Application Category

📝 Abstract

Standard object detectors typically treat architectural elements independently, often resulting in facade parsings that lack the structural coherence required for downstream procedural reconstruction. We address this limitation by augmenting the YOLOv8 training objective with a custom lightweight alignment loss. This regularization encourages grid-consistent arrangements of bounding boxes during training, effectively injecting geometric priors without altering the standard inference pipeline. Experiments on the CMP dataset demonstrate that our method successfully improves structural regularity, correcting alignment errors caused by perspective and occlusion while maintaining a controllable trade-off with standard detection accuracy.

Problem

Research questions and friction points this paper is trying to address.

facade parsing

structural coherence

object detection

geometric priors

procedural reconstruction

Innovation

Methods, ideas, or system contributions that make the work stand out.

facade parsing

structural coherence

alignment loss