Artifacts of Idiosyncracy in Global Street View Data

📅 2025-05-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper identifies systematic coverage bias in global street-view imagery—arising from inherent urban structural characteristics—where even high-density acquisition yields significant, non-random representational distortions across cities. Method: We quantitatively characterize statistical associations between street-view coverage distribution and urban morphology, road network density, and functional zoning; develop a reproducible framework for coverage bias assessment; and integrate geospatial modeling, multi-city coverage statistics (28 cities), and semi-structured field interviews in Amsterdam to trace bias origins. Contribution/Results: Our analysis reveals that coverage disparities are structurally embedded—not merely sampling artifacts—and systematically correlate with urban form and land-use patterns. The proposed evaluation framework enables rigorous, cross-city bias diagnosis and mitigation. Findings provide a methodological foundation and actionable pathway for constructing fair, representative, and geographically equitable street-view datasets—critical for downstream applications in urban analytics, computer vision, and AI-driven policy.

Technology Category

Application Category

📝 Abstract
Street view data is increasingly being used in computer vision applications in recent years. Machine learning datasets are collected for these applications using simple sampling techniques. These datasets are assumed to be a systematic representation of cities, especially when densely sampled. Prior works however, show that there are clear gaps in coverage, with certain cities or regions being covered poorly or not at all. Here we demonstrate that a cities' idiosyncracies, such as city layout, may lead to biases in street view data for 28 cities across the globe, even when they are densely covered. We quantitatively uncover biases in the distribution of coverage of street view data and propose a method for evaluation of such distributions to get better insight in idiosyncracies in a cities' coverage. In addition, we perform a case study of Amsterdam with semi-structured interviews, showing how idiosyncracies of the collection process impact representation of cities and regions and allowing us to address biases at their source.
Problem

Research questions and friction points this paper is trying to address.

Biases in street view data due to city layout idiosyncracies
Gaps in coverage despite dense sampling in global cities
Impact of collection process on city representation accuracy
Innovation

Methods, ideas, or system contributions that make the work stand out.

Quantitatively uncover biases in street view coverage
Propose evaluation method for coverage distribution insights
Conduct case study with interviews to address biases
🔎 Similar Papers
Tim Alpherts
Tim Alpherts
University of Amsterdam
S
S. Ghebreab
University of Amsterdam, Amsterdam, The Netherlands
N
N. V. Noord
University of Amsterdam, Amsterdam, The Netherlands