Spatial Data Science Languages: commonalities and needs

📅 2025-03-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper identifies and systematically analyzes common challenges in spatial data science across mainstream programming languages—R, Python, and Julia—including inconsistent spherical geometry modeling, ambiguous spatial/temporal semantics, conflation of intensive and extensive attributes, poor interoperability between data cube and vector formats, complex cross-package dependencies, and a persistent divide between GIS and physical modeling communities. Through multi-language ecosystem surveys, cross-community comparative analysis, and software engineering abstraction, we propose, for the first time, a cross-language semantic framework for spatial operations. The framework formally defines support types (point vs. block), specifies attribute-type constraints on operation validity, and refactors spherical Simple Features logic. We distill five foundational insights that establish a methodological basis and practical guidance for tool interoperability, pedagogical alignment, and open-source governance in spatial computing.

Technology Category

Application Category

📝 Abstract
Recent workshops brought together several developers, educators and users of software packages extending popular languages for spatial data handling, with a primary focus on R, Python and Julia. Common challenges discussed included handling of spatial or spatio-temporal support, geodetic coordinates, in-memory vector data formats, data cubes, inter-package dependencies, packaging upstream libraries, differences in habits or conventions between the GIS and physical modelling communities, and statistical models. The following set of insights have been formulated: (i) considering software problems across data science language silos helps to understand and standardise analysis approaches, also outside the domain of formal standardisation bodies; (ii) whether attribute variables have block or point support, and whether they are spatially intensive or extensive has consequences for permitted operations, and hence for software implementing those; (iii) handling geometries on the sphere rather than on the flat plane requires modifications to the logic of {em simple features}, (iv) managing communities and fostering diversity is a necessary, on-going effort, and (v) tools for cross-language development need more attention and support.
Problem

Research questions and friction points this paper is trying to address.

Standardizing spatial data analysis across R, Python, and Julia
Addressing geometric and statistical challenges in spatial data handling
Improving cross-language tools and community diversity in spatial science
Innovation

Methods, ideas, or system contributions that make the work stand out.

Standardize analysis approaches across data science languages
Modify simple features logic for spherical geometries
Enhance cross-language development tools support
🔎 Similar Papers
No similar papers found.
E
E. Pebesma
Institute for Geoinformatics, University of Münster, Germany
Martin Fleischmann
Martin Fleischmann
Research Associate, Charles University
urban morphologygeographic data science
J
Josiah Parry
Environmental Systems Research Institute, Inc. (Esri)
Jakub Nowosad
Jakub Nowosad
University of Münster; Adam Mickiewicz University
GeocomputationPattern AnalysisSpatial Data MiningGeostatisticsR
A
Anita Graser
Center for Digital Safety and Security, AIT Austrian Institute of Technology, Vienna, Austria
D
Dewey D Dunnington
Wherobots, Inc.
M
Maarten Pronk
Hydrology Software, Deltares, the Netherlands, Urban Data Science, Delft University of Technology, the Netherlands
R
Rafael Schouten
Norwegian Institute for Nature Research (NINA), Oslo, Norway
Robin Lovelace
Robin Lovelace
Professor of Transport Data Science, Institute for Transport Studies, University of Leeds
geocomputationenergy and transportopen source softwaretransport modelling
M
M. Appel
Department of Geodesy, Bochum University of Applied Sciences, Germany
L
Lorena Abad
Department of Geoinformatics - Z GIS, University of Salzburg, Austria