ReaGeo: Reasoning-Enhanced End-to-End Geocoding with LLMs

📅 2026-04-23
📈 Citations: 0
Influential: 0
📄 PDF

career value

166K/year
🤖 AI Summary
This work proposes the first end-to-end geocoding framework based on large language models, reframing coordinate prediction as a text generation task. In contrast to traditional multi-stage approaches—which suffer from complex pipelines, error propagation, and heavy reliance on structured geographic knowledge bases—the proposed method eliminates the need for external databases by integrating chain-of-thought reasoning to enhance spatial relationship inference. Furthermore, it employs a reinforcement learning mechanism with a distance-bias-aware reward function to refine coordinate prediction accuracy. The framework not only achieves high precision in mapping explicit addresses to point coordinates but also effectively interprets ambiguous relative location descriptions and demonstrates strong generalization capabilities for non-point, region-based queries.

Technology Category

Application Category

📝 Abstract
This paper proposes ReaGeo, an end-to-end geocoding framework based on large language models, designed to overcome the limitations of traditional multi-stage approaches that rely on text or vector similarity retrieval over geographic databases, including workflow complexity, error propagation, and heavy dependence on structured geographic knowledge bases. The method converts geographic coordinates into geohash sequences, reformulating the coordinate prediction task as a text generation problem, and introduces a Chain-of-Thought mechanism to enhance the model's reasoning over spatial relationships. Furthermore, reinforcement learning with a distance-deviation-based reward is applied to optimize the generation accuracy. Comprehensive experiments show that ReaGeo can accurately handle explicit address queries in single-point predictions and effectively resolve vague relative location queries. In addition, the model demonstrates strong predictive capability for non-point geometric regions, highlighting its versatility and generalization ability in geocoding tasks.
Problem

Research questions and friction points this paper is trying to address.

geocoding
multi-stage approaches
error propagation
structured geographic knowledge bases
workflow complexity
Innovation

Methods, ideas, or system contributions that make the work stand out.

geocoding
large language models
Chain-of-Thought reasoning
geohash
reinforcement learning
🔎 Similar Papers
2024-06-03International Conference on Machine LearningCitations: 19