Geo-OLM: Enabling Sustainable Earth Observation Studies with Cost-Efficient Open Language Models&State-Driven Workflows

📅 2025-04-06

📈 Citations: 0

✨ Influential: 0

career value

190K/year

🤖 AI Summary

Large language models (LLMs) such as GPT-4o show promise for geospatial assistance in Earth observation and climate monitoring, but their high API costs and computational demands hinder sustainable deployment. Method: This paper proposes a state-driven LLM inference paradigm—the first to decouple task workflows from tool invocation—built upon a lightweight open-source LLM (<7B parameters), augmented with tool-integrated agents, on-the-fly LLM fine-tuning (OLM), collaborative execution, and a geospatial-specific evaluation benchmark. Contribution/Results: Experiments demonstrate a 32.8% improvement in query completion rate over the best open-source baseline, achieving 90% of GPT-4o’s performance while reducing per-inference cost from $500–$1,000 to under $10—a >99% reduction. The approach establishes a new paradigm for low-cost, high-performance, and reproducible geospatial automation.

Technology Category

Application Category

📝 Abstract

Geospatial Copilots hold immense potential for automating Earth observation (EO) and climate monitoring workflows, yet their reliance on large-scale models such as GPT-4o introduces a paradox: tools intended for sustainability studies often incur unsustainable costs. Using agentic AI frameworks in geospatial applications can amass thousands of dollars in API charges or requires expensive, power-intensive GPUs for deployment, creating barriers for researchers, policymakers, and NGOs. Unfortunately, when geospatial Copilots are deployed with open language models (OLMs), performance often degrades due to their dependence on GPT-optimized logic. In this paper, we present Geo-OLM, a tool-augmented geospatial agent that leverages the novel paradigm of state-driven LLM reasoning to decouple task progression from tool calling. By alleviating the workflow reasoning burden, our approach enables low-resource OLMs to complete geospatial tasks more effectively. When downsizing to small models below 7B parameters, Geo-OLM outperforms the strongest prior geospatial baselines by 32.8% in successful query completion rates. Our method performs comparably to proprietary models achieving results within 10% of GPT-4o, while reducing inference costs by two orders of magnitude from $500-$1000 to under $10. We present an in-depth analysis with geospatial downstream benchmarks, providing key insights to help practitioners effectively deploy OLMs for EO applications.

Problem

Research questions and friction points this paper is trying to address.

High costs of large models hinder sustainable Earth observation studies

Open language models underperform in geospatial tasks without optimization

Balancing performance and cost in geospatial AI workflows is challenging

Innovation

Methods, ideas, or system contributions that make the work stand out.

State-driven LLM reasoning for task progression

Cost-efficient open language models deployment

Tool-augmented geospatial agent for sustainability

🔎 Similar Papers

RSTeller: Scaling Up Visual Language Modeling in Remote Sensing with Rich Linguistic Semantics from Openly Available Data and Large Language Models