Earth-Agent: Unlocking the Full Landscape of Earth Observation with Agents

📅 2025-09-27
🤖 AI Summary
Problem: Current multimodal large language models (MLLMs) struggle with multi-step reasoning and domain-specific tool integration in Earth observation tasks, and the field lacks a systematic evaluation framework tailored to remote sensing agents. Method: We propose Earth-Agent, presented as the first agentic framework that jointly processes RGB and spectral remote sensing data. It builds on a Model Context Protocol (MCP) tool ecosystem to enable cross-modal, quantitative spatiotemporal reasoning and geophysical parameter retrieval. Contribution/Results: We introduce Earth-Bench, a dedicated benchmark with a dual-level evaluation protocol that assesses both reasoning trajectories and final outcomes, addressing the gap in systematic assessment of remote sensing agents. Experiments across different LLM backbones, against general agent frameworks, and against MLLMs on remote sensing benchmarks indicate that Earth-Agent is effective, moving remote sensing analysis from shallow perception toward scientifically grounded reasoning.

📝 Abstract
Earth observation (EO) is essential for understanding the evolving states of the Earth system. Although recent MLLMs have advanced EO research, they still lack the capability to tackle complex tasks that require multi-step reasoning and the use of domain-specific tools. Agent-based methods offer a promising direction, but current attempts remain in their infancy: they are confined to RGB perception and shallow reasoning, and they lack systematic evaluation protocols. To overcome these limitations, we introduce Earth-Agent, the first agentic framework that unifies RGB and spectral EO data within an MCP-based tool ecosystem, enabling cross-modal, multi-step, and quantitative spatiotemporal reasoning beyond pretrained MLLMs. Earth-Agent supports complex scientific tasks such as geophysical parameter retrieval and quantitative spatiotemporal analysis by dynamically invoking expert tools and models across modalities. To support comprehensive evaluation, we further propose Earth-Bench, a benchmark of 248 expert-curated tasks with 13,729 images, spanning spectrum, products, and RGB modalities, and equipped with a dual-level evaluation protocol that assesses both reasoning trajectories and final outcomes. We conduct comprehensive experiments that vary the LLM backbone, compare against general agent frameworks, and compare against MLLMs on remote sensing benchmarks, demonstrating both the effectiveness and potential of Earth-Agent. Earth-Agent establishes a new paradigm for EO analysis, moving the field toward scientifically grounded, next-generation applications of LLMs in Earth observation. Our code and dataset will be publicly released.
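
The dual-level evaluation idea (scoring the reasoning trajectory separately from the final outcome) can be illustrated with a small sketch. This is not Earth-Bench's actual scoring code: the `ToolCall` type, the in-order tool-name overlap used as the trajectory score, and the 1% relative tolerance on numeric answers are all assumptions chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class ToolCall:
    """One step of an agent trajectory (hypothetical structure)."""
    name: str
    args: dict


def lcs_length(a, b):
    """Length of the longest common subsequence of two sequences."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def trajectory_score(predicted, reference):
    """Fraction of reference tool names that the agent reproduced in order.

    Assumed metric for illustration; arguments are not compared here.
    """
    if not reference:
        return 1.0
    pred_names = [call.name for call in predicted]
    ref_names = [call.name for call in reference]
    return lcs_length(pred_names, ref_names) / len(ref_names)


def outcome_score(predicted, expected, rel_tol=0.01):
    """1.0 if the final answer matches, else 0.0.

    Numeric answers are compared with a relative tolerance (assumed value);
    everything else is compared as case-insensitive text.
    """
    if isinstance(expected, (int, float)) and isinstance(predicted, (int, float)):
        return float(abs(predicted - expected) <= rel_tol * abs(expected))
    return float(str(predicted).strip().lower() == str(expected).strip().lower())


def evaluate(predicted_calls, reference_calls, predicted_answer, expected_answer):
    """Report both levels side by side instead of collapsing them into one number."""
    return {
        "trajectory": trajectory_score(predicted_calls, reference_calls),
        "outcome": outcome_score(predicted_answer, expected_answer),
    }
```

Keeping the two scores separate makes it visible when an agent reaches a correct answer through an incomplete tool chain, or follows the expected chain but still reports a wrong value.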
Problem

Research questions and friction points this paper is trying to address.

Addressing complex Earth observation tasks requiring multi-step reasoning
Overcoming limitations of current agent methods confined to RGB perception
Establishing systematic evaluation protocols for Earth observation agents
Innovation

Methods, ideas, or system contributions that make the work stand out.

Unified agentic framework for RGB and spectral data
Dynamic tool invocation for cross-modal reasoning
Dual-level evaluation protocol for comprehensive assessment
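
To make "dynamic tool invocation" on spectral data concrete, the following is a minimal sketch of a tool registry plus a plan executor. It does not use the MCP SDK and is not Earth-Agent's implementation: the `TOOLS` registry, the `tool` decorator, the `$name` reference convention, and the plan format are hypothetical. In an agent the plan would be produced step by step by the LLM; here it is hard-coded. The only domain fact used is the standard NDVI formula, (NIR - Red) / (NIR + Red).

```python
# Hypothetical tool registry; names and signatures are illustrative only.
TOOLS = {}


def tool(name):
    """Register a function under a tool name that a planner can refer to."""
    def decorator(fn):
        TOOLS[name] = fn
        return fn
    return decorator


@tool("ndvi")
def ndvi(red, nir):
    """Per-pixel NDVI = (NIR - Red) / (NIR + Red); 0.0 where the sum is zero."""
    return [(n - r) / (n + r) if (n + r) != 0 else 0.0 for r, n in zip(red, nir)]


@tool("mean")
def mean(values):
    """Arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)


def run_plan(plan, inputs):
    """Execute tool calls in order.

    String arguments starting with '$' are replaced by earlier results or
    inputs stored under that name; other arguments are passed through as-is.
    """
    memory = dict(inputs)
    for step in plan:
        kwargs = {
            key: memory[val[1:]] if isinstance(val, str) and val.startswith("$") else val
            for key, val in step["args"].items()
        }
        memory[step["save_as"]] = TOOLS[step["tool"]](**kwargs)
    return memory


# Toy reflectance values for two pixels (made up for the example).
plan = [
    {"tool": "ndvi", "args": {"red": "$red", "nir": "$nir"}, "save_as": "ndvi_map"},
    {"tool": "mean", "args": {"values": "$ndvi_map"}, "save_as": "mean_ndvi"},
]
result = run_plan(plan, {"red": [0.1, 0.2], "nir": [0.5, 0.6]})
print(round(result["mean_ndvi"], 3))
```

Routing every call through a named registry is what lets a language model choose tools by name and chain intermediate results, which is the behavior the bullet list above describes at a high level.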

Peilin Feng
Shanghai Artificial Intelligence Laboratory

Zhutao Lv
Sun Yat-sen University, Shanghai Artificial Intelligence Laboratory

Junyan Ye
SYSU
Computer Vision and Deep Learning

Xiaolei Wang
Sun Yat-sen University

Xinjie Huo
Sun Yat-sen University

Jinhua Yu
Sun Yat-sen University
Remote sensing

Wanghan Xu
Shanghai Artificial Intelligence Laboratory

Wenlong Zhang
Shanghai Artificial Intelligence Laboratory

Lei Bai
Shanghai AI Laboratory
Foundation Model, Science Intelligence, Multi-Agent System, Autonomous Discovery

Conghui He
Shanghai AI Laboratory
Data-centric AI, LLM, Document Intelligence

Weijia Li
Sun Yat-sen University, Shanghai Artificial Intelligence Laboratory