Harvesting AlphaEarth: Benchmarking the Geospatial Foundation Model for Agricultural Downstream Tasks

📅 2025-12-30
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the lack of systematic evaluation of geospatial foundation models like AlphaEarth in agricultural downstream tasks and their comprehensive comparison against traditional remote sensing approaches. We present the first application of AlphaEarth embeddings to three major U.S. agricultural tasks—crop yield prediction, tillage practice mapping, and cover crop mapping—leveraging both public and proprietary datasets. Machine learning models based on AlphaEarth features are rigorously compared with those using conventional remote sensing features across multiple spatial scales and regions. Results demonstrate that AlphaEarth achieves strong performance in yield prediction and county-level tillage mapping when trained locally; however, it exhibits limited spatial transferability, low interpretability, and insufficient temporal sensitivity. These findings provide critical empirical insights and practical guidance for deploying foundation models in real-world agricultural applications.

Technology Category

Application Category

📝 Abstract
Geospatial foundation models (GFMs) have emerged as a promising approach to overcoming the limitations in existing featurization methods. More recently, Google DeepMind has introduced AlphaEarth Foundation (AEF), a GFM pre-trained using multi-source EOs across continuous time. An annual and global embedding dataset is produced using AEF that is ready for analysis and modeling. The internal experiments show that AEF embeddings have outperformed operational models in 15 EO tasks without re-training. However, those experiments are mostly about land cover and land use classification. Applying AEF and other GFMs to agricultural monitoring require an in-depth evaluation in critical agricultural downstream tasks. There is also a lack of comprehensive comparison between the AEF-based models and traditional remote sensing (RS)-based models under different scenarios, which could offer valuable guidance for researchers and practitioners. This study addresses some of these gaps by evaluating AEF embeddings in three agricultural downstream tasks in the U.S., including crop yield prediction, tillage mapping, and cover crop mapping. Datasets are compiled from both public and private sources to comprehensively evaluate AEF embeddings across tasks at different scales and locations, and RS-based models are trained as comparison models. AEF-based models generally exhibit strong performance on all tasks and are competitive with purpose-built RS-based models in yield prediction and county-level tillage mapping when trained on local data. However, we also find several limitations in current AEF embeddings, such as limited spatial transferability compared to RS-based models, low interpretability, and limited time sensitivity. These limitations recommend caution when applying AEF embeddings in agriculture, where time sensitivity, generalizability, and interpretability is important.
Problem

Research questions and friction points this paper is trying to address.

geospatial foundation model
agricultural monitoring
remote sensing
downstream tasks
model evaluation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Geospatial Foundation Model
AlphaEarth
Agricultural Downstream Tasks
Remote Sensing Benchmarking
Crop Yield Prediction
🔎 Similar Papers
No similar papers found.
Yuchi Ma
Yuchi Ma
HUAWEI
Large Language ModelAI for SE
Y
Yawen Shen
Corteva Agriscience™, USA
A
Anu Swatantran
Corteva Agriscience™, USA
D
David B. Lobell
Department of Earth System Science and Center on Food Security and the Environment, Stanford University, USA