Galileo: Learning Global and Local Features in Pretrained Remote Sensing Models

📅 2025-02-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Remote sensing pre-trained models are commonly constrained by fixed input modalities and spatial scales, limiting their adaptability to heterogeneous multi-sensor data and multi-scale Earth surface phenomena. To address this, we propose the Galileo model family: (1) a modality-agnostic encoder architecture supporting variable-resolution inputs; (2) a novel self-supervised learning paradigm that jointly models large-scale global structures and fine-grained local details for the first time; and (3) a unified spatiotemporal representation learning framework enabling cross-sensor and cross-resolution generalization. Evaluated on crop mapping and flood detection, Galileo achieves state-of-the-art performance while significantly reducing reliance on labeled data. Our work establishes a new paradigm for general-purpose remote sensing foundation models.

Technology Category

Application Category

📝 Abstract
From crop mapping to flood detection, machine learning in remote sensing has a wide range of societally beneficial applications. The commonalities between remote sensing data in these applications present an opportunity for pretrained machine learning models tailored to remote sensing to reduce the labeled data and effort required to solve individual tasks. However, such models must be: (i) flexible enough to ingest input data of varying sensor modalities and shapes (i.e., of varying spatial and temporal dimensions), and (ii) able to model Earth surface phenomena of varying scales and types. To solve this gap, we present Galileo, a family of pretrained remote sensing models designed to flexibly process multimodal remote sensing data. We also introduce a novel and highly effective self-supervised learning approach to learn both large- and small-scale features, a challenge not addressed by previous models. Our Galileo models obtain state-of-the-art results across diverse remote sensing tasks.
Problem

Research questions and friction points this paper is trying to address.

Flexible processing of multimodal remote sensing data
Learning both large and small scale features effectively
Reducing labeled data requirements for remote sensing tasks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Pretrained models for remote sensing
Flexible multimodal data processing
Self-supervised learning for feature scales
🔎 Similar Papers
No similar papers found.
Gabriel Tseng
Gabriel Tseng
Mila – Quebec AI Institute, McGill University, Allen Institute for AI (Ai2)
A
A. Fuller
Carleton University
M
Marlena Reil
Mila – Quebec AI Institute, McGill University
H
Henry Herzog
Allen Institute for AI (Ai2)
Patrick Beukema
Patrick Beukema
Ai2
AI for scienceAI for earthAI for good
F
F. Bastani
Allen Institute for AI (Ai2)
J
James R. Green
Carleton University
Evan Shelhamer
Evan Shelhamer
UBC / Vector Institute / CIFAR AI Chair
computer visionmachine learningdeep learning
H
Hannah Kerner
Arizona State University
David Rolnick
David Rolnick
McGill University, Mila Quebec AI Institute
Machine LearningClimate ChangeBiodiversityDeep Learning Theory