The Seismic Wavefield Common Task Framework

📅 2025-12-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Seismology faces fundamental challenges in seismic wavefield prediction, reconstruction, and source/medium parameter modeling—including massive simulation requirements, sparse real-world observations, and oversimplified physical models. Existing machine learning (ML) approaches suffer from inconsistent feature engineering, nonstandardized evaluation protocols, and a lack of benchmark datasets. To address these issues, we propose the Common Task Framework (CTF) for seismic wavefields: a unified task-oriented framework that introduces three standardized, multi-scale datasets—global, crustal, and local—and formally defines core tasks (prediction, reconstruction, generalization) with task-specific metrics and hidden test sets. CTF integrates multi-scale physics-based simulations and real seismic network data, unifying foundational models (e.g., Waveform Transformer), classical inversion methods, and modern ML algorithms. It adopts rigorous robustness assessment and cross-dataset generalization evaluation. Initial benchmarking on two established benchmarks reveals performance boundaries and domain-specific applicability of diverse methods, advancing seismological ML toward reproducible, verifiable scientific evaluation.

Technology Category

Application Category

📝 Abstract
Seismology faces fundamental challenges in state forecasting and reconstruction (e.g., earthquake early warning and ground motion prediction) and managing the parametric variability of source locations, mechanisms, and Earth models (e.g., subsurface structure and topography effects). Addressing these with simulations is hindered by their massive scale, both in synthetic data volumes and numerical complexity, while real-data efforts are constrained by models that inadequately reflect the Earth's complexity and by sparse sensor measurements from the field. Recent machine learning (ML) efforts offer promise, but progress is obscured by a lack of proper characterization, fair reporting, and rigorous comparisons. To address this, we introduce a Common Task Framework (CTF) for ML for seismic wavefields, starting with three distinct wavefield datasets. Our CTF features a curated set of datasets at various scales (global, crustal, and local) and task-specific metrics spanning forecasting, reconstruction, and generalization under realistic constraints such as noise and limited data. Inspired by CTFs in fields like natural language processing, this framework provides a structured and rigorous foundation for head-to-head algorithm evaluation. We illustrate the evaluation procedure with scores reported for two of the datasets, showcasing the performance of various methods and foundation models for reconstructing seismic wavefields from both simulated and real-world sensor measurements. The CTF scores reveal the strengths, limitations, and suitability for specific problem classes. Our vision is to replace ad hoc comparisons with standardized evaluations on hidden test sets, raising the bar for rigor and reproducibility in scientific ML.
Problem

Research questions and friction points this paper is trying to address.

Addresses seismic forecasting and reconstruction challenges
Manages variability in earthquake sources and Earth models
Standardizes ML evaluation for seismic wavefield tasks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Introduces Common Task Framework for seismic wavefield ML
Provides curated multi-scale datasets with task-specific metrics
Enables standardized algorithm evaluation on hidden test sets
🔎 Similar Papers
No similar papers found.
A
Alexey Yermakov
Department of Electrical and Computer Engineering, University of Washington, Seattle, WA 98195
Y
Yue Zhao
High Performance Machine Learning team, SURF, Amsterdam, the Netherlands
Marine Denolle
Marine Denolle
Department of Earth and Space Sciences, University of Washington, Seattle, WA 98195
Y
Yiyu Ni
Department of Earth and Space Sciences, University of Washington, Seattle, WA 98195
P
Philippe M. Wyder
Department of Applied Mathematics, University of Washington, Seattle, WA 98195
J
Judah Goldfeder
Department of Computer Science, Columbia University, New York, NY 10027
S
Stefano Riva
Department of Energy, Nuclear Engineering Division, Politecnico di Milano, Milan, Italy
J
Jan Williams
Department of Mechanical Engineering, University of Washington, Seattle, WA 98195
D
David Zoro
Department of Electrical and Computer Engineering, University of Washington, Seattle, WA 98195
A
Amy Sara Rude
Department of Applied Mathematics, University of Washington, Seattle, WA 98195
Matteo Tomasetto
Matteo Tomasetto
Politecnico di Milano
scientific machine learningreduced-order modelingcontroldeep learning
J
Joe Germany
Department of Mathematics, American University in Beirut, Beirut, Lebanon
Joseph Bakarji
Joseph Bakarji
American University of Beirut
Nonlinear Dynamics and ChaosMachine LearningComplex Systems
Georg Maierhofer
Georg Maierhofer
Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
Miles Cranmer
Miles Cranmer
University of Cambridge
Machine LearningAstrophysicsFluid Dynamics
J. Nathan Kutz
J. Nathan Kutz
Professor of Applied Mathematics & Electrical and Computer Engineering
Dynamical SystemsData ScienceMachine LearningOpticsNeuroscience