This Time is Different: An Observability Perspective on Time Series Foundation Models

πŸ“… 2025-05-20
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the modeling challenge of multivariate time-series forecasting for observability monitoring. We propose Totoβ€”a foundational decoder-only architecture (151M parameters) specifically designed for observability dataβ€”and BOOM, the first large-scale benchmark comprising 350M real-world production observations across 2,807 time series. To enable robust pretraining, we construct the first observability-specific pretraining corpus derived entirely from production environments and introduce a hybrid pretraining strategy combining real, open-source, and synthetic time-series data. Evaluated on BOOM and multiple general-purpose time-series benchmarks, Toto achieves state-of-the-art performance. All model weights, inference code, and evaluation scripts are publicly released under the Apache 2.0 license, facilitating standardized, reproducible research in observability-driven time-series modeling.

Technology Category

Application Category

πŸ“ Abstract
We introduce Toto, a time series forecasting foundation model with 151 million parameters. Toto uses a modern decoder-only architecture coupled with architectural innovations designed to account for specific challenges found in multivariate observability time series data. Toto's pre-training corpus is a mixture of observability data, open datasets, and synthetic data, and is 4-10$ imes$ larger than those of leading time series foundation models. Additionally, we introduce BOOM, a large-scale benchmark consisting of 350 million observations across 2,807 real-world time series. For both Toto and BOOM, we source observability data exclusively from Datadog's own telemetry and internal observability metrics. Extensive evaluations demonstrate that Toto achieves state-of-the-art performance on both BOOM and on established general purpose time series forecasting benchmarks. Toto's model weights, inference code, and evaluation scripts, as well as BOOM's data and evaluation code, are all available as open source under the Apache 2.0 License available at https://huggingface.co/Datadog/Toto-Open-Base-1.0 and https://github.com/DataDog/toto.
Problem

Research questions and friction points this paper is trying to address.

Develops Toto, a large-scale time series forecasting foundation model
Addresses challenges in multivariate observability time series data
Introduces BOOM benchmark for evaluating time series models
Innovation

Methods, ideas, or system contributions that make the work stand out.

Decoder-only architecture for time series
Largest pre-training corpus for observability
Open-source model weights and benchmark
πŸ”Ž Similar Papers
No similar papers found.
Ben Cohen
Ben Cohen
Datadog AI Research
E
Emaad Khwaja
Datadog AI Research
Y
Youssef Doubli
Datadog AI Research
S
Salahidine Lemaachi
Datadog AI Research
C
Chris Lettieri
Datadog AI Research
C
Charles Masson
Datadog AI Research
H
Hugo Miccinilli
Datadog AI Research
E
Elise Ram'e
Datadog AI Research
Q
Qiqi Ren
Datadog AI Research
Afshin Rostamizadeh
Afshin Rostamizadeh
Datadog
J
Jean Ogier du Terrail
Datadog AI Research
A
Anna-Monica Toon
Datadog AI Research
Kan Wang
Kan Wang
Datadog AI Research
S
Stephan Xie
Datadog AI Research, Carnegie Mellon University
D
David Asker
Datadog AI Research
Ameet Talwalkar
Ameet Talwalkar
CMU, Datadog
Machine Learning
O
Othmane Abou-Amal
Datadog AI Research