🤖 AI Summary
Foundation models (FMs) are increasingly applied to time series forecasting, yet their generalizability across diverse, domain-heterogeneous time series remains poorly understood. Method: This work systematically evaluates FM applicability through cross-domain zero-shot forecasting and fine-tuning experiments, explicitly assessing how pretraining data domain affects out-of-distribution generalization. Contribution/Results: (1) Zero-shot performance is highly sensitive to pretraining domain and degrades sharply on unseen real-world scenarios; (2) After fine-tuning, large FMs fail to consistently outperform lightweight, task-specific models and incur substantially higher computational costs; (3) The widely held “larger models are universally better” assumption is empirically invalidated for time series forecasting, leading to the principle that “domain alignment takes precedence over scale expansion.” These findings provide empirical grounding and methodological guidance for designing practical time-series foundation models—highlighting domain specificity, efficiency trade-offs, and the limits of transferability in sequential forecasting.
📝 Abstract
Foundation Models are designed to serve as versatile embedding machines, with strong zero shot capabilities and superior generalization performance when fine-tuned on diverse downstream tasks. While this is largely true for language and vision foundation models, we argue that the inherent diversity of time series data makes them less suited for building effective foundation models. We demonstrate this using forecasting as our downstream task. We show that the zero-shot capabilities of a time series foundation model are significantly influenced and tied to the specific domains it has been pretrained on. Furthermore, when applied to unseen real-world time series data, fine-tuned foundation models do not consistently yield substantially better results, relative to their increased parameter count and memory footprint, than smaller, dedicated models tailored to the specific forecasting task at hand.