WindFM: An Open-Source Foundation Model for Zero-Shot Wind Power Forecasting

📅 2025-09-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing wind power forecasting methods face two key bottlenecks: site-specific models suffer from poor generalizability, while generic time-series foundation models struggle to incorporate domain-specific energy priors. To address this, we propose the first dedicated foundation model for zero-shot cross-site wind power forecasting. Our method introduces a hierarchical time-series tokenizer that explicitly captures meteorological–power coupling relationships and adopts a lightweight decoder-only Transformer architecture integrated with a multivariate discretization-based generative framework, pre-trained autoregressively on large-scale wind energy data. With only 8.1 million parameters, our model achieves state-of-the-art performance in both deterministic and probabilistic forecasting tasks. Moreover, it demonstrates strong robustness and zero-shot transfer capability under out-of-distribution settings—particularly across continental-scale site shifts—without fine-tuning.

Technology Category

Application Category

📝 Abstract
High-quality wind power forecasting is crucial for the operation of modern power grids. However, prevailing data-driven paradigms either train a site-specific model which cannot generalize to other locations or rely on fine-tuning of general-purpose time series foundation models which are difficult to incorporate domain-specific data in the energy sector. This paper introduces WindFM, a lightweight and generative Foundation Model designed specifically for probabilistic wind power forecasting. WindFM employs a discretize-and-generate framework. A specialized time-series tokenizer first converts continuous multivariate observations into discrete, hierarchical tokens. Subsequently, a decoder-only Transformer learns a universal representation of wind generation dynamics by autoregressively pre-training on these token sequences. Using the comprehensive WIND Toolkit dataset comprising approximately 150 billion time steps from more than 126,000 sites, WindFM develops a foundational understanding of the complex interplay between atmospheric conditions and power output. Extensive experiments demonstrate that our compact 8.1M parameter model achieves state-of-the-art zero-shot performance on both deterministic and probabilistic tasks, outperforming specialized models and larger foundation models without any fine-tuning. In particular, WindFM exhibits strong adaptiveness under out-of-distribution data from a different continent, demonstrating the robustness and transferability of its learned representations. Our pre-trained model is publicly available at https://github.com/shiyu-coder/WindFM.
Problem

Research questions and friction points this paper is trying to address.

Zero-shot probabilistic wind power forecasting
Overcoming site-specific model generalization limitations
Incorporating domain-specific data in energy forecasting
Innovation

Methods, ideas, or system contributions that make the work stand out.

Lightweight generative foundation model for wind forecasting
Discretize-and-generate framework with specialized time-series tokenizer
Autoregressive pre-training on universal wind dynamics representation
Hang Fan
Hang Fan
North China Electric Power Univercity;Tsinghua University
Electricity MarketTime series predictionDeep/Machine learning
Y
Yu Shi
Tsinghua University, Beijing 100084, China; Institute for Interdisciplinary Information Sciences
Z
Zongliang Fu
Tsinghua University, Beijing 100084, China; Department of Automation
S
Shuo Chen
Tsinghua University, Beijing 100084, China; Institute for Interdisciplinary Information Sciences
W
Wei Wei
Tsinghua University, Beijing 100084, China; Department of Electrical Engineering
W
Wei Xu
Tsinghua University, Beijing 100084, China; Institute for Interdisciplinary Information Sciences
J
Jian Li
Tsinghua University, Beijing 100084, China