What If TSF: A Benchmark for Reframing Forecasting as Scenario-Guided Multimodal Forecasting

📅 2026-01-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the limitations of existing time series forecasting methods, which are predominantly unimodal and struggle to effectively incorporate textual context—such as hypothetical scenarios—for conditional prediction. Moreover, there is a lack of standardized benchmarks to evaluate whether models genuinely leverage textual information. To bridge this gap, the authors propose the first multimodal time series forecasting paradigm guided by hypothetical scenarios, introducing the What If TSF (WIT) benchmark dataset. WIT features expert-crafted real-world and counterfactual scenario descriptions meticulously aligned with corresponding time series, enabling rigorous assessment of a model’s ability to understand contextual cues and perform conditionally informed forecasting. This benchmark establishes a standardized evaluation platform for large language models in context-guided time series prediction, filling a critical void in both contextual understanding and conditional forecasting evaluation.

Technology Category

Application Category

📝 Abstract
Time series forecasting is critical to real-world decision making, yet most existing approaches remain unimodal and rely on extrapolating historical patterns. While recent progress in large language models (LLMs) highlights the potential for multimodal forecasting, existing benchmarks largely provide retrospective or misaligned raw context, making it unclear whether such models meaningfully leverage textual inputs. In practice, human experts incorporate what-if scenarios with historical evidence, often producing distinct forecasts from the same observations under different scenarios. Inspired by this, we introduce What If TSF (WIT), a multimodal forecasting benchmark designed to evaluate whether models can condition their forecasts on contextual text, especially future scenarios. By providing expert-crafted plausible or counterfactual scenarios, WIT offers a rigorous testbed for scenario-guided multimodal forecasting. The benchmark is available at https://github.com/jinkwan1115/WhatIfTSF.
Problem

Research questions and friction points this paper is trying to address.

time series forecasting
multimodal forecasting
scenario-guided forecasting
what-if scenarios
benchmark
Innovation

Methods, ideas, or system contributions that make the work stand out.

multimodal forecasting
scenario-guided forecasting
time series forecasting
what-if scenarios
LLM-based forecasting
🔎 Similar Papers
No similar papers found.
J
Jinkwan Jang
Graduate School of Data Science, Seoul National University
H
Hyunbin Jin
Graduate School of Data Science, Seoul National University
H
Hyungjin Park
Graduate School of Data Science, Seoul National University
K
Kyubyung Chae
Graduate School of Data Science, Seoul National University
Taesup Kim
Taesup Kim
Assistant Professor, Seoul National University
Representation LearningTransfer LearningAIMachine LearningDeep Learning