Learning to Defer in Non-Stationary Time Series via Switching State-Space Models

📅 2026-01-30
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of modeling non-stationary time series when expert availability varies over time and only partial feedback is observed. To this end, the authors propose L2D-SLDS, a factored switching linear Gaussian state-space model that jointly captures shared global dynamics and expert-specific latent states. The approach models expert residuals and enables context-aware dynamic routing, while supporting online expert registration and pruning. A routing policy based on information-directed sampling (IDS) is introduced to balance predictive accuracy against the information gain from querying individual experts. Experimental results demonstrate that the proposed method significantly outperforms contextual multi-armed bandit baselines and ablated variants without shared factors, achieving marked improvements in predictive performance.

Technology Category

Application Category

📝 Abstract
We study Learning to Defer for non-stationary time series with partial feedback and time-varying expert availability. At each time step, the router selects an available expert, observes the target, and sees only the queried expert's prediction. We model signed expert residuals using L2D-SLDS, a factorized switching linear-Gaussian state-space model with context-dependent regime transitions, a shared global factor enabling cross-expert information transfer, and per-expert idiosyncratic states. The model supports expert entry and pruning via a dynamic registry. Using one-step-ahead predictive beliefs, we propose an IDS-inspired routing rule that trades off predicted cost against information gained about the latent regime and shared factor. Experiments show improvements over contextual-bandit baselines and a no-shared-factor ablation.
Problem

Research questions and friction points this paper is trying to address.

Learning to Defer
Non-Stationary Time Series
Partial Feedback
Time-Varying Expert Availability
Expert Routing
Innovation

Methods, ideas, or system contributions that make the work stand out.

switching state-space models
learning to defer
non-stationary time series
information-directed sampling
dynamic expert routing
🔎 Similar Papers
No similar papers found.
Yannis Montreuil
Yannis Montreuil
PhD Candidate
Machine LearningStatistical LearningHuman-AI collaboration
L
Letian Yu
School of Computing, National University of Singapore, Singapore; Institute for Infocomm Research, Agency for Science, Technology and Research, Singapore; CNRS@CREATE LTD, 1 Create Way, Singapore
Axel Carlier
Axel Carlier
ISAE-SUPAERO
AIMultimedia
L
Lai Xing Ng
Institute for Infocomm Research, Agency for Science, Technology and Research, Singapore; IPAL, IRL2955, Singapore
Wei Tsang Ooi
Wei Tsang Ooi
National University of Singapore
Multimedia SystemsInteractive SystemsIntelligent Systems