History Is Not Enough: An Adaptive Dataflow System for Financial Time-Series Synthesis

📅 2026-01-15
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Financial time series data often suffer from concept drift and non-stationary distributions, which degrade model generalization. To address this, this work proposes a drift-aware adaptive data stream system that, for the first time, unifies data augmentation, curriculum learning, and data workflow management within a differentiable framework, enabling gradient-based bilevel optimization and traceable replay. The system incorporates parameterized data operation modules—including single-asset transformations, multi-asset mixing, and data filtering—together with an adaptive planner that dynamically adjusts augmentation strategies to align with evolving market conditions. Evaluated on both predictive modeling and reinforcement learning–based trading tasks, the approach significantly enhances model robustness and risk-adjusted returns.

Technology Category

Application Category

📝 Abstract
In quantitative finance, the gap between training and real-world performance-driven by concept drift and distributional non-stationarity-remains a critical obstacle for building reliable data-driven systems. Models trained on static historical data often overfit, resulting in poor generalization in dynamic markets. The mantra"History Is Not Enough"underscores the need for adaptive data generation that learns to evolve with the market rather than relying solely on past observations. We present a drift-aware dataflow system that integrates machine learning-based adaptive control into the data curation process. The system couples a parameterized data manipulation module comprising single-stock transformations, multi-stock mix-ups, and curation operations, with an adaptive planner-scheduler that employs gradient-based bi-level optimization to control the system. This design unifies data augmentation, curriculum learning, and data workflow management under a single differentiable framework, enabling provenance-aware replay and continuous data quality monitoring. Extensive experiments on forecasting and reinforcement learning trading tasks demonstrate that our framework enhances model robustness and improves risk-adjusted returns. The system provides a generalizable approach to adaptive data management and learning-guided workflow automation for financial data.
Problem

Research questions and friction points this paper is trying to address.

concept drift
distributional non-stationarity
financial time-series
model generalization
adaptive data generation
Innovation

Methods, ideas, or system contributions that make the work stand out.

adaptive dataflow
concept drift
differentiable data curation
bi-level optimization
financial time-series synthesis
H
Haochong Xia
College of Computing and Data Science, Nanyang Technological University, Singapore
Y
Yao Long Teng
College of Computing and Data Science, Nanyang Technological University, Singapore
R
Regan Tan
College of Computing and Data Science, Nanyang Technological University, Singapore
M
Molei Qin
College of Computing and Data Science, Nanyang Technological University, Singapore
Xinrun Wang
Xinrun Wang
Singapore Management University
Reinforcement LearningMulti-Agent SystemsDistributed Foundation Agents
Bo An
Bo An
Nanyang Technological University
Artificial intelligencemulti-agent systemsgame theoryreinforcement learningoptimization