ParallelTime: Dynamically Weighting the Balance of Short- and Long-Term Temporal Dependencies

๐Ÿ“… 2025-07-18
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF

career value

189K/year
๐Ÿค– AI Summary
Existing multivariate time series forecasting methods suffer from modeling bias due to uniform weighting of short- and long-term dependencies. To address this, we propose ParallelTimeโ€”a novel architecture featuring an input-aware ParallelTime Weighter module that dynamically allocates adaptive weights to local attention (for short-term patterns) and Mamba-based state-space modeling (for long-term dynamics), thereby overcoming the limitations of conventional equal-weighted fusion. The architecture adopts a parallel design to jointly optimize computational efficiency and modeling flexibility. Extensive experiments on multiple benchmark datasets demonstrate state-of-the-art (SOTA) performance, with significant reductions in both FLOPs and parameter count. Moreover, ParallelTime exhibits superior long-horizon extrapolation capability, confirming its effectiveness in capturing extended temporal dependencies.

Technology Category

Application Category

๐Ÿ“ Abstract
Modern multivariate time series forecasting primarily relies on two architectures: the Transformer with attention mechanism and Mamba. In natural language processing, an approach has been used that combines local window attention for capturing short-term dependencies and Mamba for capturing long-term dependencies, with their outputs averaged to assign equal weight to both. We find that for time-series forecasting tasks, assigning equal weight to long-term and short-term dependencies is not optimal. To mitigate this, we propose a dynamic weighting mechanism, ParallelTime Weighter, which calculates interdependent weights for long-term and short-term dependencies for each token based on the input and the model's knowledge. Furthermore, we introduce the ParallelTime architecture, which incorporates the ParallelTime Weighter mechanism to deliver state-of-the-art performance across diverse benchmarks. Our architecture demonstrates robustness, achieves lower FLOPs, requires fewer parameters, scales effectively to longer prediction horizons, and significantly outperforms existing methods. These advances highlight a promising path for future developments of parallel Attention-Mamba in time series forecasting. The implementation is readily available at: href{https://github.com/itay1551/ParallelTime}{ParallelTime GitHub
Problem

Research questions and friction points this paper is trying to address.

Dynamic weighting for short- and long-term dependencies
Improving time series forecasting with ParallelTime
Balancing attention and Mamba for optimal performance
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dynamic weighting for temporal dependencies balance
Combines local window attention and Mamba
ParallelTime architecture with state-of-the-art performance
๐Ÿ”Ž Similar Papers
No similar papers found.