π€ AI Summary
To address the challenge of long-horizon multivariate time series generation, this paper proposes the first fractional diffusion model grounded in log-signature embeddings. By modeling time series as discrete samples of continuous paths, we design forward perturbation and reverse denoising schemes that preserve the algebraic structure of path signatures. We pioneer the deep integration of log-signatures with score-based diffusion frameworks and derive a differentiable, closed-form signature inversion formula, enabling exact reconstruction from algebraic features back toεε§ time series. To enhance representational capacity, the method incorporates Fourier and orthogonal polynomial expansions. Experiments demonstrate state-of-the-art generation quality on diverse synthetic and real-world long-term multivariate time series datasets, significantly improving robustness to sequence length and fidelity in capturing complex inter-variable temporal dependencies.
π Abstract
Score-based diffusion models have recently emerged as state-of-the-art generative models for a variety of data modalities. Nonetheless, it remains unclear how to adapt these models to generate long multivariate time series. Viewing a time series as the discretisation of an underlying continuous process, we introduce SigDiffusion, a novel diffusion model operating on log-signature embeddings of the data. The forward and backward processes gradually perturb and denoise log-signatures while preserving their algebraic structure. To recover a signal from its log-signature, we provide new closed-form inversion formulae expressing the coefficients obtained by expanding the signal in a given basis (e.g. Fourier or orthogonal polynomials) as explicit polynomial functions of the log-signature. Finally, we show that combining SigDiffusions with these inversion formulae results in high-quality long time series generation, competitive with the current state-of-the-art on various datasets of synthetic and real-world examples.