π€ AI Summary
This study addresses the challenge of modeling cross-asset dependencies and extreme-event co-movements in high-dimensional financial return data, where scarcity of observations hinders reliable estimation. To this end, the authors propose MarketGAN, a generative adversarial network that incorporates asset pricing factor structures as economic inductive biases. MarketGAN uniquely embeds an explicit factor model within the GAN framework and leverages temporal convolutional networks (TCNs) to dynamically capture time-varying factor loadings and volatilities. This approach effectively reproduces complex statistical features of financial markets, including cross-sectional dependence, tail co-movements, volatility clustering, and long-range temporal dependencies. Empirical experiments on daily U.S. equity data demonstrate that the generated samples closely replicate real-market statistical properties, and covariance matrices estimated from synthetic data yield significantly improved portfolio optimization performance compared to conventional methods, delivering clear economic value.
π Abstract
This paper introduces MarketGAN, a factor-based generative framework for high-dimensional asset return generation under severe data scarcity. We embed an explicit asset-pricing factor structure as an economic inductive bias and generate returns as a single joint vector, thereby preserving cross-sectional dependence and tail co-movement alongside inter-temporal dynamics. MarketGAN employs generative adversarial learning with a temporal convolutional network (TCN) backbone, which models stochastic, time-varying factor loadings and volatilities and captures long-range temporal dependence. Using daily returns of large U.S. equities, we find that MarketGAN more closely matches empirical stylized facts of asset returns, including heavy-tailed marginal distributions, volatility clustering, leverage effects, and, most notably, high-dimensional cross-sectional correlation structures and tail co-movement across assets, than conventional factor-model-based bootstrap approaches. In portfolio applications, covariance estimates derived from MarketGAN-generated samples outperform those derived from other methods when factor information is at least weakly informative, demonstrating tangible economic value.