๐ค AI Summary
This work proposes a generative Transformer-based approach to modeling market microstructure, designed to learn cross-asset, generalizable representations from massive heterogeneous trading events and synthesize order flows that replicate the statistical properties of real markets. By introducing scale-invariant features and a universal event tokenization scheme, the method maps multi-asset order flows into unified discrete sequences and leverages a deterministic market simulator to generate realistic trading trajectories. Notably, it achieves zero-shot cross-market generalization without asset-specific calibrationโthe first such foundation model in microstructure modeling. The synthetic data accurately reproduces stylized facts such as heavy-tailed returns and volatility clustering, reducing distributional errors by a factor of 2โ3 compared to Compound Hawkes baselines, and demonstrates effective transfer to unseen Asia-Pacific markets.
๐ Abstract
Foundation models have transformed domains from language to genomics by learning general-purpose representations from large-scale, heterogeneous data. We introduce TradeFM, a 524M-parameter generative Transformer that brings this paradigm to market microstructure, learning directly from billions of trade events across>9K equities. To enable cross-asset generalization, we develop scale-invariant features and a universal tokenization scheme that map the heterogeneous, multi-modal event stream of order flow into a unified discrete sequence -- eliminating asset-specific calibration. Integrated with a deterministic market simulator, TradeFM-generated rollouts reproduce key stylized facts of financial returns, including heavy tails, volatility clustering, and absence of return autocorrelation. Quantitatively, TradeFM achieves 2-3x lower distributional error than Compound Hawkes baselines and generalizes zero-shot to geographically out-of-distribution APAC markets with moderate perplexity degradation. Together, these results suggest that scale-invariant trade representations capture transferable structure in market microstructure, opening a path toward synthetic data generation, stress testing, and learning-based trading agents.