🤖 AI Summary
This paper addresses the joint problem of identifying similar assets, detecting mispricing opportunities, and constructing trading strategies to maximize risk-adjusted returns net of transaction costs. To this end, we propose the “Attention Factor”—a conditional latent factor framework that models complex cross-asset interactions via learnable firm-level feature embeddings, and jointly optimizes factor extraction and trading decisions in an end-to-end manner. Our method unifies feature embedding, residual portfolio signal extraction, and a generic sequence model within a single statistical arbitrage learning architecture. Evaluated on out-of-sample U.S. equities in 2024, the strategy achieves a Sharpe ratio of 4.0 pre-transaction costs and 2.3 post-costs—demonstrating both the viability of weak signals in high-frequency statistical arbitrage and the practical robustness of the proposed framework.
📝 Abstract
Statistical arbitrage exploits temporal price differences between similar assets. We develop a framework to jointly identify similar assets through factors, identify mispricing and form a trading policy that maximizes risk-adjusted performance after trading costs. Our Attention Factors are conditional latent factors that are the most useful for arbitrage trading. They are learned from firm characteristic embeddings that allow for complex interactions. We identify time-series signals from the residual portfolios of our factors with a general sequence model. Estimating factors and the arbitrage trading strategy jointly is crucial to maximize profitability after trading costs. In a comprehensive empirical study we show that our Attention Factor model achieves an out-of-sample Sharpe ratio above 4 on the largest U.S. equities over a 24-year period. Our one-step solution yields an unprecedented Sharpe ratio of 2.3 net of transaction costs. We show that weak factors are important for arbitrage trading.