π€ AI Summary
This work addresses the challenges of executability, auditability, and computational efficiency in automated alpha factor mining for quantitative investing, as well as the limited expressiveness or poor interpretability of existing approaches. To overcome these issues, the authors propose FactorEngine, a novel framework that models factors as Turing-complete programs. FactorEngine decouples logical refinement from parameter optimization and integrates large language modelβguided directional search with Bayesian hyperparameter optimization. It further introduces a knowledge-injected bootstrapping module that leverages a multi-agent closed-loop pipeline to transform unstructured financial reports into executable factor programs, building an empirical knowledge base to support trajectory-aware iterative refinement. Backtesting on real-world OHLCV data demonstrates that the proposed method significantly outperforms baseline models in terms of IC/ICIR, Rank IC/ICIR, and annualized return/Sharpe ratio, achieving both high predictive stability and superior investment performance.
π Abstract
We study alpha factor mining, the automated discovery of predictive signals from noisy, non-stationary market data-under a practical requirement that mined factors be directly executable and auditable, and that the discovery process remain computationally tractable at scale. Existing symbolic approaches are limited by bounded expressiveness, while neural forecasters often trade interpretability for performance and remain vulnerable to regime shifts and overfitting. We introduce FactorEngine (FE), a program-level factor discovery framework that casts factors as Turing-complete code and improves both effectiveness and efficiency via three separations: (i) logic revision vs. parameter optimization, (ii) LLM-guided directional search vs. Bayesian hyperparameter search, and (iii) LLM usage vs. local computation. FE further incorporates a knowledge-infused bootstrapping module that transforms unstructured financial reports into executable factor programs through a closed-loop multi-agent extraction-verification-code-generation pipeline, and an experience knowledge base that supports trajectory-aware refinement (including learning from failures). Across extensive backtests on real-world OHLCV data, FE produces factors with substantially stronger predictive stability and portfolio impact-for example, higher IC/ICIR (and Rank IC/ICIR) and improved AR/Sharpe, than baseline methods, achieving state-of-the-art predictive and portfolio performance.