🤖 AI Summary
Financial markets exhibit inherent complexity, information asymmetry, and high stochasticity, posing significant challenges for traditional decision-making models. Method: This work systematically reviews 167 studies on reinforcement learning (RL) in finance, proposing the first unified analytical framework integrating single-agent RL, multi-agent RL, transfer learning, and meta-learning. It leverages canonical algorithms—including deep Q-networks and policy gradients—alongside financial time-series modeling and simulation-based evaluation. Contribution/Results: The study identifies three persistent limitations across existing approaches: insufficient interpretability, inadequate robustness, and poor generalization. It characterizes six representative application paradigms and introduces the first standardized evaluation protocol for financial RL, specifying environment design principles, benchmark tasks, and performance metrics. The findings deliver a structured taxonomy for theoretical advancement and establish reproducible benchmarks to guide empirical research and industrial deployment.
📝 Abstract
Reinforcement Learning (RL) has experienced significant advancement over the past decade, prompting a growing interest in applications within finance. This survey critically evaluates 167 publications, exploring diverse RL applications and frameworks in finance. Financial markets, marked by their complexity, multi-agent nature, information asymmetry, and inherent randomness, serve as an intriguing test-bed for RL. Traditional finance offers certain solutions, and RL advances these with a more dynamic approach, incorporating machine learning methods, including transfer learning, meta-learning, and multi-agent solutions. This survey dissects key RL components through the lens of Quantitative Finance. We uncover emerging themes, propose areas for future research, and critique the strengths and weaknesses of existing methods.