Revisiting Ensemble Methods for Stock Trading and Crypto Trading Tasks at ACM ICAIF FinRL Contest 2023-2024

📅 2025-01-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the dual challenges of policy instability and low sample efficiency in financial reinforcement learning, this paper proposes a robust ensemble trading framework enabling thousand-scale parallel simulation on a single GPU. Methodologically, it integrates multi-agent PPO/SAC training, policy averaging and voting-based ensemble mechanisms, domain-specific financial time-series feature engineering, and a risk-aware reward function. Its key innovation lies in the first-ever realization of over one thousand concurrent environment simulations on a single GPU, coupled with a novel risk-mitigating policy fusion paradigm tailored for high-volatility markets. Experiments demonstrate a 1746× speedup in sampling throughput; compared to single-agent baselines, the framework achieves higher cumulative returns, reduces maximum drawdown by 4.17%, improves the Sharpe ratio by 0.21, and significantly enhances generalization capability and decision stability.

Technology Category

Application Category

📝 Abstract
Reinforcement learning has demonstrated great potential for performing financial tasks. However, it faces two major challenges: policy instability and sampling bottlenecks. In this paper, we revisit ensemble methods with massively parallel simulations on graphics processing units (GPUs), significantly enhancing the computational efficiency and robustness of trained models in volatile financial markets. Our approach leverages the parallel processing capability of GPUs to significantly improve the sampling speed for training ensemble models. The ensemble models combine the strengths of component agents to improve the robustness of financial decision-making strategies. We conduct experiments in both stock and cryptocurrency trading tasks to evaluate the effectiveness of our approach. Massively parallel simulation on a single GPU improves the sampling speed by up to $1,746 imes$ using $2,048$ parallel environments compared to a single environment. The ensemble models have high cumulative returns and outperform some individual agents, reducing maximum drawdown by up to $4.17%$ and improving the Sharpe ratio by up to $0.21$. This paper describes trading tasks at ACM ICAIF FinRL Contests in 2023 and 2024.
Problem

Research questions and friction points this paper is trying to address.

Reinforcement Learning
Financial Markets
Decision Instability
Innovation

Methods, ideas, or system contributions that make the work stand out.

Parallel Processing
Integrated Model Strategy
Financial Decision Stability
🔎 Similar Papers
No similar papers found.
N
Nikolaus Holzer
Columbia University, New York, New York, USA
K
Keyi Wang
Columbia University, New York, New York, USA
Kairong Xiao
Kairong Xiao
Columbia Business School
Financial IntermediationIndustrial OrganizationMonetary EconomicsPolitical Economy
X
Xiao-Yang Liu Yanglet
Columbia University, New York, New York, USA