FinFlowRL: An Imitation-Reinforcement Learning Framework for Adaptive Stochastic Control in Finance

📅 2025-09-22

📈 Citations: 0

✨ Influential: 0

career value

191K/year

🤖 AI Summary

Traditional stochastic control methods in finance suffer from poor robustness in dynamic real-world markets due to strong assumptions—e.g., market stationarity and Markovianity. To address this, we propose an adaptive decision-making framework integrating imitation learning with noise-augmented latent-space reinforcement learning. First, a meta-policy is pre-trained via multi-expert imitation learning; then, it is fine-tuned via reinforcement learning in a noisy latent space. An action chunking mechanism is further introduced to explicitly model non-Markovian dependencies. This design enables cross-market knowledge transfer and online policy adaptation. Empirical results demonstrate that our method significantly outperforms single-expert baselines across diverse market regimes—including high-volatility and structural-break scenarios—while exhibiting superior generalization, stability, and real-time decision robustness.

Technology Category

Application Category

📝 Abstract

Traditional stochastic control methods in finance rely on simplifying assumptions that often fail in real world markets. While these methods work well in specific, well defined scenarios, they underperform when market conditions change. We introduce FinFlowRL, a novel framework for financial stochastic control that combines imitation learning with reinforcement learning. The framework first pretrains an adaptive meta policy by learning from multiple expert strategies, then finetunes it through reinforcement learning in the noise space to optimize the generation process. By employing action chunking, that is generating sequences of actions rather than single decisions, it addresses the non Markovian nature of financial markets. FinFlowRL consistently outperforms individually optimized experts across diverse market conditions.

Problem

Research questions and friction points this paper is trying to address.

Addresses limitations of traditional financial stochastic control methods

Combines imitation and reinforcement learning for adaptive market control

Handles non-Markovian market dynamics through action chunking techniques

Innovation

Methods, ideas, or system contributions that make the work stand out.

Combines imitation learning with reinforcement learning

Pretrains meta policy from multiple expert strategies

Uses action chunking for non-Markovian market adaptation

🔎 Similar Papers

The Evolution of Reinforcement Learning in Quantitative Finance