FinFlowRL: An Imitation-Reinforcement Learning Framework for Adaptive Stochastic Control in Finance

📅 2025-09-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traditional stochastic control methods in finance suffer from poor robustness in dynamic real-world markets due to strong assumptions—e.g., market stationarity and Markovianity. To address this, we propose an adaptive decision-making framework integrating imitation learning with noise-augmented latent-space reinforcement learning. First, a meta-policy is pre-trained via multi-expert imitation learning; then, it is fine-tuned via reinforcement learning in a noisy latent space. An action chunking mechanism is further introduced to explicitly model non-Markovian dependencies. This design enables cross-market knowledge transfer and online policy adaptation. Empirical results demonstrate that our method significantly outperforms single-expert baselines across diverse market regimes—including high-volatility and structural-break scenarios—while exhibiting superior generalization, stability, and real-time decision robustness.

Technology Category

Application Category

📝 Abstract
Traditional stochastic control methods in finance rely on simplifying assumptions that often fail in real world markets. While these methods work well in specific, well defined scenarios, they underperform when market conditions change. We introduce FinFlowRL, a novel framework for financial stochastic control that combines imitation learning with reinforcement learning. The framework first pretrains an adaptive meta policy by learning from multiple expert strategies, then finetunes it through reinforcement learning in the noise space to optimize the generation process. By employing action chunking, that is generating sequences of actions rather than single decisions, it addresses the non Markovian nature of financial markets. FinFlowRL consistently outperforms individually optimized experts across diverse market conditions.
Problem

Research questions and friction points this paper is trying to address.

Addresses limitations of traditional financial stochastic control methods
Combines imitation and reinforcement learning for adaptive market control
Handles non-Markovian market dynamics through action chunking techniques
Innovation

Methods, ideas, or system contributions that make the work stand out.

Combines imitation learning with reinforcement learning
Pretrains meta policy from multiple expert strategies
Uses action chunking for non-Markovian market adaptation
🔎 Similar Papers
No similar papers found.
Y
Yang Li
School of Business, Stevens Institute of Technology
Z
Zhi Chen
School of Business, Stevens Institute of Technology
S
Steve Y. Yang
School of Business, Stevens Institute of Technology
Ruixun Zhang
Ruixun Zhang
Peking University
Sustainable InvestingMachine LearningMarket MicrostructureAdaptive Markets