MM-DREX: Multimodal-Driven Dynamic Routing of LLM Experts for Financial Trading

📅 2025-09-05
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Non-stationarity in financial markets and challenges in fusing heterogeneous multimodal information severely limit the adaptability and decision accuracy of existing quantitative models. To address this, we propose DR-MoE—a Dynamic Routing Multimodal Mixture-of-Experts framework leveraging large language models. DR-MoE is the first to employ vision-language models (VLMs) for market state perception and dynamic expert routing, decoupling state recognition from strategy execution. It integrates four heterogeneous trading experts—trend-following, mean-reversion, breakout, and position-holding—each generating fine-grained sub-strategies and dynamically allocating weights in real time. Routing classification and risk-adjusted policy optimization are jointly trained via a hybrid supervised fine-tuning and reinforcement learning (SFT-RL) approach. Evaluated across equities, futures, and cryptocurrencies, DR-MoE achieves statistically significant improvements in annualized return, Sharpe ratio, and maximum drawdown over 15 state-of-the-art baselines, demonstrating superior robustness and cross-asset generalization.

Technology Category

Application Category

📝 Abstract
The inherent non-stationarity of financial markets and the complexity of multi-modal information pose significant challenges to existing quantitative trading models. Traditional methods relying on fixed structures and unimodal data struggle to adapt to market regime shifts, while large language model (LLM)-driven solutions - despite their multi-modal comprehension - suffer from static strategies and homogeneous expert designs, lacking dynamic adjustment and fine-grained decision mechanisms. To address these limitations, we propose MM-DREX: a Multimodal-driven, Dynamically-Routed EXpert framework based on large language models. MM-DREX explicitly decouples market state perception from strategy execution to enable adaptive sequential decision-making in non-stationary environments. Specifically, it (1) introduces a vision-language model (VLM)-powered dynamic router that jointly analyzes candlestick chart patterns and long-term temporal features to allocate real-time expert weights; (2) designs four heterogeneous trading experts (trend, reversal, breakout, positioning) generating specialized fine-grained sub-strategies; and (3) proposes an SFT-RL hybrid training paradigm to synergistically optimize the router's market classification capability and experts' risk-adjusted decision-making. Extensive experiments on multi-modal datasets spanning stocks, futures, and cryptocurrencies demonstrate that MM-DREX significantly outperforms 15 baselines (including state-of-the-art financial LLMs and deep reinforcement learning models) across key metrics: total return, Sharpe ratio, and maximum drawdown, validating its robustness and generalization. Additionally, an interpretability module traces routing logic and expert behavior in real time, providing an audit trail for strategy transparency.
Problem

Research questions and friction points this paper is trying to address.

Adapting to non-stationary financial markets with multimodal data complexity
Overcoming static strategies in LLM-driven trading with dynamic adjustment
Addressing homogeneous expert designs lacking fine-grained decision mechanisms
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dynamic router analyzes charts and temporal features
Heterogeneous experts generate specialized sub-strategies
Hybrid training optimizes market classification and decisions
🔎 Similar Papers
No similar papers found.
Y
Yang Chen
Zhejiang University, China
Y
Yueheng Jiang
Zhejiang University, China
Z
Zhaozhao Ma
Zhejiang University, China
Yuchen Cao
Yuchen Cao
Carnegie Mellon University
Spatial ComputingComputer VisionArtificial IntelligenceExtended Reality
J
Jacky Keung
City University of Hong Kong, Hong Kong SAR, China
Kun Kuang
Kun Kuang
Zhejiang University
Causal InferenceData MiningMachine Learning
Leilei Gan
Leilei Gan
Zhejiang University
NLPLLMsMultimodal LLMsAI+X
Y
Yiquan Wu
Zhejiang University, China
F
Fei Wu
Zhejiang University, China