TradExpert: Revolutionizing Trading with Mixture of Expert LLMs

📅 2024-10-16
🏛️ arXiv.org
📈 Citations: 3
Influential: 1
📄 PDF
🤖 AI Summary
This paper addresses the challenge of integrating heterogeneous multi-source financial data—news, market quotes, Alpha factors, and fundamentals—in quantitative trading. We propose the first Mixture-of-Experts (MoE) Large Language Model (LLM) framework tailored for quant finance: four domain-specific expert LLMs process respective data modalities in parallel, while a generalist expert LLM unifies cross-modal representations for joint decision-making, supporting both return prediction and stock ranking tasks. Key contributions include: (1) the first MoE-LLM architecture designed specifically for finance; (2) a switchable dual-task mechanism enabling flexible inference modes; and (3) an open-source, large-scale financial evaluation benchmark. Leveraging multi-source prompt engineering, domain-adaptive fine-tuning, and unified structured/unstructured modeling, our framework achieves statistically significant improvements over monolithic LLMs and state-of-the-art baselines across multiple live-trading benchmarks, demonstrating both the efficacy of cross-modal financial insight fusion and strong real-world deployability.

Technology Category

Application Category

📝 Abstract
The integration of Artificial Intelligence (AI) in the financial domain has opened new avenues for quantitative trading, particularly through the use of Large Language Models (LLMs). However, the challenge of effectively synthesizing insights from diverse data sources and integrating both structured and unstructured data persists. This paper presents TradeExpert, a novel framework that employs a mix of experts (MoE) approach, using four specialized LLMs, each analyzing distinct sources of financial data, including news articles, market data, alpha factors, and fundamental data. The insights of these expert LLMs are further synthesized by a General Expert LLM to make a final prediction or decision. With specific prompts, TradeExpert can be switched between the prediction mode and the ranking mode for stock movement prediction and quantitative stock trading, respectively. In addition to existing benchmarks, we also release a large-scale financial dataset to comprehensively evaluate TradeExpert's effectiveness. Our experimental results demonstrate TradeExpert's superior performance across all trading scenarios.
Problem

Research questions and friction points this paper is trying to address.

Synthesizing insights from diverse financial data sources
Integrating structured and unstructured financial data effectively
Improving stock movement prediction and quantitative trading performance
Innovation

Methods, ideas, or system contributions that make the work stand out.

Mixture of Expert LLMs for diverse data analysis
General Expert LLM synthesizes insights for decisions
Switchable prediction and ranking modes for trading
🔎 Similar Papers
No similar papers found.