Janus-Q: End-to-End Event-Driven Trading via Hierarchical-Gated Reward Modeling

📅 2026-02-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work proposes the first end-to-end event-driven trading framework to address the limitations of existing learning-based trading systems, which struggle to effectively leverage financial news due to the absence of large-scale, event-centric datasets and misalignment between language models and trading objectives. The authors construct a novel dataset comprising 62,400 annotated news articles that explicitly links events to market reactions. Building upon this resource, they integrate supervised fine-tuning with reinforcement learning and introduce a Hierarchical Gated Reward Model (HGRM) to explicitly balance multiple trading objectives. The proposed approach achieves over a 17.5% improvement in directional prediction accuracy and up to a 102.0% increase in Sharpe ratio, significantly outperforming both market benchmarks and large language model baselines, thereby enabling more consistent, interpretable, and profitable trading decisions.

Technology Category

Application Category

📝 Abstract
Financial market movements are often driven by discrete financial events conveyed through news, whose impacts are heterogeneous, abrupt, and difficult to capture under purely numerical prediction objectives. These limitations have motivated growing interest in using textual information as the primary source of trading signals in learning-based systems. Two key challenges hinder existing approaches: (1) the absence of large-scale, event-centric datasets that jointly model news semantics and statistically grounded market reactions, and (2) the misalignment between language model reasoning and financially valid trading behavior under dynamic market conditions. To address these challenges, we propose Janus-Q, an end-to-end event-driven trading framework that elevates financial news events from auxiliary signals to primary decision units. Janus-Q unifies event-centric data construction and model optimization under a two-stage paradigm. Stage I focuses on event-centric data construction, building a large-scale financial news event dataset comprising 62,400 articles annotated with 10 fine-grained event types, associated stocks, sentiment labels, and event-driven cumulative abnormal return (CAR). Stage II performs decision-oriented fine-tuning, combining supervised learning with reinforcement learning guided by a Hierarchical Gated Reward Model (HGRM), which explicitly captures trade-offs among multiple trading objectives. Extensive experiments demonstrate that Janus-Q achieves more consistent, interpretable, and profitable trading decisions than market indices and LLM baselines, improving the Sharpe Ratio by up to 102.0% while increasing direction accuracy by over 17.5% compared to the strongest competing strategies.
Problem

Research questions and friction points this paper is trying to address.

event-driven trading
financial news
market reaction
language model alignment
trading signals
Innovation

Methods, ideas, or system contributions that make the work stand out.

event-driven trading
hierarchical gated reward model
financial news events
cumulative abnormal return
end-to-end reinforcement learning
🔎 Similar Papers
X
Xiang Li
The Hong Kong University of Science and Technology (Guangzhou)
Z
Zikai Wei
International Digital Economy Academy
Yiyan Qi
Yiyan Qi
IDEA
W
Wanyun Zhou
The Hong Kong University of Science and Technology (Guangzhou)
Xiang Liu
Xiang Liu
HKUST(GZ)
LLM AgentPhysics
P
Penglei Sun
The Hong Kong University of Science and Technology (Guangzhou)
Yongqi Zhang
Yongqi Zhang
Assistant Professor in HKUST(GZ)
Graph learningDrug discoveryDeep learning
Xiaowen Chu
Xiaowen Chu
IEEE Fellow, Professor, Data Science and Analytics, HKUST(GZ)
GPU ComputingMachine Learning SystemsParallel and Distributed ComputingWireless Networks