RKEFino1: A Regulation Knowledge-Enhanced Large Language Model

📅 2025-06-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the limitations of large language models (LLMs) in regulatory compliance and numerical accuracy for Digital Regulatory Reporting (DRR), this paper introduces RKEFino1, a domain-specific financial language model. Methodologically: (1) it proposes the first numerical Named Entity Recognition (NER) task tailored to regulatory reporting, supporting dual-modality (sentence and table) financial entity identification; (2) it pioneers the systematic integration of multi-source structured regulatory knowledge—including XBRL, the Common Data Model (CDM), and the Meta-Object Facility (MOF)—into a lightweight financial foundation model, enhanced via domain-knowledge-informed fine-tuning, multi-task joint training, and regulatory ontology alignment. Experiments demonstrate that RKEFino1 significantly outperforms both general-purpose and state-of-the-art financial LLMs on key compliance tasks—including regulatory knowledge question answering, mathematical reasoning, and numerical NER—while exhibiting strong generalization capability. The model is publicly released on Hugging Face.

Technology Category

Application Category

📝 Abstract
Recent advances in large language models (LLMs) hold great promise for financial applications but introduce critical accuracy and compliance challenges in Digital Regulatory Reporting (DRR). To address these issues, we propose RKEFino1, a regulation knowledge-enhanced financial reasoning model built upon Fino1, fine-tuned with domain knowledge from XBRL, CDM, and MOF. We formulate two QA tasks-knowledge-based and mathematical reasoning-and introduce a novel Numerical NER task covering financial entities in both sentences and tables. Experimental results demonstrate the effectiveness and generalization capacity of RKEFino1 in compliance-critical financial tasks. We have released our model on Hugging Face.
Problem

Research questions and friction points this paper is trying to address.

Enhance financial LLM accuracy and compliance in DRR
Integrate XBRL, CDM, MOF knowledge for financial reasoning
Address QA and Numerical NER tasks in finance
Innovation

Methods, ideas, or system contributions that make the work stand out.

Regulation knowledge-enhanced financial reasoning model
Fine-tuned with XBRL, CDM, and MOF knowledge
Introduces Numerical NER for financial entities
🔎 Similar Papers
No similar papers found.
Y
Yan Wang
Yale University
Yueru He
Yueru He
Columbia University
FinanceLarge Language Models
R
Ruoyu Xiang
New York University
J
Jeff Zhao
The University of Texas at Austin