CreditARF: A Framework for Corporate Credit Rating with Annual Report and Financial Feature Integration

πŸ“… 2025-08-02
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Existing corporate credit rating models predominantly rely on financial metrics and deep learning, overlooking rich credit signals embedded in unstructured textual sources such as annual reports. Method: This paper proposes the first multimodal credit rating framework integrating structured financial data and unstructured annual report text. It employs FinBERT to extract semantic features from textual content and introduces a novel dual-stream feature fusion mechanism that jointly models financial and textual modalities. Additionally, we construct CCRDβ€”the first large-scale, publicly available, comprehensive dataset for multimodal credit rating. Contribution/Results: Experiments demonstrate consistent performance gains across multiple benchmarks, improving rating accuracy by 8–12%. The proposed framework significantly enhances model discriminability and robustness. This work provides the first systematic empirical validation of the predictive value of non-financial textual information in credit assessment, establishing a reproducible methodology and foundational dataset for multi-source, heterogeneous-data-driven intelligent risk control.

Technology Category

Application Category

πŸ“ Abstract
Corporate credit rating serves as a crucial intermediary service in the market economy, playing a key role in maintaining economic order. Existing credit rating models rely on financial metrics and deep learning. However, they often overlook insights from non-financial data, such as corporate annual reports. To address this, this paper introduces a corporate credit rating framework that integrates financial data with features extracted from annual reports using FinBERT, aiming to fully leverage the potential value of unstructured text data. In addition, we have developed a large-scale dataset, the Comprehensive Corporate Rating Dataset (CCRD), which combines both traditional financial data and textual data from annual reports. The experimental results show that the proposed method improves the accuracy of the rating predictions by 8-12%, significantly improving the effectiveness and reliability of corporate credit ratings.
Problem

Research questions and friction points this paper is trying to address.

Integrates financial data and annual report text for credit rating
Improves rating prediction accuracy by 8-12%
Creates a large-scale dataset combining financial and textual data
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrates financial data with FinBERT-extracted annual report features
Develops Comprehensive Corporate Rating Dataset (CCRD)
Improves rating prediction accuracy by 8-12%
πŸ”Ž Similar Papers
No similar papers found.
Yumeng Shi
Yumeng Shi
Nanyang Technology University
efficient machine learningmultimodalityvideo understanding
Zhongliang Yang
Zhongliang Yang
Associate Professor, Beijing University of Posts and Telecommunications
AI SecurityFinTech
D
DiYang Lu
School of Cyberspace Security, Beijing University of Posts and Telecommunications, Beijing, China
Y
Yisi Wang
Guotai Junan Securities, Shanghai, China
Y
Yiting Zhou
Hebei University of Technology, Langfang, China
L
Linna Zhou
School of Cyberspace Security, Beijing University of Posts and Telecommunications, Beijing, China