🤖 AI Summary
This study addresses the challenge of automated enterprise risk factor extraction in financial risk identification. We propose the first seven-dimensional corporate risk modeling paradigm—covering supply chain, regulatory, competitive, and other critical dimensions. Leveraging 744 manually annotated Bloomberg news articles, we fine-tune a pre-trained language model (rather than relying on large language models with zero- or few-shot prompting), achieving significant performance gains over multiple baselines on risk factor identification. The model is deployed at scale on 277,000 news articles to generate enterprise-level and industry-level risk insights. Key contributions include: (1) a human-interpretable, multi-dimensional enterprise risk taxonomy; (2) empirical validation that fine-tuned medium-scale language models outperform larger alternatives on domain-specific financial text, demonstrating practical efficacy; and (3) the first large-scale, news-derived dynamic corporate risk knowledge graph capturing temporal risk evolution across firms and sectors.
📝 Abstract
Identifying risks associated with a company is important to investors and the well-being of the overall financial market. In this study, we build a computational framework to automatically extract company risk factors from news articles. Our newly proposed schema comprises seven distinct aspects, such as supply chain, regulations, and competitions. We sample and annotate 744 news articles and benchmark various machine learning models. While large language models have achieved huge progress in various types of NLP tasks, our experiment shows that zero-shot and few-shot prompting state-of-the-art LLMs (e.g. LLaMA-2) can only achieve moderate to low performances in identifying risk factors. And fine-tuned pre-trained language models are performing better on most of the risk factors. Using this model, we analyze over 277K Bloomberg news articles and demonstrate that identifying risk factors from news could provide extensive insight into the operations of companies and industries.