Enhancing Financial Report Question-Answering: A Retrieval-Augmented Generation System with Reranking Analysis

📅 2026-02-18
📈 Citations: 7
Influential: 0
📄 PDF

career value

168K/year
📝 Abstract
Financial analysts face significant challenges extracting information from lengthy 10-K reports, which often exceed 100 pages. This paper presents a Retrieval-Augmented Generation (RAG) system designed to answer questions about S&P 500 financial reports and evaluates the impact of neural reranking on system performance. Our pipeline employs hybrid search combining full-text and semantic retrieval, followed by an optional reranking stage using a cross-encoder model. We conduct systematic evaluation using the FinDER benchmark dataset, comprising 1,500 queries across five experimental groups. Results demonstrate that reranking significantly improves answer quality, achieving 49.0 percent correctness for scores of 8 or above compared to 33.5 percent without reranking, representing a 15.5 percentage point improvement. Additionally, the error rate for completely incorrect answers decreases from 35.3 percent to 22.5 percent. Our findings emphasize the critical role of reranking in financial RAG systems and demonstrate performance improvements over baseline methods through modern language models and refined retrieval strategies.
Problem

Research questions and friction points this paper is trying to address.

financial report question-answering
10-K reports
information extraction
RAG systems
FinDER benchmark
Innovation

Methods, ideas, or system contributions that make the work stand out.

Retrieval-Augmented Generation
neural reranking
hybrid search
financial question answering
cross-encoder
🔎 Similar Papers
No similar papers found.