FIND: Toward Multimodal Financial Reasoning and Question Answering for Indic Languages

πŸ“… 2026-05-13
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF

career value

166K/year
πŸ€– AI Summary
This study addresses the absence of benchmarks and models capable of supporting multilingual, multimodal financial numerical reasoning and question answering for Indic languagesβ€”a critical gap that undermines reliable high-stakes financial decision-making. To bridge this gap, the authors introduce FinVQA, the first benchmark encompassing six Indian languages with 18,900 samples spanning 14 financial domains and four question types. They further propose FIND, a novel framework integrating supervised fine-tuning, constraint-aware decoding, and multimodal alignment mechanisms to simultaneously ensure numerical precision and cross-lingual semantic consistency. Experimental results demonstrate that FIND substantially outperforms existing baselines on multilingual multimodal financial QA tasks.
πŸ“ Abstract
Financial decision-making in multilingual settings demands accurate numerical reasoning grounded in diverse modalities, yet existing benchmarks largely overlook this high-stakes, real-world challenge, especially for Indic languages. We introduce FinVQA, a benchmark for evaluating financial numerical and multimodal reasoning in multilingual Indic contexts. FinVQA spans English, Hindi, Bengali, Marathi, Gujarati, and Tamil, and comprises 18,900 samples across 14 financial domains. The dataset captures diverse reasoning paradigms under realistic constraints, and is structured across three difficulty levels (easy, moderate, hard) and four question formats: multiple choice, fill-in-the-blank, table matching, and true/false. To address these challenges, we propose FIND, a framework that combines supervised fine-tuning with constraint-aware decoding to promote faithful numerical reasoning, robust multimodal grounding, and structured decision-making. Together, FinVQA and FIND establish a rigorous evaluation and modeling paradigm for high-stakes multilingual multimodal financial reasoning.
Problem

Research questions and friction points this paper is trying to address.

multimodal financial reasoning
Indic languages
numerical reasoning
financial question answering
multilingual benchmark
Innovation

Methods, ideas, or system contributions that make the work stand out.

multimodal financial reasoning
Indic languages
constraint-aware decoding
FinVQA benchmark
numerical question answering
πŸ”Ž Similar Papers
No similar papers found.