Financial Transaction Retrieval and Contextual Evidence for Knowledge-Grounded Reasoning

📅 2026-03-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge that general-purpose large language models struggle to effectively model highly sequential and structured financial transaction data, while existing specialized models rely heavily on extensive labeled data and exhibit limited generalization. To overcome these limitations, the authors propose FinTRACE, a novel architecture that introduces a retrieval-first paradigm: it transforms raw transaction records into reusable feature representations, constructs a behavior knowledge base via rule-based detectors, and leverages a retrieval mechanism to supply contextual evidence for downstream tasks. Notably, FinTRACE is the first to incorporate retrieved behavioral patterns into instruction tuning of large language models, aligning structured memory with task objectives. Evaluated on multiple public and industrial benchmarks, FinTRACE achieves state-of-the-art performance under low-supervision settings, improving zero-shot churn prediction MCC from 0.19 to 0.38 and reaching 0.40 MCC with only 16 labeled samples.

Technology Category

Application Category

📝 Abstract
Nowadays, success of financial organizations heavily depends on their ability to process digital traces generated by their clients, e.g., transaction histories, gathered from various sources to improve user modeling pipelines. As general-purpose LLMs struggle with time-distributed tabular data, production stacks still depend on specialized tabular and sequence models with limited transferability and need for labeled data. To address this, we introduce FinTRACE, a retrieval-first architecture that converts raw transactions into reusable feature representations, applies rule-based detectors, and stores the resulting signals in a behavioral knowledge base with graded associations to the objectives of downstream tasks. Across public and industrial benchmarks, FinTRACE substantially improves low-supervision transaction analytics, doubling zero-shot MCC on churn prediction performance from 0.19 to 0.38 and improving 16-shot MCC from 0.25 to 0.40. We further use FinTRACE to ground LLMs via instruction tuning on retrieved behavioral patterns, achieving state-of-the-art LLM results on transaction analytics problems.
Problem

Research questions and friction points this paper is trying to address.

financial transaction retrieval
knowledge-grounded reasoning
low-supervision learning
tabular data
behavioral knowledge base
Innovation

Methods, ideas, or system contributions that make the work stand out.

retrieval-first architecture
behavioral knowledge base
low-supervision transaction analytics
instruction tuning
feature representation
🔎 Similar Papers
No similar papers found.
A
Artem Sakhno
Sber AI Lab, Moscow, Russian Federation
D
Daniil Tomilov
Sber AI Lab, Moscow, Russian Federation
Y
Yuliana Shakhvalieva
Sber AI, Moscow, Russian Federation
I
Inessa Fedorova
Sber AI, Moscow, Russian Federation
D
Daria Ruzanova
Sber AI, Moscow, Russian Federation
Omar Zoloev
Omar Zoloev
Researcher @ Sber AI Lab
LLMcontrastive learningcuda
Andrey Savchenko
Andrey Savchenko
Sber AI Lab; HSE University - Nizhny Novgorod
Computer VisionPattern RecognitionMachine LearningSpeech ProcessingImage Processing
Maksim Makarenko
Maksim Makarenko
Senior Researcher @ Sber AI Lab
Machine LearningData-driven ModellingComputer VisionOptimizationOptics