TransactionGPT

📅 2025-11-12
🤖 AI Summary
To address the challenge of modeling massive, high-dimensional, and multimodal consumer transaction trajectories in one of the world's largest payment networks, this paper introduces TransactionGPT (TGPT), described as the first foundation model designed specifically for transaction data. Methodologically, it proposes a novel 3D-Transformer architecture that jointly models temporal dynamics, merchant/category semantic dimensions, and LLM-generated embedding dimensions, which improves multimodal fusion efficiency and representation capacity. The model supports both understanding and generation of transaction trajectories, and it unifies diverse downstream tasks (including sales forecasting, fraud detection, and user segmentation) under a single framework. Trained on over one billion real-world anonymized transactions, TGPT achieves an average accuracy improvement of 12.7% across multiple benchmarks, runs inference 3.2× faster than current production models, and shows strong capability in generating future trajectories. This work establishes a new paradigm for foundation models in transaction analytics.

📝 Abstract
We present TransactionGPT (TGPT), a foundation model for consumer transaction data within one of the world's largest payment networks. TGPT is designed to understand and generate transaction trajectories while simultaneously supporting a variety of downstream prediction and classification tasks. We introduce a novel 3D-Transformer architecture specifically tailored for capturing the complex dynamics in payment transaction data. This architecture incorporates design innovations that enhance modality fusion and computational efficiency, while seamlessly enabling joint optimization with downstream objectives. Trained on billion-scale real-world transactions, TGPT significantly improves downstream classification performance against a competitive production model and exhibits advantages over baselines in generating future transactions. We conduct extensive empirical evaluations utilizing a diverse collection of company transaction datasets spanning multiple downstream tasks, thereby enabling a thorough assessment of TGPT's effectiveness and efficiency in comparison to established methodologies. Furthermore, we examine the incorporation of LLM-derived embeddings within TGPT and benchmark its performance against fine-tuned LLMs, demonstrating that TGPT achieves superior predictive accuracy as well as faster training and inference. We anticipate that the architectural innovations and practical guidelines from this work will advance foundation models for transaction-like data and catalyze future research in this emerging field.
Problem

Research questions and friction points this paper is trying to address.

Develops a foundation model for consumer transaction data analysis
Introduces 3D-Transformer for payment transaction dynamics modeling
Enhances downstream prediction tasks and transaction generation capabilities
Innovation

Methods, ideas, or system contributions that make the work stand out.

Novel 3D-Transformer architecture for transaction dynamics
Enhanced modality fusion and computational efficiency design
Joint optimization with downstream prediction tasks
Authors

Yingtong Dou
Research Scientist, Visa Inc.
Graph Mining, Fraud Detection, Applied Machine Learning
Zhimeng Jiang
Visa Research
Tianyi Zhang
Visa Research
Mingzhi Hu
Worcester Polytechnic Institute
Zhichao Xu
Amazon AWS, University of Utah
natural language processing, information retrieval
Shubham Jain
Visa Research
Uday Singh Saini
Graduate Student, Multi Aspect Data Lab, University of California Riverside
Machine Learning, Tensor Decomposition, Optimization, Natural Language Processing, Neural Networks
Xiran Fan
Visa Research
Jiarui Sun
Visa Research
Menghai Pan
Visa Research
Foundation model, reinforcement learning, recommendation, graph learning, sequence model
Junpeng Wang
Research Scientist, Visa Research
Visual Analytics, Explainable AI, Deep Learning
Xin Dai
Research Scientist, Visa Research
Data Mining
Liang Wang
Visa Research
Chin-Chia Michael Yeh
Visa Research
Data Mining, Time Series, Machine learning
Yujie Fan
Visa Research
Graph Learning, Spatial-Temporal Modeling
Vineet Rakesh
Visa Research
Huiyuan Chen
Amazon
Machine Learning, Deep Learning, Recommender Systems
M. Bendre
Visa Research
Zhongfang Zhuang
Unknown affiliation
Deep Learning, Data Mining, Big Data Analytics, Data Science, Data Management
Xiaoting Li
Samsung Ads
Data Mining, Graph Learning, Adversarial Machine Learning
P. Aboagye
Visa Research
V. Lai
Visa Research
Minghua Xu
Visa Research
Hao Yang
Visa Research
Yi-Jun Cai
Visa Research
Mahashweta Das
Visa Research
Yuzhong Chen
UESTC
Deep Learning