PsyScam: A Benchmark for Psychological Techniques in Real-World Scams

πŸ“… 2025-05-21
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Existing fraud benchmarks lack modeling of psychological tactics (PTs) employed in real-world scams. This paper introduces PsyScamβ€”the first benchmark dataset grounded in authentic scam reports and rigorously informed by cognitive and psychological theories. It systematically curates cases from six major platforms and provides fine-grained human annotations. Methodologically, it pioneers the integration of empirically validated psychological theories into cybersecurity fraud analysis, enabling three core tasks: PT classification, scam text completion, and PT-aware text augmentation. An adversarial evaluation framework is also proposed. Experiments reveal substantial deficiencies in state-of-the-art large language models for both PT identification and generation. PsyScam fills a critical gap in interpretable PT modeling and evaluation, and all data and code are publicly released to advance interdisciplinary research at the intersection of explainable AI and anti-fraud technology.

Technology Category

Application Category

πŸ“ Abstract
Online scams have become increasingly prevalent, with scammers using psychological techniques (PTs) to manipulate victims. While existing research has developed benchmarks to study scammer behaviors, these benchmarks do not adequately reflect the PTs observed in real-world scams. To fill this gap, we introduce PsyScam, a benchmark designed to systematically capture and evaluate PTs embedded in real-world scam reports. In particular, PsyScam bridges psychology and real-world cyber security analysis through collecting a wide range of scam reports from six public platforms and grounding its annotations in well-established cognitive and psychological theories. We further demonstrate PsyScam's utility through three downstream tasks: PT classification, scam completion, and scam augmentation. Experimental results show that PsyScam presents significant challenges to existing models in both detecting and generating scam content based on the PTs used by real-world scammers. Our code and dataset are available at: https://anonymous.4open.science/r/PsyScam-66E4.
Problem

Research questions and friction points this paper is trying to address.

Lack of benchmarks for psychological techniques in scams
Need to bridge psychology and cybersecurity in scam analysis
Challenges in detecting and generating scam content accurately
Innovation

Methods, ideas, or system contributions that make the work stand out.

Introduces PsyScam benchmark for real-world scam PTs
Bridges psychology and cybersecurity via multi-platform reports
Validates with PT classification and scam generation tasks
πŸ”Ž Similar Papers
No similar papers found.
Shang Ma
Shang Ma
University of Notre Dame
Software SecurityProgram Analysis
T
Tianyi Ma
University of Notre Dame
J
Jiahao Liu
National University of Singapore
W
Wei Song
University of Notre Dame
Zhenkai Liang
Zhenkai Liang
School of Computing, National University of Singapore
System SecurityWeb SecuritySoftware Debugging
X
Xusheng Xiao
Arizona State University
Y
Yanfang Ye
University of Notre Dame