PiMRef: Detecting and Explaining Ever-evolving Spear Phishing Emails with Knowledge Base Invariants

📅 2025-07-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traditional rule-based and feature-engineering approaches fail against highly realistic, dynamically evolving phishing emails generated by large language models (LLMs). To address this, we reframe detection as “sender identity fact-checking”: extracting identity claims from emails and verifying their consistency against a trusted knowledge base, using identity contradictions as the primary discriminative signal. We introduce, for the first time, a reference-based detection mechanism grounded in knowledge-base invariance, integrating sender identity parsing, domain legitimacy verification, and action-inducing phrase identification into an end-to-end interpretable framework. Evaluated on standard benchmarks, our method achieves an 8.8% accuracy gain without compromising recall. In real-world deployment, it attains 92.1% accuracy and 87.9% recall, with an average inference latency of only 0.05 seconds—substantially outperforming state-of-the-art alternatives.

Technology Category

Application Category

📝 Abstract
Phishing emails are a critical component of the cybercrime kill chain due to their wide reach and low cost. Their ever-evolving nature renders traditional rule-based and feature-engineered detectors ineffective in the ongoing arms race between attackers and defenders. The rise of large language models (LLMs) further exacerbates the threat, enabling attackers to craft highly convincing phishing emails at minimal cost. This work demonstrates that LLMs can generate psychologically persuasive phishing emails tailored to victim profiles, successfully bypassing nearly all commercial and academic detectors. To defend against such threats, we propose PiMRef, the first reference-based phishing email detector that leverages knowledge-based invariants. Our core insight is that persuasive phishing emails often contain disprovable identity claims, which contradict real-world facts. PiMRef reframes phishing detection as an identity fact-checking task. Given an email, PiMRef (i) extracts the sender's claimed identity, (ii) verifies the legitimacy of the sender's domain against a predefined knowledge base, and (iii) detects call-to-action prompts that push user engagement. Contradictory claims are flagged as phishing indicators and serve as human-understandable explanations. Compared to existing methods such as D-Fence, HelpHed, and ChatSpamDetector, PiMRef boosts precision by 8.8% with no loss in recall on standard benchmarks like Nazario and PhishPot. In a real-world evaluation of 10,183 emails across five university accounts over three years, PiMRef achieved 92.1% precision, 87.9% recall, and a median runtime of 0.05s, outperforming the state-of-the-art in both effectiveness and efficiency.
Problem

Research questions and friction points this paper is trying to address.

Detect ever-evolving spear phishing emails effectively
Leverage knowledge-based invariants for phishing detection
Provide human-understandable explanations for flagged emails
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses knowledge-based invariants for detection
Reframes phishing as identity fact-checking task
Verifies sender domain against predefined knowledge base
🔎 Similar Papers
No similar papers found.