The Traffickers' Pitch: Detecting Deceptive Recruitment in Online Job Boards

📅 2026-05-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the exploitation of online job platforms by human traffickers who use fraudulent job postings to lure victims, a threat inadequately mitigated by current safeguards that predominantly focus on post-recruitment intervention. To enable proactive detection during the recruitment phase, the authors propose a web-driven annotation framework to construct the first large-scale dataset of high-risk job advertisements. Integrating natural language processing with an ensemble of classification models, the research systematically analyzes linguistic and behavioral distinctions between legitimate and high-risk postings. The findings reveal, for the first time, structured recruitment strategies employed by traffickers—evident in patterns related to geographic targeting, gender preferences, industry sectors, and contact methods—thereby significantly improving early detection accuracy and offering actionable insights for platform-level preventive mechanisms.
📝 Abstract
While substantial efforts in anti-trafficking research and practice have focused on identifying and assisting victims after exploitation occurs, comparatively less attention has been paid to preventing victimization at the recruitment stage. Although some platforms offer preventive tools, such as background checks triggered by in-person meeting detection, these measures primarily protect potential victims rather than directly limiting traffickers' recruitment activities. In this paper, we propose a computational framework to identify human trafficking recruiters through their linguistic features and to characterize their online recruitment patterns. We introduce a network-driven labeling method to construct large-scale ground truth for trafficking-at-risk job advertisements. Our results reveal significant linguistic differences between safe and risky advertisements and demonstrate that language models and embedding representations behave distinctly across these linguistic spaces. Building on these insights, we propose a multi-model ensemble classifier to improve the detection of trafficking-at-risk job ads. Finally, we analyze the geographic, gender, industry, and contact-method preferences of trafficking recruiters, revealing systematic patterns in recruitment strategies.
Problem

Research questions and friction points this paper is trying to address.

human trafficking
deceptive recruitment
online job boards
victim prevention
recruitment stage
Innovation

Methods, ideas, or system contributions that make the work stand out.

computational framework
network-driven labeling
linguistic features
multi-model ensemble classifier
trafficking-at-risk detection
🔎 Similar Papers
No similar papers found.
S
Siyi Zhou
Annenberg School of Communication and Journalism, University of Southern California
P
Peiran Qiu
Thomas Lord Department of Computer Science, School of Advanced Computing, University of Southern California
Tanishq Salkar
Tanishq Salkar
Student
Data Science
L
Leonardo Blas Urrutia
Thomas Lord Department of Computer Science, School of Advanced Computing, University of Southern California; Information Science Institute, University of Southern California
D
Dacheng Shen
Thomas Lord Department of Computer Science, School of Advanced Computing, University of Southern California
D
Deyang Hsu
Thomas Lord Department of Computer Science, School of Advanced Computing, University of Southern California
Eun Cheol Choi
Eun Cheol Choi
Annenberg School for Communication and Journalism, University of Southern California
Computational Social ScienceSocial NetworksAI/MLMisinformationFact-checking
Emilio Ferrara
Emilio Ferrara
Professor of Computer Science at the University of Southern California
Human-Centered AISocial ComputingNetwork ScienceAI SafetyComputational Social Science