🤖 AI Summary
This study addresses the exploitation of online job platforms by human traffickers who use fraudulent job postings to lure victims, a threat inadequately mitigated by current safeguards that predominantly focus on post-recruitment intervention. To enable proactive detection during the recruitment phase, the authors propose a web-driven annotation framework to construct the first large-scale dataset of high-risk job advertisements. Integrating natural language processing with an ensemble of classification models, the research systematically analyzes linguistic and behavioral distinctions between legitimate and high-risk postings. The findings reveal, for the first time, structured recruitment strategies employed by traffickers—evident in patterns related to geographic targeting, gender preferences, industry sectors, and contact methods—thereby significantly improving early detection accuracy and offering actionable insights for platform-level preventive mechanisms.
📝 Abstract
While substantial efforts in anti-trafficking research and practice have focused on identifying and assisting victims after exploitation occurs, comparatively less attention has been paid to preventing victimization at the recruitment stage. Although some platforms offer preventive tools, such as background checks triggered by in-person meeting detection, these measures primarily protect potential victims rather than directly limiting traffickers' recruitment activities. In this paper, we propose a computational framework to identify human trafficking recruiters through their linguistic features and to characterize their online recruitment patterns. We introduce a network-driven labeling method to construct large-scale ground truth for trafficking-at-risk job advertisements. Our results reveal significant linguistic differences between safe and risky advertisements and demonstrate that language models and embedding representations behave distinctly across these linguistic spaces. Building on these insights, we propose a multi-model ensemble classifier to improve the detection of trafficking-at-risk job ads. Finally, we analyze the geographic, gender, industry, and contact-method preferences of trafficking recruiters, revealing systematic patterns in recruitment strategies.