🤖 AI Summary
Balancing privacy preservation and model accuracy remains challenging in mobile network promotional content detection. Method: This paper proposes a three-tier collaborative defense framework: (1) a fine-grained semantic modeling layer based on Transformer architectures (BERT/GPT-2); (2) an adversarial training layer to enhance robustness; and (3) a novel syntax-level differential privacy mechanism integrating multilingual back-translation, character-level noise injection, and entity anonymization—perturbing both inputs and label distributions at the syntactic level. The decoupled architecture enables lightweight deployment on mobile devices while supporting controllable privacy–utility trade-offs. Results: The baseline achieves F1 scores of 0.89–0.90; with full three-tier protection, F1 drops moderately to 0.83, yet privacy leakage is significantly reduced and practical detection accuracy is retained—demonstrating the feasibility of simultaneously achieving strong privacy guarantees and high detection performance.
📝 Abstract
The proliferation of propaganda on mobile platforms raises critical concerns around detection accuracy and user privacy. To address this, we propose TRIDENT - a three-tier propaganda detection model implementing transformers, adversarial learning, and differential privacy which integrates syntactic obfuscation and label perturbation to mitigate privacy leakage while maintaining propaganda detection accuracy. TRIDENT leverages multilingual back-translation to introduce semantic variance, character-level noise, and entity obfuscation for differential privacy enforcement, and combines these techniques into a unified defense mechanism. Using a binary propaganda classification dataset, baseline transformer models (BERT, GPT-2) we achieved F1 scores of 0.89 and 0.90. Applying TRIDENT's third-tier defense yields a reduced but effective cumulative F1 of 0.83, demonstrating strong privacy protection across mobile ML deployments with minimal degradation.