π€ AI Summary
Quantifying privacy risks in social networks is challenging due to the coexistence of heterogeneous, multi-source dataβuser attributes, social graph structures, and user-generated content (UGC).
Method: This paper proposes the first unified privacy risk scoring framework integrating structural similarity, entity-level sensitivity analysis, and graph-sensitive weighting. It combines graph neural networks (GNNs), natural language processing (NLP), and named entity recognition (NER) to enable fine-grained, interpretable, and personalized risk modeling, supporting cross-dimensional joint assessment and visualizable insights.
Contribution/Results: Evaluated on real-world datasets, the framework achieves an overall average risk score of 0.478 and 0.501 in graph-centric scenarios. A user study confirms high usability: 85% of participants rated its clarity and practicality as satisfactory. This work establishes a novel, dynamic, and scalable paradigm for social privacy risk assessment.
π Abstract
The rise of social networking platforms has amplified privacy threats as users increasingly share sensitive information across profiles, content, and social connections. We present a Comprehensive Privacy Risk Scoring (CPRS) framework that quantifies privacy risk by integrating user attributes, social graph structures, and user-generated content. Our framework computes risk scores across these dimensions using sensitivity, visibility, structural similarity, and entity-level analysis, then aggregates them into a unified risk score. We validate CPRS on two real-world datasets: the SNAP Facebook Ego Network (4,039 users) and the Koo microblogging dataset (1M posts, 1M comments). The average CPRS is 0.478 with equal weighting, rising to 0.501 in graph-sensitive scenarios. Component-wise, graph-based risks (mean 0.52) surpass content (0.48) and profile attributes (0.45). High-risk attributes include email, date of birth, and mobile number. Our user study with 100 participants shows 85% rated the dashboard as clear and actionable, confirming CPRS's practical utility. This work enables personalized privacy risk insights and contributes a holistic, scalable methodology for privacy management. Future directions include incorporating temporal dynamics and multimodal content for broader applicability.