Breaking the illusion: Automated Reasoning of GDPR Consent Violations

📅 2025-12-28
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Current automated auditing tools for GDPR/CCPA compliance lack systematic support for diverse web-based consent forms—extending beyond cookie banners—to verify core legal requirements such as freedom of choice, purpose specification, and ease of withdrawal. Method: This paper introduces Cosmic, the first end-to-end automated framework for detecting GDPR consent violations across heterogeneous web forms. Cosmic integrates DOM parsing, OCR-enhanced form recognition, legal semantic modeling, and structured joint reasoning over form elements to enable interpretable, requirement-specific validation. Contribution/Results: Evaluated on 5,823 websites and 3,598 consent forms, Cosmic achieves a true positive rate (TPR) of 98.6% for consent-form detection and 99.1% for violation identification, covering 94.1% of identified consent forms and detecting 3,384 distinct violations. Cosmic fills a critical gap in form-level consent auditing and establishes a novel paradigm for automated, explainable privacy compliance assessment.

Technology Category

Application Category

📝 Abstract
Recent privacy regulations such as the General Data Protection Regulation (GDPR) and the California Consumer Privacy Act (CCPA) have established legal requirements for obtaining user consent regarding the collection, use, and sharing of personal data. These regulations emphasize that consent must be informed, freely given, specific, and unambiguous. However, there are still many violations, which highlight a gap between legal expectations and actual implementation. Consent mechanisms embedded in functional web forms across websites play a critical role in ensuring compliance with data protection regulations such as the GDPR and CCPA, as well as in upholding user autonomy and trust. However, current research has primarily focused on cookie banners and mobile app dialogs. These forms are diverse in structure, vary in legal basis, and are often difficult to locate or evaluate, creating a significant challenge for automated consent compliance auditing. In this work, we present Cosmic, a novel automated framework for detecting consent-related privacy violations in web forms. We evaluate our developed tool for auditing consent compliance in web forms, across 5,823 websites and 3,598 forms. Cosmic detects 3,384 violations on 94.1% of consent forms, covering key GDPR principles such as freely given consent, purpose disclosure, and withdrawal options. It achieves 98.6% and 99.1% TPR for consent and violation detection, respectively, demonstrating high accuracy and real-world applicability.
Problem

Research questions and friction points this paper is trying to address.

Automated detection of GDPR consent violations in web forms
Addressing the gap between legal requirements and actual implementation
Evaluating consent compliance across diverse and complex web forms
Innovation

Methods, ideas, or system contributions that make the work stand out.

Automated framework detects consent violations in web forms
Tool audits compliance across thousands of websites and forms
Achieves high accuracy in detecting GDPR principle violations
🔎 Similar Papers
No similar papers found.
Y
Ying Li
University of California, Los Angeles
W
Wenjun Qiu
University of Toronto
Faysal Hossain Shezan
Faysal Hossain Shezan
Assistant Professor, University of Texas at Arlington
Security and Privacy
K
Kunlin Cai
University of California, Los Angeles
M
Michelangelo van Dam
in2it
L
Lisa Austin
University of Toronto
David Lie
David Lie
University of Toronto
Computer ScienceComputer SecurityOperating SystemsVirtualizationComputer Architecture
Y
Yuan Tian
University of California, Los Angeles