Comprehensive Classification of Web Tracking Systems: Technological In-sights and Analysis

📅 2025-04-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the lack of a technically grounded taxonomy for web tracking (WT) systems. We propose the first architecture- and mechanism-based six-dimensional classification framework, encompassing core dimensions such as the HTTP protocol layer, API integration, and user identification—moving beyond conventional function- or deployment-centric taxonomies. Methodologically, we combine technical analysis—including protocol inspection, interface modeling, and identification mechanism abstraction—with a large-scale online survey of over 1,000 participants, augmented by statistical hypothesis testing. Our findings reveal that over 76% of users possess only basic technical understanding; computer science background is the sole statistically significant predictor of comprehension, and technical knowledge shows no significant correlation with privacy concerns. The framework and empirical results jointly provide an extensible classification standard and evidence-based benchmark to inform WT system design, regulatory technology development, and targeted digital literacy interventions.

Technology Category

Application Category

📝 Abstract
Web tracking (WT) systems are advanced technologies used to monitor and analyze online user behavior. Initially focused on HTML and static webpages, these systems have evolved with the proliferation of IoT, edge computing, and Big Data, encompassing a broad array of interconnected devices with APIs, interfaces and computing nodes for interaction. WT systems are pivotal in technological innovation and business development, although trends like GDPR complicate data extraction and mandate transparency. Specifically, this study examines WT systems purely from a technological perspective, excluding organizational and privacy implications. A novel classification scheme based on technological architecture and principles is proposed, compared to two preexisting frameworks. The scheme categorizes WT systems into six classes, emphasizing technological mechanisms such as HTTP proto-cols, APIs, and user identification techniques. Additionally, a survey of over 1,000 internet users, conducted via Google Forms, explores user awareness of WT systems. Findings indicate that knowledge of WT technologies is largely unrelated to demographic factors such as age or gender but is strongly influenced by a user's background in computer science. Most users demonstrate only a basic understanding of WT tools, and this awareness does not correlate with heightened concerns about data misuse. As such, the research highlights gaps in user education about WT technologies and underscores the need for a deeper examination of their technical underpinnings. This study provides a foundation for further exploration of WT systems from multiple perspectives, contributing to advance-ments in classification, implementation, and user awareness.
Problem

Research questions and friction points this paper is trying to address.

Classifies web tracking systems by technological architecture and principles
Assesses user awareness gaps regarding web tracking technologies
Proposes a new taxonomy comparing existing web tracking frameworks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Novel classification scheme for tracking systems
Survey on user awareness via Google Forms
Focus on HTTP protocols and APIs
🔎 Similar Papers
No similar papers found.
T
Theofanis Tasoulas
Ionian University, Department of Informatics, Corfu, Greece
Alexandros Gazis
Alexandros Gazis
Researcher, Programming and Information Processing Laboratory, Democritus University of Thrace
Wireless Sensor NetworksSmart SensingUbiquitous ComputingBig Data AnalyticsInternet Of Things
A
A. Tsohou
Ionian University, Department of Informatics, Corfu, Greece