Data-Induced Groupings and How To Find Them

📅 2026-01-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the cognitive bias in scatterplots where “data-induced grouping”—arising from the interplay between data values and visual encoding—leads users to misinterpret spatial arrangements as meaningful patterns. Through two user studies, the authors systematically demonstrate the prevalence of this phenomenon, develop the first perceptual model capable of predicting whether users perceive a given set of points as a coherent group, and propose a visualization intervention strategy that integrates user perception with data reordering. Notably, the model effectively captures users’ tendency to group points based on trends even in nominal data contexts. Applied to visualization diagnosis and optimization, this approach significantly enhances the accuracy and reliability of graphical representations.

Technology Category

Application Category

📝 Abstract
Making sense of a visualization requires the reader to consider both the visualization design and the underlying data values. Existing work in the visualization community has largely considered affordances driven by visualization design elements, such as color or chart type, but how visual design interacts with data values to impact interpretation and reasoning has remained under-explored. Dot plots and bar graphs are commonly used to help users identify groups of points that form trends and clusters, but are liable to manifest groupings that are artifacts of spatial arrangement rather than inherent patterns in the data itself. These ``Data-induced Groups''can drive suboptimal data comparisons and potentially lead the user to incorrect conclusions. We conduct two user studies using dot plots as a case study to understand the prevalence of data-induced groupings. We find that users rely on data-induced groupings in both conditions despite the fact that trend-based groupings are irrelevant in nominal data. Based on the study results, we build a model to predict whether users are likely to perceive a given set of dot plot points as a group. We discuss two use cases illustrating how the model can assist visualization designers by both diagnosing potential user-perceived groupings in dot plots and offering redesigns that better accentuate desired groupings through data rearrangement.
Problem

Research questions and friction points this paper is trying to address.

data-induced groupings
visualization interpretation
dot plots
visual perception
data artifacts
Innovation

Methods, ideas, or system contributions that make the work stand out.

data-induced grouping
visualization perception
dot plots
user study
predictive model
🔎 Similar Papers
No similar papers found.