🤖 AI Summary
In exploratory data analysis (EDA), cell-based programming in computational notebooks impedes data state tracking and imposes high cognitive load. To address this, we propose “Visual Goggles”: a lightweight framework that tightly couples chart recommendation with dataflow tracing. It is the first to integrate data lineage analysis, visualization encoding rule modeling, and context-aware recommendation—natively embedded within the Jupyter ecosystem. The system dynamically adapts visual encodings to reflect underlying data transformations, ensuring visual consistency and analytical continuity. A user study demonstrates that Visual Goggles significantly improves EDA interpretability, enabling users to rapidly grasp data evolution trajectories and identify salient patterns. Our core contribution is the first realization of synergistic, lineage-driven chart recommendation and dynamic visualization encoding—establishing an interpretable, sustainable visual feedback mechanism for interactive data analysis.
📝 Abstract
Exploratory Data Analysis (EDA) is a routine task for data analysts, often conducted using flexible computational notebooks. During EDA, data workers process, visualize, and interpret data tables, making decisions about subsequent analysis. However, the cell-by-cell programming approach, while flexible, can lead to disorganized code, making it difficult to trace the state of data tables across cells and increasing the cognitive load on data workers. This paper introduces NoteFlow, a notebook library that recommends charts as ``sight glasses'' for data tables, allowing users to monitor their dynamic updates throughout the EDA process. To ensure visual consistency and effectiveness, NoteFlow adapts chart encodings in response to data transformations, maintaining a coherent and insightful representation of the data. The proposed method was evaluated through user studies, demonstrating its ability to provide an overview of the EDA process and convey critical insights in the data tables.