NoteFlow: Recommending Charts as Sight Glasses for Tracing Data Flow in Computational Notebooks

📅 2025-02-04
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In exploratory data analysis (EDA), cell-based programming in computational notebooks impedes data state tracking and imposes high cognitive load. To address this, we propose “Visual Goggles”: a lightweight framework that tightly couples chart recommendation with dataflow tracing. It is the first to integrate data lineage analysis, visualization encoding rule modeling, and context-aware recommendation—natively embedded within the Jupyter ecosystem. The system dynamically adapts visual encodings to reflect underlying data transformations, ensuring visual consistency and analytical continuity. A user study demonstrates that Visual Goggles significantly improves EDA interpretability, enabling users to rapidly grasp data evolution trajectories and identify salient patterns. Our core contribution is the first realization of synergistic, lineage-driven chart recommendation and dynamic visualization encoding—establishing an interpretable, sustainable visual feedback mechanism for interactive data analysis.

Technology Category

Application Category

📝 Abstract
Exploratory Data Analysis (EDA) is a routine task for data analysts, often conducted using flexible computational notebooks. During EDA, data workers process, visualize, and interpret data tables, making decisions about subsequent analysis. However, the cell-by-cell programming approach, while flexible, can lead to disorganized code, making it difficult to trace the state of data tables across cells and increasing the cognitive load on data workers. This paper introduces NoteFlow, a notebook library that recommends charts as ``sight glasses'' for data tables, allowing users to monitor their dynamic updates throughout the EDA process. To ensure visual consistency and effectiveness, NoteFlow adapts chart encodings in response to data transformations, maintaining a coherent and insightful representation of the data. The proposed method was evaluated through user studies, demonstrating its ability to provide an overview of the EDA process and convey critical insights in the data tables.
Problem

Research questions and friction points this paper is trying to address.

Tracing data flow in computational notebooks
Reducing cognitive load during EDA
Maintaining visual consistency in data charts
Innovation

Methods, ideas, or system contributions that make the work stand out.

Recommends charts for data flow
Adapts chart encodings dynamically
Maintains visual consistency and effectiveness
🔎 Similar Papers
No similar papers found.
Y
Yuan Tian
Zhejiang University, Zhejiang, China
Dazhen Deng
Dazhen Deng
Zhejiang University
Visual AnalyticsXAIHuman-Computer Interaction
S
Sen Yang
Zhejiang University, Zhejiang, China
H
Huawei Zheng
Zhejiang University, Zhejiang, China
B
Bowen Shi
Zhejiang University, Zhejiang, China
K
Kai Xiong
Zhejiang University, Zhejiang, China
X
Xinjing Yi
Zhejiang University, Zhejiang, China
Yingcai Wu
Yingcai Wu
Professor at the State Key Lab of CAD&CG, Zhejiang University
Visual AnalyticsSports AnalyticsUrban Computing