GistVis: Automatic Generation of Word-scale Visualizations from Data-rich Documents

📅 2025-02-06
🤖 AI Summary
This paper addresses the challenge of quickly grasping data insights in data-rich documents that convey them only through text. It proposes GistVis, an automatic pipeline that generates word-scale visualizations from such documents. The pipeline has four modules: Discoverer, Annotator, and Extractor use large language models (LLMs) to extract and annotate data insights from text, while Visualizer applies visualization design knowledge to produce the visual encoding. Technical evaluation, including a comparative study on Discoverer and an ablation study on Annotator, indicates decent performance. A user study (N=12) suggests that GistVis helps readers understand data-rich documents (+5.6% accuracy) while significantly reducing mental demand (p=0.016) and perceived effort (p=0.033).

📝 Abstract
Data-rich documents are ubiquitous in various applications, yet they often rely solely on textual descriptions to convey data insights. Prior research primarily focused on providing visualization-centric augmentation to data-rich documents. However, few have explored using automatically generated word-scale visualizations to enhance the document-centric reading process. As an exploratory step, we propose GistVis, an automatic pipeline that extracts and visualizes data insight from text descriptions. GistVis decomposes the generation process into four modules: Discoverer, Annotator, Extractor, and Visualizer, with the first three modules utilizing the capabilities of large language models and the fourth using visualization design knowledge. Technical evaluation including a comparative study on Discoverer and an ablation study on Annotator reveals decent performance of GistVis. Meanwhile, the user study (N=12) showed that GistVis could generate satisfactory word-scale visualizations, indicating its effectiveness in facilitating users' understanding of data-rich documents (+5.6% accuracy) while significantly reducing their mental demand (p=0.016) and perceived effort (p=0.033).
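
The four-module decomposition described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the three LLM-backed modules are replaced by simple regex and keyword rules so the example runs offline, and the insight labels, the `Gist` class, the function names, and the text glyphs are all hypothetical names introduced here.

```python
import re
from dataclasses import dataclass

# Hypothetical insight labels for illustration; not taken from the paper.
TREND, PROPORTION, VALUE = "trend", "proportion", "value"
BARS = "▁▂▃▄▅▆▇█"


@dataclass
class Gist:
    text: str             # sentence-level segment produced by the Discoverer
    insight: str = VALUE  # label assigned by the Annotator
    numbers: tuple = ()   # values pulled out by the Extractor


def discoverer(document: str) -> list[Gist]:
    """Split the document into sentence-level segments (stand-in for an LLM call)."""
    parts = re.split(r"(?<=[.!?])\s+", document.strip())
    return [Gist(p) for p in parts if p]


def annotator(gist: Gist) -> Gist:
    """Assign an insight label with keyword rules (stand-in for an LLM call)."""
    lowered = gist.text.lower()
    if "%" in lowered:
        gist.insight = PROPORTION
    elif any(w in lowered for w in ("rose", "fell", "increased", "decreased")):
        gist.insight = TREND
    return gist


def extractor(gist: Gist) -> Gist:
    """Pull numeric values out of the segment (stand-in for an LLM call)."""
    gist.numbers = tuple(float(n) for n in re.findall(r"\d+(?:\.\d+)?", gist.text))
    return gist


def visualizer(gist: Gist) -> str:
    """Map an annotated segment to a tiny text glyph using fixed rules (no LLM)."""
    if not gist.numbers:
        return gist.text
    if gist.insight == PROPORTION:
        # Ten-cell bar, one cell per 10 percentage points.
        filled = round(min(gist.numbers[0], 100.0) / 10)
        glyph = "█" * filled + "░" * (10 - filled)
    elif gist.insight == TREND and len(gist.numbers) > 1:
        # Sparkline scaled between the smallest and largest value.
        lo, hi = min(gist.numbers), max(gist.numbers)
        span = (hi - lo) or 1.0
        glyph = "".join(
            BARS[int((n - lo) / span * (len(BARS) - 1))] for n in gist.numbers
        )
    else:
        return gist.text
    return f"{gist.text} [{glyph}]"


def run_pipeline(document: str) -> list[str]:
    """Chain the four modules in the order the abstract lists them."""
    return [visualizer(extractor(annotator(g))) for g in discoverer(document)]


if __name__ == "__main__":
    sample = "Sales rose from 10 to 40. About 30% of users churned. Nothing else changed."
    for line in run_pipeline(sample):
        print(line)
```

Keeping the Visualizer purely rule-based mirrors the split in the abstract, where only the first three modules rely on language models. In a real system the glyphs would be inline graphics rather than block characters.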
Problem

Research questions and friction points this paper is trying to address.

Automatic word-scale visualization generation
Enhancing data-rich document comprehension
Reducing mental demand in document reading
Innovation

Methods, ideas, or system contributions that make the work stand out.

Automatic word-scale visualizations
Large language model modules
Visualization design integration
Authors
Ruishi Zou
University of California, San Diego
Human-Computer Interaction, Visualization, Human AI Interaction, Machine Learning
Yinqi Tang
Tongji University, Shanghai, China
Jingzhu Chen
Tongji University, Shanghai, China
Siyu Lu
Department of Geography, Texas A&M University
Geo AI, GIS/RS, GeoSpatial AI, GeoSpatial Intelligence, Spatial Dynamics
Yan Lu
Tongji University, Shanghai, China
Yingfan Yang
Tongji University, Shanghai, China
Chen Ye
Tongji University, Shanghai, China