π€ AI Summary
Existing automated visualization systems struggle with multi-file, complex datasets and iterative refinement requirements, often failing due to oversimplified task decomposition, inadequate error handling, and absence of quality feedback. This paper proposes a large language modelβbased collaborative multi-agent system that decomposes visualization generation into four synergistic modules: metadata parsing, task planning, code generation, and quality-driven self-reflection. By integrating metadata analysis and introspective mechanisms, the system overcomes context-length limitations and enables end-to-end iterative optimization. Its core contribution is a verifiable and correctable multi-agent workflow specifically designed for complex data scenarios. Experiments demonstrate that our approach achieves a 41.5% improvement in composite evaluation scores over strong baselines, significantly enhancing both success rate and output quality for visualization generation under complex conditions.
π Abstract
Deep research has revolutionized data analysis, yet data scientists still devote substantial time to manually crafting visualizations, highlighting the need for robust automation from natural language queries. However, current systems struggle with complex datasets containing multiple files and iterative refinement. Existing approaches, including simple single- or multi-agent systems, often oversimplify the task, focusing on initial query parsing while failing to robustly manage data complexity, code errors, or final visualization quality. In this paper, we reframe this challenge as a collaborative multi-agent problem. We introduce CoDA, a multi-agent system that employs specialized LLM agents for metadata analysis, task planning, code generation, and self-reflection. We formalize this pipeline, demonstrating how metadata-focused analysis bypasses token limits and quality-driven refinement ensures robustness. Extensive evaluations show CoDA achieves substantial gains in the overall score, outperforming competitive baselines by up to 41.5%. This work demonstrates that the future of visualization automation lies not in isolated code generation but in integrated, collaborative agentic workflows.