InstructPipe: Generating Visual Blocks Pipelines with Human Instructions and LLMs

📅 2023-12-15
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
High entry barriers for beginners learning machine learning (ML) and the substantial development cost of visual programming environments hinder widespread adoption. To address these challenges, this paper proposes VisuPipe—a low-code, visualization-oriented ML pipeline generation framework powered by dual large language models (LLMs). Its core innovation lies in a novel “pseudocode generation + executable node-graph rendering” two-module paradigm, enabling natural-language-driven, AI-human collaborative pipeline construction and iterative refinement. VisuPipe tightly integrates an LLM, a code interpreter, and dynamic visualization rendering to achieve end-to-end translation from textual specifications to executable, interactive node graphs. A user study (N=16) demonstrates that VisuPipe significantly reduces the initial learning curve—cutting average modeling time by 62%—enhances pipeline construction efficiency, and fosters innovative modeling approaches.
📝 Abstract
Visual programming has the potential of providing novice programmers with a low-code experience to build customized processing pipelines. Existing systems typically require users to build pipelines from scratch, implying that novice users are expected to set up and link appropriate nodes from a blank workspace. In this paper, we introduce InstructPipe, an AI assistant for prototyping machine learning (ML) pipelines with text instructions. We contribute two large language model (LLM) modules and a code interpreter as part of our framework. The LLM modules generate pseudocode for a target pipeline, and the interpreter renders the pipeline in the node-graph editor for further human-AI collaboration. Both technical and user evaluation (N=16) shows that InstructPipe empowers users to streamline their ML pipeline workflow, reduce their learning curve, and leverage open-ended commands to spark innovative ideas.
Problem

Research questions and friction points this paper is trying to address.

Simplifying ML pipeline creation for novice programmers
Reducing learning curve with AI-generated pseudocode
Enhancing human-AI collaboration in visual programming
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM modules generate pseudocode
Code interpreter renders pipelines
Streamlines ML workflow with text instructions
🔎 Similar Papers
No similar papers found.
Z
Zhongyi Zhou
The University of Tokyo, Japan
J
Jing Jin
Google Research, USA
Vrushank Phadnis
Vrushank Phadnis
Google
CADVR/AR
X
Xiuxiu Yuan
Google Research, USA
J
Jun Jiang
Google Research, USA
Xun Qian
Xun Qian
Google
Human-Computer InteractionAugmented RealityExtended RealityHuman-AI Interaction
K
Kristen Wright
Google Research, USA
M
Mark Sherwood
Google Research, USA
J
Jason Mayes
Google Research, USA
J
Jingtao Zhou
Google Research, USA
Y
Yiyi Huang
Google Research, USA
Z
Zheng Xu
Google Research, USA
Yinda Zhang
Yinda Zhang
Google Research
Computer VisionComputer GraphicsDeep LearningScene UnderstandingDigital Human
J
Johnny Lee
Google Research, USA
Alex Olwal
Alex Olwal
Research & Engineering Leader / Product Innovation for AI and AR
Human-AI InteractionMultimodal InterfacesAugmented RealityHuman-Computer Interaction
D
David Kim
Google Research, USA
R
Ram Iyengar
Google Research, USA
N
Na Li
Google Research, USA
Ruofei Du
Ruofei Du
Interactive Perception & Graphics Lead / Manager, Google XR
AI + XRInteractive PerceptionInteractive GraphicsHuman Computer InteractionWebAI + WebXR