Multi-Agent Procedural Graph Extraction with Structural and Logical Refinement

📅 2026-01-27

📈 Citations: 0

✨ Influential: 0

career value

174K/year

🤖 AI Summary

Automatically extracting structurally valid and logically consistent flowcharts from natural language remains challenging, as current large language models often produce diagrams with structural errors or semantic misalignments. This work proposes a multi-agent collaborative framework that integrates a graph constructor, a structure simulator, and a semantic aligner to iteratively refine flowcharts through three stages—graph construction, structural feedback, and logical feedback—driven solely by natural language feedback, without requiring supervision or parameter updates. The approach enables interpretable and controllable flowchart refinement. Experimental results demonstrate that the method significantly outperforms strong baselines on metrics of structural correctness and logical consistency, confirming its effectiveness for automatic flowchart extraction from natural language.

Technology Category

Application Category

📝 Abstract

Automatically extracting workflows as procedural graphs from natural language is promising yet underexplored, demanding both structural validity and logical alignment. While recent large language models (LLMs) show potential for procedural graph extraction, they often produce ill-formed structures or misinterpret logical flows. We present \model{}, a multi-agent framework that formulates procedural graph extraction as a multi-round reasoning process with dedicated structural and logical refinement. The framework iterates through three stages: (1) a graph extraction phase with the graph builder agent, (2) a structural feedback phase in which a simulation agent diagnoses and explains structural defects, and (3) a logical feedback phase in which a semantic agent aligns semantics between flow logic and linguistic cues in the source text. Important feedback is prioritized and expressed in natural language, which is injected into subsequent prompts, enabling interpretable and controllable refinement. This modular design allows agents to target distinct error types without supervision or parameter updates. Experiments demonstrate that \model{} achieves substantial improvements in both structural correctness and logical consistency over strong baselines.

Problem

Research questions and friction points this paper is trying to address.

procedural graph extraction

structural validity

logical alignment

multi-agent framework

natural language processing

Innovation

Methods, ideas, or system contributions that make the work stand out.

multi-agent framework

procedural graph extraction

structural refinement