🤖 AI Summary
To address the challenge of AI’s inability to interpret and programmatically utilize flowchart images, this paper proposes the first end-to-end framework mapping flowchart images to executable, code-level semantic graphs. Methodologically, it introduces a novel symbolic-geometric joint modeling mechanism to achieve layout-invariant semantic parsing; integrates OCR, graph neural networks, and vision-language pre-trained models; and incorporates a rule-guided syntactic repair module to ensure structural correctness and compilability of generated logic. Evaluated on our newly constructed benchmark FlowChartBench, the framework achieves 92.3% node recognition accuracy and 86.7% edge relation recall, and generates Python-compliant pseudocode. This work establishes a new paradigm for programmatic semantic understanding of unstructured diagrammatic data.