🤖 AI Summary
The IPARC Challenge comprises 600 anti-automated-solver synthetic image-programming synthesis tasks, designed to evaluate models’ structured programming proficiency across sequence, selection, and iteration control structures—yet no prior automated method has systematically solved all task categories. This paper proposes an LLM-driven structured inductive programming framework featuring five core mechanisms: prior structural guidance, human-in-the-loop refinement, correct-code freezing, code reuse for efficiency, and LLM-mediated human creativity elicitation. It achieves the first complete coverage of all IPARC task classes. Experiments demonstrate substantial improvements in synthesis reliability, interpretability, and reproducibility. Furthermore, the work distills a transferable human-AI collaborative programming paradigm, providing systematic empirical evidence on both the capability boundaries of LLMs in complex code generation and viable pathways for their augmentation.
📝 Abstract
The IPARC Challenge, inspired by ARC, provides controlled program synthesis tasks over synthetic images to evaluate automatic program construction, focusing on sequence, selection, and iteration. This set of 600 tasks has resisted automated solutions. This paper presents a structured inductive programming approach with LLMs that successfully solves tasks across all IPARC categories. The controlled nature of IPARC reveals insights into LLM-based code generation, including the importance of prior structuring, LLMs' ability to aid structuring (requiring human refinement), the need to freeze correct code, the efficiency of code reuse, and how LLM-generated code can spark human creativity. These findings suggest valuable mechanisms for human-LLM collaboration in tackling complex program synthesis.