AIR: Complex Instruction Generation via Automatic Iterative Refinement

📅 2025-02-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenge of large language models (LLMs) unreliably following complex, multi-step, context-sensitive instructions with multiple constraints, this paper proposes the first document-driven, automated iterative refinement framework. Our method leverages web documents as authoritative sources to generate initial constrained instructions and introduces an LLM-as-judge mechanism that iteratively compares model outputs against source documents, dynamically injecting fidelity, structural coherence, and diversity constraints. We pioneer explicit constraint modeling and structured instruction synthesis techniques. We curate AIR-10K, a high-quality benchmark comprising 10,000 complex instructions derived from real-world documents. Extensive evaluation across multiple benchmarks demonstrates substantial improvements over state-of-the-art methods, significantly enhancing LLMs’ capability to faithfully execute intricate, multi-constraint, context-dependent instructions.

Technology Category

Application Category

📝 Abstract
With the development of large language models, their ability to follow simple instructions has significantly improved. However, adhering to complex instructions remains a major challenge. Current approaches to generating complex instructions are often irrelevant to the current instruction requirements or suffer from limited scalability and diversity. Moreover, methods such as back-translation, while effective for simple instruction generation, fail to leverage the rich contents and structures in large web corpora. In this paper, we propose a novel automatic iterative refinement framework to generate complex instructions with constraints, which not only better reflects the requirements of real scenarios but also significantly enhances LLMs' ability to follow complex instructions. The AIR framework consists of two stages: (1)Generate an initial instruction from a document; (2)Iteratively refine instructions with LLM-as-judge guidance by comparing the model's output with the document to incorporate valuable constraints. Finally, we construct the AIR-10K dataset with 10K complex instructions and demonstrate that instructions generated with our approach significantly improve the model's ability to follow complex instructions, outperforming existing methods for instruction generation.
Problem

Research questions and friction points this paper is trying to address.

Generates complex instruction with constraints
Improves LLMs' ability to follow instructions
Enhances scalability and diversity of instructions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Automatic iterative refinement framework
LLM-as-judge guidance
AIR-10K dataset construction
🔎 Similar Papers
No similar papers found.
W
Wei Liu
Alibaba Group
Yancheng He
Yancheng He
Alibaba Group
LLM
H
Hui Huang
Alibaba Group
Chengwei Hu
Chengwei Hu
Fudan University
NLPKG
J
Jiaheng Liu
Alibaba Group
Shilong Li
Shilong Li
University of California, Irvine
Software EngineeringAutonomous Driving Systems
W
Wenbo Su
Alibaba Group
B
Bo Zheng
Alibaba Group