Story2Proposal: A Scaffold for Structured Scientific Paper Writing

📅 2026-03-27
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenges of structural inconsistency, missing content, and cross-section incoherence commonly encountered in automatically generated scientific papers, particularly between narrative text, experimental evidence, and visual elements. To resolve these issues, the authors propose a multi-agent collaborative framework grounded in a persistent shared visual contract, comprising architect, writer, optimizer, renderer, and evaluator agents. These agents operate within a generate–evaluate–adapt loop that dynamically updates the contract to align textual structure with visual components throughout the document. The approach introduces, for the first time, a contract-driven mechanism to regulate multi-agent collaboration. Evaluated on the Jericho corpus, the method achieves an expert rating of 6.145, significantly outperforming DirectChat (3.963) and Fars (5.197), thereby demonstrating substantial improvements in both structural coherence and text–figure consistency.
📝 Abstract
Generating scientific manuscripts requires maintaining alignment between narrative reasoning, experimental evidence, and visual artifacts across the document lifecycle. Existing language-model generation pipelines rely on unconstrained text synthesis with validation applied only after generation, often producing structural drift, missing figures or tables, and cross-section inconsistencies. We introduce Story2Proposal, a contract-governed multi-agent framework that converts a research story into a structured manuscript through coordinated agents operating under a persistent shared visual contract. The system organizes architect, writer, refiner, and renderer agents around a contract state that tracks section structure and registered visual elements, while evaluation agents supply feedback in a generate evaluate adapt loop that updates the contract during generation. Experiments on tasks derived from the Jericho research corpus show that Story2Proposal achieved an expert evaluation score of 6.145 versus 3.963 for DirectChat (+2.182) across GPT, Claude, Gemini, and Qwen backbones. Compared with the structured generation baseline Fars, Story2Proposal obtained an average score of 5.705 versus 5.197, indicating improved structural consistency and visual alignment.
Problem

Research questions and friction points this paper is trying to address.

scientific manuscript generation
structural consistency
visual alignment
narrative reasoning
cross-section inconsistency
Innovation

Methods, ideas, or system contributions that make the work stand out.

multi-agent framework
visual contract
structured generation
scientific writing
generate-evaluate-adapt loop
🔎 Similar Papers
No similar papers found.
Z
Zhuoyang Qian
AgentAlpha Team
W
Wei Shi
AgentAlpha Team
X
Xu Lin
AgentAlpha Team
Li Ling
Li Ling
KTH - Royal Institute of Technology
computer visiondeep learningroboticsautonomous navigation
Meng Luo
Meng Luo
National University of Singapore
Human-Centered AIMultimodal UnderstandingMultimodal Reasoning
Z
Ziming Wang
AgentAlpha Team
T
Tengyue Xu
AgentAlpha Team
G
Gaoge Liu
AgentAlpha Team
Z
Zhentao Zhang
AgentAlpha Team
S
Shuo Zhang
AgentAlpha Team
Z
Ziqi Wang
AgentAlpha Team
Z
Zheng Feng
AgentAlpha Team
Y
Yan Luo
AgentAlpha Team
S
Shu Xu
AgentAlpha Team
Y
Yongjin Chen
AgentAlpha Team
Z
Zhiwei Zhang
AgentAlpha Team
Z
Zhibo Feng
AgentAlpha Team
Z
Zhuo Chen
AgentAlpha Team
B
Bruce Yuan
AgentAlpha Team
B
Biao Wu
AgentAlpha Team
H
Harry Wang
AgentAlpha Team
K
Kris Chen
AgentAlpha Team