Automatic Paper Reviewing with Heterogeneous Graph Reasoning over LLM-Simulated Reviewer-Author Debates

📅 2025-11-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing automated paper review methods rely either on shallow textual features or black-box large language models (LLMs), leading to hallucination, scoring bias, and an inability to model the dynamic nature of peer review. This paper proposes the first multi-agent debate-based review simulation framework: it orchestrates iterative argumentative interactions between LLM-driven reviewers and authors, explicitly encoding logical relations among claims—such as support, rebuttal, and clarification—as typed edges in a heterogeneous graph, and employs a heterogeneous graph neural network for structured reasoning. The core innovation lies in formalizing peer review as an interpretable graph reasoning task, jointly capturing semantic depth and interactive dynamics. Evaluated on three benchmark datasets, our method achieves an average relative improvement of 15.73% over state-of-the-art baselines.

Technology Category

Application Category

📝 Abstract
Existing paper review methods often rely on superficial manuscript features or directly on large language models (LLMs), which are prone to hallucinations, biased scoring, and limited reasoning capabilities. Moreover, these methods often fail to capture the complex argumentative reasoning and negotiation dynamics inherent in reviewer-author interactions. To address these limitations, we propose ReViewGraph (Reviewer-Author Debates Graph Reasoner), a novel framework that performs heterogeneous graph reasoning over LLM-simulated multi-round reviewer-author debates. In our approach, reviewer-author exchanges are simulated through LLM-based multi-agent collaboration. Diverse opinion relations (e.g., acceptance, rejection, clarification, and compromise) are then explicitly extracted and encoded as typed edges within a heterogeneous interaction graph. By applying graph neural networks to reason over these structured debate graphs, ReViewGraph captures fine-grained argumentative dynamics and enables more informed review decisions. Extensive experiments on three datasets demonstrate that ReViewGraph outperforms strong baselines with an average relative improvement of 15.73%, underscoring the value of modeling detailed reviewer-author debate structures.
Problem

Research questions and friction points this paper is trying to address.

Existing review methods rely on superficial features causing biased scoring
Current approaches fail to capture complex reviewer-author argumentative dynamics
LLM-based methods suffer from hallucinations and limited reasoning capabilities
Innovation

Methods, ideas, or system contributions that make the work stand out.

Simulates multi-round reviewer-author debates using LLMs
Models opinion relations as typed edges in graphs
Applies graph neural networks for structured debate reasoning
🔎 Similar Papers
Shuaimin Li
Shuaimin Li
Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Natural language processingTabular data visualization
L
Liyang Fan
College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China
Y
Yufang Lin
Software Engineering Institute, East China Normal University, Shanghai, China
Zeyang Li
Zeyang Li
KTH Royal Institute of Technology
channel modelingwireless sensingreconfigurable intelligent surface-aided networksmillimeter wa
X
Xian Wei
Software Engineering Institute, East China Normal University, Shanghai, China
S
Shiwen Ni
Shenzhen Key Laboratory for High Performance Data Mining, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
Hamid Alinejad-Rokny
Hamid Alinejad-Rokny
ARC DECRA & UNSW Scientia Fellow, Head of BioMedical Machine Learning Lab
BioMedical Machine LearningMachine Learning for HealthMedical Artificial IntelligenceLLMs
Min Yang
Min Yang
Bytedance
Vision Language ModelComputer VisionVideo Understanding