EchoReview: Learning Peer Review from the Echoes of Scientific Citations

๐Ÿ“… 2026-01-31
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This study addresses the growing strain on traditional peer review caused by the surge in scientific submissions and the limitations of existing automated approaches, which are hindered by the scarcity and subjectivity of human-generated review data. To overcome these challenges, we propose EchoReview, a novel framework that introduces a citation-context-based paradigm for synthesizing peer review data. By extracting implicit evaluative signals from academic citations, we construct EchoReview-16Kโ€”the first large-scale, cross-conference, and cross-year citation-driven review datasetโ€”and fine-tune a 7B-parameter large language model to obtain EchoReviewer-7B. Experimental results demonstrate that our model consistently and significantly outperforms current methods on key dimensions such as evidential support and comprehensiveness, thereby validating citation context as a high-quality data source for automated peer review.

Technology Category

Application Category

๐Ÿ“ Abstract
As the volume of scientific submissions continues to grow rapidly, traditional peer review systems are facing unprecedented scalability pressures, highlighting the urgent need for automated reviewing methods that are both scalable and reliable. Existing supervised fine-tuning approaches based on real review data are fundamentally constrained by single-source of data as well as the inherent subjectivity and inconsistency of human reviews, limiting their ability to support high-quality automated reviewers. To address these issues, we propose EchoReview, a citation-context-driven data synthesis framework that systematically mines implicit collective evaluative signals from academic citations and transforms scientific community's long-term judgments into structured review-style data. Based on this pipeline, we construct EchoReview-16K, the first large-scale, cross-conference, and cross-year citation-driven review dataset, and train an automated reviewer, EchoReviewer-7B. Experimental results demonstrate that EchoReviewer-7B can achieve significant and stable improvements on core review dimensions such as evidence support and review comprehensiveness, validating citation context as a robust and effective data paradigm for reliable automated peer review.
Problem

Research questions and friction points this paper is trying to address.

peer review
automated reviewing
citation context
scalability
review quality
Innovation

Methods, ideas, or system contributions that make the work stand out.

citation-context-driven
data synthesis
automated peer review
collective evaluative signals
large language model
๐Ÿ”Ž Similar Papers
No similar papers found.
Yinuo Zhang
Yinuo Zhang
PhD student, DUKE-NUS Medical School
ProteinPeptidesBiologyDeep Learning
D
Dingcheng Huang
Harbin Institute of Technology, China
H
Haifeng Suo
Harbin Institute of Technology, China
Yizhuo Li
Yizhuo Li
The University of Hong Kong
Z
Ziya Zhao
Harbin Institute of Technology, China
J
Junhao Xu
Harbin Institute of Technology, China
Zhiying Tu
Zhiying Tu
Harbin Institute of Technology
software engineering
D
Dianhui Chu
Harbin Institute of Technology, China
D
Deming Zhai
Harbin Institute of Technology, China
X
Xianming Liu
Harbin Institute of Technology, China
X
Xiaoyan Yu
Beijing Institute of Technology, China
Dianbo Sui
Dianbo Sui
Harbin Institute of Technology