PRING: Rethinking Protein-Protein Interaction Prediction from Pairs to Graphs

📅 2025-07-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing PPI prediction methods predominantly focus on pairwise interaction classification, neglecting their capacity to reconstruct biologically plausible protein–protein interaction (PPI) networks. To address this gap, we introduce PRING—the first graph-level PPI prediction benchmark—comprising a high-quality, multi-species dataset of 21,484 proteins and 186,818 interactions. PRING establishes a novel dual-dimensional evaluation framework integrating topological structure and functional semantics, enabling assessment across network reconstruction, module detection, and disease mechanism inference. We systematically evaluate four major method classes—sequence similarity, naïve sequence models, protein language models, and structure-based predictors—under strict data-leakage mitigation protocols. Results reveal substantial limitations in current models’ ability to recover both the structural integrity and functional coherence of ground-truth PPI networks. PRING provides a reproducible, multi-level evaluation standard for PPI model development, advancing accurate functional annotation and mechanistic interpretation.

Technology Category

Application Category

📝 Abstract
Deep learning-based computational methods have achieved promising results in predicting protein-protein interactions (PPIs). However, existing benchmarks predominantly focus on isolated pairwise evaluations, overlooking a model's capability to reconstruct biologically meaningful PPI networks, which is crucial for biology research. To address this gap, we introduce PRING, the first comprehensive benchmark that evaluates protein-protein interaction prediction from a graph-level perspective. PRING curates a high-quality, multi-species PPI network dataset comprising 21,484 proteins and 186,818 interactions, with well-designed strategies to address both data redundancy and leakage. Building on this golden-standard dataset, we establish two complementary evaluation paradigms: (1) topology-oriented tasks, which assess intra and cross-species PPI network construction, and (2) function-oriented tasks, including protein complex pathway prediction, GO module analysis, and essential protein justification. These evaluations not only reflect the model's capability to understand the network topology but also facilitate protein function annotation, biological module detection, and even disease mechanism analysis. Extensive experiments on four representative model categories, consisting of sequence similarity-based, naive sequence-based, protein language model-based, and structure-based approaches, demonstrate that current PPI models have potential limitations in recovering both structural and functional properties of PPI networks, highlighting the gap in supporting real-world biological applications. We believe PRING provides a reliable platform to guide the development of more effective PPI prediction models for the community. The dataset and source code of PRING are available at https://github.com/SophieSarceau/PRING.
Problem

Research questions and friction points this paper is trying to address.

Evaluates PPI prediction from graph-level, not just pairs
Assesses network topology and functional biological tasks
Identifies limitations in current PPI models for real-world use
Innovation

Methods, ideas, or system contributions that make the work stand out.

PRING introduces graph-level PPI prediction benchmark
Uses multi-species dataset with redundancy control
Combines topology and function-oriented evaluation paradigms
🔎 Similar Papers
No similar papers found.
Xinzhe Zheng
Xinzhe Zheng
National University of Singapore
AI for BiomedicineAI for Science
Hao Du
Hao Du
ByteDance
Computer VisionMachine Learning
F
Fanding Xu
National University of Singapore, Xi’an Jiaotong University
Jinzhe Li
Jinzhe Li
Fudan University & Shanghai AI Lab
AI4ScienceMulti-Modal
Z
Zhiyuan Liu
National University of Singapore
Wenkang Wang
Wenkang Wang
Central South University, PhD candidate
BioinformaticsDeep Learning
T
Tao Chen
Fudan University
W
Wanli Ouyang
Shanghai Artificial Intelligence Laboratory, The Chinese University of Hong Kong
S
Stan Z. Li
Westlake University
Y
Yan Lu
Shanghai Artificial Intelligence Laboratory, The Chinese University of Hong Kong
Nanqing Dong
Nanqing Dong
Shanghai Artificial Intelligence Laboratory; University of Oxford
Machine LearningComputer VisionOptimizationAI for Science
Y
Yang Zhang
National University of Singapore