ProtRLSearch: A Multi-Round Multimodal Protein Search Agent with Large Language Models Trained via Reinforcement Learning

📅 2026-03-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work proposes the first protein search agent that supports multi-turn, multimodal interaction, addressing the limitations of existing approaches that are typically confined to single-round, text-only queries and struggle to integrate sequence information or exert fine-grained control over the search process—often leading to reasoning drift. By incorporating protein sequences as a multimodal input into large language model–driven search decisions and introducing a multidimensional reward mechanism, the agent enables reinforcement learning–based supervision over the entire search trajectory. Evaluated on ProtMCQs—a newly curated benchmark comprising 3,000 multiple-choice questions spanning varying difficulty levels—the method demonstrates significant improvements in reasoning accuracy and controllability on complex tasks involving protein function, phenotypic effects, and pathway networks.

Technology Category

Application Category

📝 Abstract
Protein analysis tasks arising in healthcare settings often require accurate reasoning under protein sequence constraints, involving tasks such as functional interpretation of disease-related variants, protein-level analysis for clinical research, and similar scenarios. To address such tasks, search agents are introduced to search protein-related information, providing support for disease-related variant analysis and protein function reasoning in protein-centric inference. However, such search agents are mostly limited to single-round, text-only modality search, which prevents the protein sequence modality from being incorporated as a multimodal input into the search decision-making process. Meanwhile, their reliance on reinforcement learning (RL) supervision that focuses solely on the final answer results in a lack of search process constraints, making deviations in keyword selection and reasoning directions difficult to identify and correct in a timely manner. To address these limitations, we propose ProtRLSearch, a multi-round protein search agent trained with multi-dimensional reward based RL, which jointly leverages protein sequence and text as multimodal inputs during real-time search to produce high quality reports. To evaluate the ability of models to integrate protein sequence information and text-based multimodal inputs in realistic protein query settings, we construct ProtMCQs, a benchmark of 3,000 multiple choice questions (MCQs) organized into three difficulty levels. The benchmark evaluates protein query tasks that range from sequence constrained reasoning about protein function and phenotype changes to comprehensive protein reasoning that integrates multi-dimensional sequence features with signal pathways and regulatory networks.
Problem

Research questions and friction points this paper is trying to address.

protein search
multimodal input
reinforcement learning
search agent
sequence-constrained reasoning
Innovation

Methods, ideas, or system contributions that make the work stand out.

multimodal protein search
reinforcement learning
multi-round reasoning
protein sequence integration
search agent
🔎 Similar Papers
No similar papers found.
C
Congying Liu
University of Chinese Academy of Sciences, Beijing, China
T
Taihao Li
Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China
M
Ming Huang
Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
X
Xingyuan Wei
University of Chinese Academy of Sciences, Beijing, China; Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
P
Peipei Liu
University of Chinese Academy of Sciences, Beijing, China; Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
Yiqing Shen
Yiqing Shen
Johns Hopkins
Y
Yanxu Mao
Henan University, Kaifeng, China
T
Tiehan Cui
Henan University, Kaifeng, China