ReactEmbed: A Cross-Domain Framework for Protein-Molecule Representation Learning via Biochemical Reaction Networks

📅 2025-01-30
📈 Citations: 0
Influential: 0
📄 PDF

career value

182K/year
🤖 AI Summary
Existing methods inadequately capture joint protein–ligand representations and fail to model their complex biochemical relationships. Method: We propose the first cross-domain unified embedding framework that explicitly leverages biochemical reaction networks as supervisory signals. It integrates pre-trained protein and molecular representations and employs contrastive learning to align reaction pairs, jointly optimizing dual-modality embeddings in a shared latent space. Contribution/Results: Our approach innovatively structures biochemical reaction relationships as a supervised prior—enabling zero-shot generalization (e.g., blood–brain barrier permeability prediction). It achieves state-of-the-art performance across diverse tasks, including drug–target interaction prediction, protein–protein interaction inference, and molecular/protein property prediction. Crucially, experiments demonstrate successful zero-shot transfer to predicting lipid nanoparticle delivery efficiency—a challenging real-world application—validating strong generalizability and practical utility.

Technology Category

Application Category

📝 Abstract
The challenge in computational biology and drug discovery lies in creating comprehensive representations of proteins and molecules that capture their intrinsic properties and interactions. Traditional methods often focus on unimodal data, such as protein sequences or molecular structures, limiting their ability to capture complex biochemical relationships. This work enhances these representations by integrating biochemical reactions encompassing interactions between molecules and proteins. By leveraging reaction data alongside pre-trained embeddings from state-of-the-art protein and molecule models, we develop ReactEmbed, a novel method that creates a unified embedding space through contrastive learning. We evaluate ReactEmbed across diverse tasks, including drug-target interaction, protein-protein interaction, protein property prediction, and molecular property prediction, consistently surpassing all current state-of-the-art models. Notably, we showcase ReactEmbed's practical utility through successful implementation in lipid nanoparticle-based drug delivery, enabling zero-shot prediction of blood-brain barrier permeability for protein-nanoparticle complexes. The code and comprehensive database of reaction pairs are available for open use at href{https://github.com/amitaysicherman/ReactEmbed}{GitHub}.
Problem

Research questions and friction points this paper is trying to address.

Protein-small molecule interactions
Biochemistry
Drug discovery
Innovation

Methods, ideas, or system contributions that make the work stand out.

ReactEmbed
Biochemistry Integration
Predictive Accuracy
💼 Related Jobs
Postdoctoral Fellow – AI-Driven Multi-Omics Integration for Predictive Toxicology
Pfizer
The annual base salary for this position ranges from $64,600.00 to $107,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 7.5% of the base salary. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
Hybrid