Learning the PTM Code through a Coarse-to-Fine, Mechanism-Aware Framework

📅 2025-10-27
📈 Citations: 0
Influential: 0
📄 PDF

career value

207K/year
🤖 AI Summary
Deciphering the mapping between post-translational modification (PTM) sites and their catalyzing enzymes—termed the PTM “combinatorial code”—remains a central challenge in understanding cellular signaling regulation and disease mechanisms. This work introduces the first mechanism-aware, coarse-to-fine unified framework that jointly models multi-label PTM site prediction and zero-shot enzyme identification, explicitly encoding synergistic or antagonistic syntactic relationships among PTMs. The method integrates evolution-informed protein language model representations, physicochemical priors, and interaction-aware prompting to effectively mitigate the dual long-tail distribution inherent in PTM data. Evaluated on multiple proteome-scale benchmarks, our approach achieves a 122% improvement in site-level F1 score and a 54% gain in zero-shot enzyme assignment accuracy. Moreover, it successfully identifies PTM rewiring events triggered by disease-associated variants.

Technology Category

Application Category

📝 Abstract
Post-translational modifications (PTMs) form a combinatorial "code" that regulates protein function, yet deciphering this code - linking modified sites to their catalytic enzymes - remains a central unsolved problem in understanding cellular signaling and disease. We introduce COMPASS-PTM, a mechanism-aware, coarse-to-fine learning framework that unifies residue-level PTM profiling with enzyme-substrate assignment. COMPASS-PTM integrates evolutionary representations from protein language models with physicochemical priors and a crosstalk-aware prompting mechanism that explicitly models inter-PTM dependencies. This design allows the model to learn biologically coherent patterns of cooperative and antagonistic modifications while addressing the dual long-tail distribution of PTM data. Across multiple proteome-scale benchmarks, COMPASS-PTM establishes new state-of-the-art performance, including a 122% relative F1 improvement in multi-label site prediction and a 54% gain in zero-shot enzyme assignment. Beyond accuracy, the model demonstrates interpretable generalization, recovering canonical kinase motifs and predicting disease-associated PTM rewiring caused by missense variants. By bridging statistical learning with biochemical mechanism, COMPASS-PTM unifies site-level and enzyme-level prediction into a single framework that learns the grammar underlying protein regulation and signaling.
Problem

Research questions and friction points this paper is trying to address.

Deciphering the PTM code linking modified sites to enzymes
Modeling inter-PTM dependencies and long-tail data distributions
Unifying site-level and enzyme-level prediction into one framework
Innovation

Methods, ideas, or system contributions that make the work stand out.

Coarse-to-fine framework integrating residue profiling and enzyme assignment
Protein language models combined with physicochemical priors and crosstalk
Addresses long-tail PTM data distribution for interpretable generalization
🔎 Similar Papers
No similar papers found.
💼 Related Jobs
Postdoctoral Fellow – AI-Driven Multi-Omics Integration for Predictive Toxicology
Pfizer
The annual base salary for this position ranges from $64,600.00 to $107,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 7.5% of the base salary. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
Hybrid
J
Jingjie Zhang
Department of Computer Science and Engineering, The Chinese University of Hong Kong
Hanqun Cao
Hanqun Cao
The Chinese University of Hong Kong
Generative ModelingAI4Science
Z
Zijun Gao
Department of Computer Science and Engineering, The Chinese University of Hong Kong
Y
Yu Wang
School of Computer Science, Peking University
S
Shaoning Li
Department of Computer Science and Engineering, The Chinese University of Hong Kong
J
Jun Xu
Institute of Reproductive Medicine, Medical School, Nantong University
C
Cheng Tan
Department of Computer Science and Engineering, The Chinese University of Hong Kong
J
Jun Zhu
School of Life Sciences, Tsinghua University
Chang-Yu Hsieh
Chang-Yu Hsieh
Zhejiang University
Open Quantum SystemsQuantum SimulationsAI for Science
C
Chunbin Gu
Department of Computer Science and Engineering, The Chinese University of Hong Kong
Pheng Ann Heng
Pheng Ann Heng
Choh-Ming Li Professor of Computer Science and Engineering, The Chinese University of Hong Kong
Medical Image AnalysisSurgical SimulationVisualizationGraphicsVirtual Reality