MAMMAL - Molecular Aligned Multi-Modal Architecture and Language

📅 2024-10-28
🏛️ arXiv.org
📈 Citations: 1
Influential: 0
📄 PDF

career value

184K/year
🤖 AI Summary
Existing biomolecular foundation models are predominantly unimodal—focused solely on proteins or small molecules—thus failing to capture cross-modal biological interactions, which hinders mechanistic understanding of diseases and drug discovery. To address this, we introduce BioM3, the first unified multimodal foundation model jointly encoding proteins, small molecules, and multi-omics data. BioM3 innovatively integrates a molecular alignment architecture with a structured prompt grammar to support classification, regression, and generative tasks. Its architecture employs a Transformer backbone, cross-modal embedding alignment, and a scalar-token hybrid input-output design, while incorporating AlphaFold 3–inspired binding-site prediction capabilities. Evaluated on 11 downstream tasks, BioM3 achieves state-of-the-art performance on 9; notably, it outperforms prior methods on 3 of 4 critical tasks—including antibody–antigen binding affinity prediction. The code and pretrained weights are publicly released.

Technology Category

Application Category

📝 Abstract
Large language models applied to vast biological datasets have the potential to transform biology by uncovering disease mechanisms and accelerating drug development. However, current models are often siloed, trained separately on small-molecules, proteins, or transcriptomic data, limiting their ability to capture complex, multi-modal interactions. Effective drug discovery requires computational tools that integrate multiple biological entities while supporting prediction and generation, a challenge existing models struggle to address. For this purpose, we present MAMMAL - Molecular Aligned Multi-Modal Architecture and Language - a versatile method applied to create a multi-task foundation model that learns from large-scale biological datasets across diverse modalities, including proteins, small-molecules, and omics. MAMMAL's structured prompt syntax supports classification, regression, and generation tasks while handling token and scalar inputs and outputs. Evaluated on eleven diverse downstream tasks, it reaches a new state of the art (SOTA) in nine tasks and is comparable to SOTA in two tasks, all within a unified architecture, unlike prior task-specific models. Additionally, we explored Alphafold 3 binding prediction capabilities on antibody-antigen and nanobody-antigen complexes showing significantly better classification performance of MAMMAL in 3 out of 4 targets. The model code and pretrained weights are publicly available at https://github.com/BiomedSciAI/biomed-multi-alignment and https://huggingface.co/ibm/biomed.omics.bl.sm.ma-ted-458m
Problem

Research questions and friction points this paper is trying to address.

Integrating multi-modal biological data for drug discovery
Overcoming siloed models in small-molecule and protein analysis
Unifying classification, regression, and generation in one architecture
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-modal foundation model integrating diverse biological data
Structured prompt syntax for versatile task handling
State-of-the-art performance in unified architecture
💼 Related Jobs
Postdoctoral Fellow – AI-Driven Multi-Omics Integration for Predictive Toxicology
Pfizer
The annual base salary for this position ranges from $64,600.00 to $107,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 7.5% of the base salary. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
Hybrid
Y
Yoel Shoshan
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
M
Moshiko Raboh
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
Michal Ozery-Flato
Michal Ozery-Flato
Research Staff Member at IBM Research
machine learningdata science
V
Vadim Ratner
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
A
Alex Golts
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
J
Jeffrey K. Weber
IBM TJ Watson Research Center, IBM Research, 1101 Kitchawan Rd., NY, 10598, Yorktown Heights, USA.
Ella Barkan
Ella Barkan
IBM Research
Medical ImagingDocument Processing
S
Simona Rabinovici-Cohen
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
S
Sagi Polaczek
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
I
Ido Amos
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
B
Ben Shapira
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
L
Liam Hazan
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
M
Matan Ninio
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
S
Sivan Ravid
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
M
Michael M. Danziger
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
J
Joseph A. Morrone
IBM TJ Watson Research Center, IBM Research, 1101 Kitchawan Rd., NY, 10598, Yorktown Heights, USA.
P
Parthasarathy Suryanarayanan
IBM TJ Watson Research Center, IBM Research, 1101 Kitchawan Rd., NY, 10598, Yorktown Heights, USA.
Michal Rosen-Zvi
Michal Rosen-Zvi
Director IBM Research
Machine LearningAIHealth Informatics
E
Efrat Hexter
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.