MAMMAL - Molecular Aligned Multi-Modal Architecture and Language

📅 2024-10-28
🏛️ arXiv.org
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
Existing biomolecular foundation models are predominantly unimodal—focused solely on proteins or small molecules—thus failing to capture cross-modal biological interactions, which hinders mechanistic understanding of diseases and drug discovery. To address this, we introduce BioM3, the first unified multimodal foundation model jointly encoding proteins, small molecules, and multi-omics data. BioM3 innovatively integrates a molecular alignment architecture with a structured prompt grammar to support classification, regression, and generative tasks. Its architecture employs a Transformer backbone, cross-modal embedding alignment, and a scalar-token hybrid input-output design, while incorporating AlphaFold 3–inspired binding-site prediction capabilities. Evaluated on 11 downstream tasks, BioM3 achieves state-of-the-art performance on 9; notably, it outperforms prior methods on 3 of 4 critical tasks—including antibody–antigen binding affinity prediction. The code and pretrained weights are publicly released.

Technology Category

Application Category

📝 Abstract
Large language models applied to vast biological datasets have the potential to transform biology by uncovering disease mechanisms and accelerating drug development. However, current models are often siloed, trained separately on small-molecules, proteins, or transcriptomic data, limiting their ability to capture complex, multi-modal interactions. Effective drug discovery requires computational tools that integrate multiple biological entities while supporting prediction and generation, a challenge existing models struggle to address. For this purpose, we present MAMMAL - Molecular Aligned Multi-Modal Architecture and Language - a versatile method applied to create a multi-task foundation model that learns from large-scale biological datasets across diverse modalities, including proteins, small-molecules, and omics. MAMMAL's structured prompt syntax supports classification, regression, and generation tasks while handling token and scalar inputs and outputs. Evaluated on eleven diverse downstream tasks, it reaches a new state of the art (SOTA) in nine tasks and is comparable to SOTA in two tasks, all within a unified architecture, unlike prior task-specific models. Additionally, we explored Alphafold 3 binding prediction capabilities on antibody-antigen and nanobody-antigen complexes showing significantly better classification performance of MAMMAL in 3 out of 4 targets. The model code and pretrained weights are publicly available at https://github.com/BiomedSciAI/biomed-multi-alignment and https://huggingface.co/ibm/biomed.omics.bl.sm.ma-ted-458m
Problem

Research questions and friction points this paper is trying to address.

Integrating multi-modal biological data for drug discovery
Overcoming siloed models in small-molecule and protein analysis
Unifying classification, regression, and generation in one architecture
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-modal foundation model integrating diverse biological data
Structured prompt syntax for versatile task handling
State-of-the-art performance in unified architecture
Y
Yoel Shoshan
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
M
Moshiko Raboh
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
Michal Ozery-Flato
Michal Ozery-Flato
Research Staff Member at IBM Research
machine learningdata science
V
Vadim Ratner
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
A
Alex Golts
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
J
Jeffrey K. Weber
IBM TJ Watson Research Center, IBM Research, 1101 Kitchawan Rd., NY, 10598, Yorktown Heights, USA.
Ella Barkan
Ella Barkan
IBM Research
Medical ImagingDocument Processing
S
Simona Rabinovici-Cohen
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
S
Sagi Polaczek
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
I
Ido Amos
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
B
Ben Shapira
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
L
Liam Hazan
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
M
Matan Ninio
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
S
Sivan Ravid
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
M
Michael M. Danziger
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.
J
Joseph A. Morrone
IBM TJ Watson Research Center, IBM Research, 1101 Kitchawan Rd., NY, 10598, Yorktown Heights, USA.
P
Parthasarathy Suryanarayanan
IBM TJ Watson Research Center, IBM Research, 1101 Kitchawan Rd., NY, 10598, Yorktown Heights, USA.
Michal Rosen-Zvi
Michal Rosen-Zvi
Director IBM Research
Machine LearningAIHealth Informatics
E
Efrat Hexter
IBM Research Labs, IBM Research, Haifa, 3498825, Israel.