PepCompass: Navigating peptide embedding spaces using Riemannian Geometry

📅 2025-10-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Antimicrobial peptide (AMP) discovery faces dual challenges: the vastness of sequence space and the sparsity of active peptides. Existing generative models neglect the decoder-induced embedding manifold structure and rely on Euclidean metrics, leading to geometric distortion and inefficient search. To address this, we propose the first Riemannian-geometry-based navigation framework for AMP design. Our method constructs a κ-stable manifold family to model the intrinsic geometry of the latent space; introduces local geometric-aware second-order Riemannian Brownian motion sampling, tangent-space amino acid enumeration for mutation, and potential-minimizing geodesic search; and integrates Bayesian optimization with property-enhanced geodesic interpolation for directed optimization. Experimentally, our approach identified four high-activity seed peptides and generated 25 novel broad-spectrum AMPs—demonstrating efficacy against multidrug-resistant pathogens—while significantly improving both discovery efficiency and success rate.

Technology Category

Application Category

📝 Abstract
Antimicrobial peptide discovery is challenged by the astronomical size of peptide space and the relative scarcity of active peptides. Generative models provide continuous latent "maps" of peptide space, but conventionally ignore decoder-induced geometry and rely on flat Euclidean metrics, rendering exploration and optimization distorted and inefficient. Prior manifold-based remedies assume fixed intrinsic dimensionality, which critically fails in practice for peptide data. Here, we introduce PepCompass, a geometry-aware framework for peptide exploration and optimization. At its core, we define a Union of $κ$-Stable Riemannian Manifolds $mathbb{M}^κ$, a family of decoder-induced manifolds that captures local geometry while ensuring computational stability. We propose two local exploration methods: Second-Order Riemannian Brownian Efficient Sampling, which provides a convergent second-order approximation to Riemannian Brownian motion, and Mutation Enumeration in Tangent Space, which reinterprets tangent directions as discrete amino-acid substitutions. Combining these yields Local Enumeration Bayesian Optimization (LE-BO), an efficient algorithm for local activity optimization. Finally, we introduce Potential-minimizing Geodesic Search (PoGS), which interpolates between prototype embeddings along property-enriched geodesics, biasing discovery toward seeds, i.e. peptides with favorable activity. In-vitro validation confirms the effectiveness of PepCompass: PoGS yields four novel seeds, and subsequent optimization with LE-BO discovers 25 highly active peptides with broad-spectrum activity, including against resistant bacterial strains. These results demonstrate that geometry-informed exploration provides a powerful new paradigm for antimicrobial peptide design.
Problem

Research questions and friction points this paper is trying to address.

Navigating peptide embedding spaces using Riemannian geometry for efficient exploration
Addressing decoder-induced geometric distortions in peptide latent space representations
Developing geometry-aware optimization methods for discovering novel antimicrobial peptides
Innovation

Methods, ideas, or system contributions that make the work stand out.

Decoder-induced Riemannian manifolds capture local geometry
Second-order Riemannian Brownian motion enables efficient sampling
Tangent space enumeration interprets directions as amino-acid substitutions
🔎 Similar Papers
No similar papers found.
M
Marcin Możejko
Faculty of Mathematics, Informatics and Mechanics, University of Warsaw
A
Adam Bielecki
Faculty of Mathematics, Informatics and Mechanics, University of Warsaw
J
Jurand Prądzyński
Faculty of Mathematics, Informatics and Mechanics, University of Warsaw
M
Marcin Traskowski
Faculty of Mathematics, Informatics and Mechanics, University of Warsaw
A
Antoni Janowski
Faculty of Mathematics, Informatics and Mechanics, University of Warsaw
K
Karol Jurasz
Faculty of Mathematics, Informatics and Mechanics, University of Warsaw
M
Michał Kucharczyk
Faculty of Mathematics, Informatics and Mechanics, University of Warsaw
H
Hyun-Su Lee
University of Pennsylvania
Marcelo Der Torossian Torres
Marcelo Der Torossian Torres
University of Pennsylvania
Peptide ChemistryAntimicrobial PeptidesPeptide Design
C
Cesar de la Fuente-Nunez
University of Pennsylvania
P
Paulina Szymczak
Institute of AI for Health, Hemholtz Center Munich
M
Michał Kmicikiewicz
Institute of AI for Health, Hemholtz Center Munich
Ewa Szczurek
Ewa Szczurek
Associate Professor at University of Warsaw / Institute AI for Health, Helmholtz Zentrum München
computational biologymachine learningartificial intelligence