Displacement-Sparse Neural Optimal Transport

📅 2025-02-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Optimal transport (OT) methods often suffer from poor interpretability of learned transport maps and ambiguous displacement structures. Method: This paper proposes a displacement-sparse neural optimal transport framework. Its core innovation is the first formulation of the displacement vector Δ(x) = T(x) − x as an interpretable regularization target, enforced via ℓ₁/ℓ₀ sparsity constraints. We design a dual-path sparsity control strategy—dynamic parameter adaptation in low-dimensional subspaces and explicit dimension-wise sparsity constraints in high dimensions—and integrate input-convex neural networks (ICNNs) to enable minimax optimization of the Wasserstein distance. Results: Evaluated on synthetic single-cell RNA-seq data and real-world 4i cellular perturbation datasets, our method significantly improves transport feasibility and biological interpretability, outperforming state-of-the-art neural OT approaches.

Technology Category

Application Category

📝 Abstract
Optimal Transport (OT) theory seeks to determine the map $T:X o Y$ that transports a source measure $P$ to a target measure $Q$, minimizing the cost $c(mathbf{x}, T(mathbf{x}))$ between $mathbf{x}$ and its image $T(mathbf{x})$. Building upon the Input Convex Neural Network OT solver and incorporating the concept of displacement-sparse maps, we introduce a sparsity penalty into the minimax Wasserstein formulation, promote sparsity in displacement vectors $Delta(mathbf{x}) := T(mathbf{x}) - mathbf{x}$, and enhance the interpretability of the resulting map. However, increasing sparsity often reduces feasibility, causing $T_{#}(P)$ to deviate more significantly from the target measure. In low-dimensional settings, we propose a heuristic framework to balance the trade-off between sparsity and feasibility by dynamically adjusting the sparsity intensity parameter during training. For high-dimensional settings, we directly constrain the dimensionality of displacement vectors by enforcing $dim(Delta(mathbf{x})) leq l$, where $l<d$ for $X subseteq mathbb{R}^d$. Among maps satisfying this constraint, we aim to identify the most feasible one. This goal can be effectively achieved by adapting our low-dimensional heuristic framework without resorting to dimensionality reduction. We validate our method on both synthesized sc-RNA and real 4i cell perturbation datasets, demonstrating improvements over existing methods.
Problem

Research questions and friction points this paper is trying to address.

Minimize cost in Optimal Transport maps
Balance sparsity and feasibility trade-off
Enhance interpretability of transport maps
Innovation

Methods, ideas, or system contributions that make the work stand out.

Sparsity penalty in OT
Dynamic sparsity adjustment
Dimensionality constraint enforcement