A Truncated Newton Method for Optimal Transport

📅 2025-04-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the entropy-regularized optimal transport (OT) problem by proposing a truncated Newton solver tailored for large-scale, high-dimensional settings. Methodologically, it abandons the conventional Lipschitz Hessian assumption and establishes, for the first time under milder conditions, local quadratic convergence. It further introduces adaptive Hessian approximation and GPU-accelerated parallelization to jointly ensure theoretical rigor and computational efficiency. Innovatively integrating truncated Newton optimization with numerical stabilization mechanisms, the method consistently outperforms mainstream approaches—including Sinkhorn and L-BFGS—across 24 benchmark experiments. It successfully solves high-dimensional OT problems with $n approx 10^6$, achieving high accuracy, low runtime (several times faster than state-of-the-art solvers), and strong robustness. The proposed framework delivers a scalable, reliable, and efficient new paradigm for large-scale OT computation.

Technology Category

Application Category

📝 Abstract
Developing a contemporary optimal transport (OT) solver requires navigating trade-offs among several critical requirements: GPU parallelization, scalability to high-dimensional problems, theoretical convergence guarantees, empirical performance in terms of precision versus runtime, and numerical stability in practice. With these challenges in mind, we introduce a specialized truncated Newton algorithm for entropic-regularized OT. In addition to proving that locally quadratic convergence is possible without assuming a Lipschitz Hessian, we provide strategies to maximally exploit the high rate of local convergence in practice. Our GPU-parallel algorithm exhibits exceptionally favorable runtime performance, achieving high precision orders of magnitude faster than many existing alternatives. This is evidenced by wall-clock time experiments on 24 problem sets (12 datasets $ imes$ 2 cost functions). The scalability of the algorithm is showcased on an extremely large OT problem with $n approx 10^6$, solved approximately under weak entopric regularization.
Problem

Research questions and friction points this paper is trying to address.

Develops truncated Newton method for optimal transport problems
Addresses GPU parallelization and high-dimensional scalability challenges
Ensures fast convergence and numerical stability in practice
Innovation

Methods, ideas, or system contributions that make the work stand out.

Truncated Newton method for OT
GPU-parallel algorithm with scalability
Quadratic convergence without Lipschitz Hessian
🔎 Similar Papers
No similar papers found.