🤖 AI Summary
This work addresses the entropy-regularized optimal transport (OT) problem by proposing a truncated Newton solver tailored for large-scale, high-dimensional settings. Methodologically, it abandons the conventional Lipschitz Hessian assumption and establishes, for the first time under milder conditions, local quadratic convergence. It further introduces adaptive Hessian approximation and GPU-accelerated parallelization to jointly ensure theoretical rigor and computational efficiency. Innovatively integrating truncated Newton optimization with numerical stabilization mechanisms, the method consistently outperforms mainstream approaches—including Sinkhorn and L-BFGS—across 24 benchmark experiments. It successfully solves high-dimensional OT problems with $n approx 10^6$, achieving high accuracy, low runtime (several times faster than state-of-the-art solvers), and strong robustness. The proposed framework delivers a scalable, reliable, and efficient new paradigm for large-scale OT computation.
📝 Abstract
Developing a contemporary optimal transport (OT) solver requires navigating trade-offs among several critical requirements: GPU parallelization, scalability to high-dimensional problems, theoretical convergence guarantees, empirical performance in terms of precision versus runtime, and numerical stability in practice. With these challenges in mind, we introduce a specialized truncated Newton algorithm for entropic-regularized OT. In addition to proving that locally quadratic convergence is possible without assuming a Lipschitz Hessian, we provide strategies to maximally exploit the high rate of local convergence in practice. Our GPU-parallel algorithm exhibits exceptionally favorable runtime performance, achieving high precision orders of magnitude faster than many existing alternatives. This is evidenced by wall-clock time experiments on 24 problem sets (12 datasets $ imes$ 2 cost functions). The scalability of the algorithm is showcased on an extremely large OT problem with $n approx 10^6$, solved approximately under weak entopric regularization.