Level Set Teleportation: An Optimization Perspective

📅 2024-03-05
🏛️ arXiv.org
📈 Citations: 1
Influential: 0
📄 PDF

career value

225K/year
🤖 AI Summary
This work addresses a theoretical gap in level set teleportation (LST) for accelerating gradient descent (GD). We establish, for the first time, a rigorous connection between maximizing the norm of the gradient on level sets and convergence rate. For Hessian-stable convex functions, we propose a projection-gradient–type teleportation subroutine that requires only Hessian-vector products—avoiding explicit Hessian storage or inversion. We prove it achieves hybrid sublinear/linear convergence, strictly faster than standard GD. Our analysis extends theoretically to nonconvex settings. Empirically, integrating the teleportation oracle into GD consistently outperforms both standard GD and truncated Newton methods across diverse machine learning tasks, validating both its acceleration capability and practical efficacy.

Technology Category

Application Category

📝 Abstract
We study level set teleportation, an optimization sub-routine which seeks to accelerate gradient methods by maximizing the gradient norm on a level-set of the objective function. Since the descent lemma implies that gradient descent (GD) decreases the objective proportional to the squared norm of the gradient, level-set teleportation maximizes this one-step progress guarantee. For convex functions satisfying Hessian stability, we prove that GD with level-set teleportation obtains a combined sub-linear/linear convergence rate which is strictly faster than standard GD when the optimality gap is small. This is in sharp contrast to the standard (strongly) convex setting, where we show level-set teleportation neither improves nor worsens convergence rates. To evaluate teleportation in practice, we develop a projected-gradient-type method requiring only Hessian-vector products. We use this method to show that gradient methods with access to a teleportation oracle uniformly out-perform their standard versions on a variety of learning problems.
Problem

Research questions and friction points this paper is trying to address.

Lack of convergence theory for convex functions in teleportation.
Absence of guarantees for solving the teleportation operator.
Need for empirical evidence showing acceleration via teleportation.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Level set teleportation accelerates gradient descent.
Proven faster convergence for convex functions.
Projected-gradient method uses Hessian-vector products.
🔎 Similar Papers
No similar papers found.
A
Aaron Mishkin
Department of Computer Science, Stanford University
A
A. Bietti
Center for Computational Mathematics, Flatiron Institute
R
Robert M. Gower
Center for Computational Mathematics, Flatiron Institute